All roles

[Remote] Data Engineer | Remote

Remote · USA Full-time New today

Note: The job is a remote job and is open to candidates in USA. CodeGeniusRecruit is a company looking for a Data Engineer to work remotely on a contract basis. The role involves authoring complex benchmark tasks, designing evaluations for language models, and contributing to benchmark datasets for professional document understanding.

Responsibilities

  • Author complex, multi-step benchmark tasks grounded in real-world workspace files including technical specs, architecture docs, API references, and codebases
  • Design tasks that evaluate the ability of language models and systems to reason over technical documentation and follow precise instructions
  • Pair each task with a clearly defined ground truth output and an objective evaluation rubric
  • Incorporate web search and code execution into task design to reflect professional, real-world workflows
  • Contribute to the development of benchmark datasets used to assess LLMs on professional document understanding and instruction following within the Technology domain

Skills

  • Strong experience in software engineering, data science, or analytics in a hands-on professional capacity
  • Strong experience in authoring or evaluating technical documentation, including architecture docs, API references, or codebases
  • Strong experience in designing structured, multi-step tasks with measurable outputs and evaluation criteria
  • Strong experience working independently in a remote, self-directed contract environment

Company Overview

  • CodeGeniusRecruit connects developers with high-quality remote opportunities across AI, machine learning, and software engineering. It was founded in undefined, and is headquartered in , with a workforce of 2-10 employees. Its website is https://www.codegeniusrecruit.com.
  • Apply To This Job

    Related roles

    [Remote] Microsoft Business Development Manager

    Remote · USA Full-time

    [Remote] Software Engineer II - Model Platform

    Remote · USA Full-time

    [Remote] Technical Writer & Editor

    Remote · USA Full-time

    [Remote] Principal Statistical Programmer FSP - RWD/EPI

    Remote · USA Full-time

    [Remote] Staff Software Engineer

    Remote · USA Full-time

    [Remote] IRA Customer Service Specialist II, Trust Services

    Remote · USA Full-time

    [Remote] Game Product Manager

    Remote · USA Full-time

    [Remote] Software Engineer in Test (Mobile)

    Remote · USA Full-time

    [Remote] Data Platform Engineer

    Remote · USA Full-time

    [Remote] PCI QSA Consultant

    Remote · USA Full-time

    Part-Time Online Adjunct Professor- Criminal Justice

    Remote · USA Full-time

    Manager, Workforce & Vendor Management

    Remote · USA Full-time

    Market Specialist - Long Island

    Remote · USA Full-time

    Product Operations Lead

    Remote · USA Full-time

    arenaflex Remote Hybrid Customer Service Representative – Client Support, Issue Resolution, Data Management, and Positive Experience Champion

    Remote · USA Full-time

    Experienced Part-Time Remote Data Entry Specialist – Efficient Data Management for arenaflex

    Remote · USA Full-time

    Senior Data Science Analyst – Part‑Time Remote Position Shaping Recommendation Systems at arenaflex – $24/hr

    Remote · USA Full-time

    Customer Service Representative (Remote)

    Remote · USA Full-time

    Workers' Comp Defense Paralegal

    Remote · USA Full-time

    Internal Audit Associate Director, Americas

    Remote · USA Full-time