STEM Generalist Evaluator

Remote · USA Full-time

Role Description

Mercor is seeking PhD holders, doctoral candidates, and exceptional Master’s graduates in biology, physics, chemistry, or related disciplines to join a high-impact AI research initiative in partnership with a leading AI lab. This role involves evaluating and enhancing large language models (LLMs) by applying deep subject-matter expertise in graduate-level science to rigorously benchmark model performance.

  • Evaluate the accuracy, scientific depth, and domain relevance of LLM-generated answers across the listed domains.
  • Review outputs spanning advanced topics in molecular biology, genetics, classical mechanics, quantum physics, physical chemistry, computer science, engineering, and humanities.
  • Identify factual inaccuracies, logical flaws, and reasoning gaps.
  • Work independently and asynchronously using provided tools.

Qualifications

  • PhD (or PhD candidate) in Biology, Physics, Chemistry, Engineering, Computer Science, Mathematics, or related STEM field.
  • Strong familiarity with graduate-level science and research problem solving.
  • Excellent written communication and attention to detail.
  • Comfortable working independently and remotely.
  • Basic Python knowledge preferred but not required.

Requirements

  • Part-time (20 hours/week).
  • Remote and asynchronous.

Compensation

  • Contractor position via Mercor.
  • $35–$60/hour based on expertise.
  • Weekly payments via Stripe Connect.