
PhD Rater – STEM & AI Evaluation at Mercor

posted 3 hours ago
mercor.com | Part Time | Remote (US) | $50–$100/hr

PhD Rater – STEM & AI Evaluation | $50–$100/hr | Remote (US)

Mercor is seeking experienced PhD researchers and technical experts to contribute to a frontier AI model evaluation project focused on agentic workflows. You'll design and validate challenging benchmark tasks across data science, machine learning, finance, and coding — helping surface and diagnose reasoning gaps in cutting-edge STEM models.

Key Responsibilities

  • Design challenging, real-world STEM benchmark problems that rigorously test model reasoning and problem-solving capabilities
  • Implement each task within an agentic development environment using Python
  • Analyze model and agent behavior traces to identify and diagnose failure modes beyond surface-level errors
  • Produce reproducible, testable deliverables with clear specifications and documented environments

Core Qualifications

  • Deep expertise in data science, machine learning, finance, and/or Python-based coding
  • Current PhD candidate or recent PhD graduate from a top-20 U.S.-based institution
  • Strong research background in frontier STEM topics
  • Availability for 30+ hours/week, primarily on weekdays
  • Demonstrated technical output such as high-quality open-source contributions, especially in agentic or LLM tooling ecosystems
  • Ability to read and reason about agent behavior traces to diagnose complex failure modes

Nice to Have

  • Familiarity with agentic frameworks and open-source ecosystems such as LangChain, MetaGPT, AutoGen, CrewAI, LlamaIndex, BabyAGI, Dify, and similar tools

About Mercor

Mercor is a talent marketplace connecting top experts with leading AI labs and research organizations. Backed by investors including Benchmark, General Catalyst, Adam D'Angelo, Larry Summers, and Jack Dorsey, Mercor has helped thousands of professionals across law, engineering, research, and creative fields contribute to frontier AI projects shaping the next era of technology.

Benture is an independent job board and is not affiliated with or employed by Mercor.

Tips for Applying to Mercor Jobs from Benture

Increase your chances of success!
1. Four Simple Steps

  • Upload resume
  • AI interview
  • Complete form
  • Submit application

2. Perfect Your Resume

Upload your best, up-to-date resume in English. Mercor will extract details and fill out your profile automatically. Review and adjust as needed.

3. Complete = Win

Only about 20% of applicants complete their application. Take the 15-minute AI interview about your experience and you'll have a much higher chance of getting hired.

AI Interview Tips: The interview focuses on your resume and work experience. Be ready to discuss specific projects and how you solved challenges.

Takes about 15 minutes | Dramatically improves your chances
