Benture logo

This job post has expired on September 01, 2025. It is likely that the position has already been filled.

Mercor logo

Researcher, AI Evaluation at Mercor

posted 8 months ago
mercor.com Full Time San Francisco, CA 180-300k 545 views

Researcher, AI Evaluation

Full-time | Onsite in San Francisco | $180,000-$300,000/year

Mercor is seeking experienced AI researchers to join our industry-leading evaluation team in San Francisco. As a Researcher in AI Evaluation, you will drive the development and benchmarking of large language models and other AI systems that are already shaping the future of work. This is a high-impact, highly visible position for candidates who want to publish, collaborate with top industry labs, and advance the science of AI evaluation at scale.

About Mercor

Mercor is the fastest-growing company in the world, pioneering automated talent evaluation with LLMs and powering hiring decisions for all of the top 5 AI labs. With over $100M in revenue run rate and 59% month-over-month growth, we combine a small, elite team with extraordinary profitability. Our technology automates resume review, interviews, and candidate selection using state-of-the-art language models, and is relied upon by industry leaders.

About the Role

  • Collaborate with top Silicon Valley AI companies and Mercor’s Forward Deployed Research, Applied AI, and Operations teams

  • Build, validate, and publish industry benchmarks for model evaluation, including multimodal, code, and tool-use

  • Design novel data collection, annotation, and evaluation protocols for industry-leading labs

  • Regularly publish evaluation and dataset papers in major AI conferences with support from our team

  • Access and evaluate the most advanced frontier models and datasets in the industry

What We’re Looking For

  • PhD or M.S. with 2+ years of experience in computer science, electrical engineering, econometrics, or another STEM field with ML/model evaluation background

  • Strong publication record in AI research; experience publishing on LLM evaluation or AI dataset/evaluation papers is ideal

  • Expertise in LLMs, data annotation, and benchmarking workflows

  • Excellent communication and data presentation skills

  • Familiarity with statistics and ML evaluation best practices

Compensation & Perks

  • Base salary: $180,000-$300,000/year

  • Generous equity grant

  • $20,000 relocation bonus for moving to the Bay Area

  • $10,000 housing bonus for living within 0.5 miles of our office

  • $1,000/month meal stipend

  • Free Equinox membership

  • Health insurance

Application Process

  • Submit your CV and complete the application form

  • Interviews scheduled promptly upon application review

Mercor considers all qualified applicants without regard to legally protected characteristics and provides reasonable accommodations upon request.

Apply now to work on the frontier of AI evaluation in San Francisco and shape the future of model performance standards.

How to apply for this role
  • Upload your resume — keep it up-to-date and in English. Mercor will auto-fill your profile from it.
  • Complete the AI interview — a 15-minute conversation about your experience. Be ready to discuss specific projects and challenges you've solved.
  • Submit your application — only about 20% of applicants finish all the steps, so completing yours puts you well ahead.
Benture is an independent job board and is not affiliated with Mercor.

Related Jobs

Benture logo
See All Jobs