This job post has expired on February 07, 2026. It is likely that the position has already been filled.

Math PhD - AI Model Evaluator at Mercor

posted 6 months ago

mercor.com Contractor remote: US/UK/CA/EU $73.29/hr 505 views

Math PhD - AI Model Evaluator | $73.29/hr | Remote (US, UK, Canada, EU)

Join Mercor in shaping the future of conversational AI by applying your mathematical expertise to evaluate and improve how AI systems reason about complex mathematical problems. This flexible contract role allows you to work remotely while making a meaningful impact on AI reliability and accuracy.

Why This Role Exists

Mercor partners with leading AI teams to enhance the quality and reliability of general-purpose conversational AI systems. In mathematical contexts, these systems must demonstrate precise formal reasoning, mathematical rigor, and conceptual clarity. Your expertise will directly improve how AI models handle mathematical problems, explanations, and proofs across foundational and advanced areas.

What You'll Do

Write and refine prompts to guide AI model behavior in mathematical contexts
Evaluate LLM-generated responses for correctness, rigor, and logical coherence
Verify mathematical claims, derivations, and proofs using your domain expertise
Conduct fact-checking using authoritative sources and domain knowledge
Annotate model responses by identifying strengths and areas for improvement
Assess clarity, structure, and appropriateness of explanations for different audiences
Ensure model responses align with expected conversational behavior and system guidelines
Apply consistent evaluation standards following clear taxonomies and benchmarks

Who You Are

PhD in Mathematics or a closely related field
Demonstrated experience in Probability & Statistics, and ideally one or more of: Algebra & Number Theory, Calculus & Analysis, Geometry & Topology, or Discrete Mathematics, Logic & Computation
Significant experience using large language models (LLMs) and understanding their practical applications
Excellent writing skills with ability to explain complex mathematical concepts clearly
Strong attention to detail and ability to identify subtle issues
Experience reviewing or editing technical or academic writing

Nice-to-Have Specialties

Prior experience with RLHF, model evaluation, or data annotation work
Experience teaching or explaining mathematical concepts to non-expert audiences
Familiarity with evaluation rubrics, benchmarks, or structured review frameworks

What Success Looks Like

You identify inaccuracies or weak reasoning in mathematical model outputs
Your feedback improves the rigor, clarity, and correctness of AI explanations
You deliver consistent, reproducible evaluation artifacts that strengthen model performance
You help build AI systems that users can trust in mathematical contexts

Contract Details

This is a flexible, remote contract position available for full-time or part-time engagement. Fluent English language skills required.

Apply on Mercor Go back

Show all jobs of Mercor

How to apply for this role

Upload your resume — keep it up-to-date and in English. Mercor will auto-fill your profile from it.
Complete the AI interview — a 15-minute conversation about your experience. Be ready to discuss specific projects and challenges you've solved.
Submit your application — only about 20% of applicants finish all the steps, so completing yours puts you well ahead.

Benture is an independent job board and is not affiliated with Mercor.

Math PhD - AI Model Evaluator at Mercor

How to apply for this role

Related Jobs

Mercor

$170/hr remote in US

Mercor

$70/hr remote

Mercor

$180/hr remote in US

Mercor

$150/hr remote in US

Mercor

$120/hr remote in US

Mercor

$50-70/hr remote

Mercor

$210/hr remote

Mercor

$30/hr Remote

Mercor

$38/hr remote

Mercor

$38/hr remote in Germany

Mercor

$38/hr remote

Mercor

$30/hr remote