This job post has expired on February 07, 2026. It is likely that the position has already been filled.

Engineering PhD - AI Model Evaluation | $73.29/hr | Worldwide Remote
Mercor is seeking PhD-level engineers to evaluate and improve conversational AI systems used in engineering contexts. Apply your technical expertise to ensure AI models deliver accurate, rigorous, and clear explanations of complex engineering concepts.
Why This Role Exists
Mercor partners with leading AI teams to enhance the quality and reliability of general-purpose conversational AI systems. In engineering contexts, these systems must demonstrate accurate applied reasoning, quantitative precision, and practical problem-solving. This project focuses on evaluating how models reason about and explain engineering concepts across multiple disciplines.
What You'll Do
Who You Are
Nice-to-Have Specialties
What Success Looks Like
Why Join Mercor
Apply your PhD-level engineering expertise to improve how AI systems reason about and communicate complex technical concepts. This flexible, remote role enables you to contribute directly to the development of reliable, high-quality AI systems used in real-world applications.