
PhD Rater – STEM & AI Evaluation | $50–$100/hr | Remote (US)
Mercor is seeking experienced PhD researchers and technical experts to contribute to a frontier AI model evaluation project focused on agentic workflows. You'll design and validate challenging benchmark tasks across data science, machine learning, finance, and coding — helping surface and diagnose reasoning gaps in cutting-edge STEM models.
Mercor is a talent marketplace connecting top experts with leading AI labs and research organizations. Backed by investors including Benchmark, General Catalyst, Adam D'Angelo, Larry Summers, and Jack Dorsey, Mercor has helped thousands of professionals across law, engineering, research, and creative fields contribute to frontier AI projects shaping the next era of technology.