This job post has expired on December 08, 2025. It is likely that the position has already been filled.

Software Engineer - Code Reasoning & Evaluation | $85–125/hr | Worldwide Remote
Mercor is seeking elite software engineers to support a leading AI lab in advancing code understanding and reasoning capabilities for next-generation machine learning models. This high-impact 24-hour sprint involves analyzing production-grade repositories, creating technically challenging coding questions, and evaluating how advanced AI systems reason about architecture and data flow.
About the Role
You'll engage in real-world engineering work: systematically exploring large codebases, connecting related functions across multiple files, and assessing AI model reasoning. Your evidence-based analysis—citing specific files, functions, and line numbers—will directly influence how these models learn to think like expert engineers.
You're a Great Fit If You:
Example Projects & Domains
You may work across diverse systems including web APIs, backend services, CLI tools, data pipelines, frontend applications, DevOps tooling, security, observability, and performance-critical architectures.
Engagement Details
About Mercor
Mercor connects elite technical talent with leading AI research labs. Based in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.