
Software Engineer - Code Reasoning & Evaluation | $85–125/hr | Worldwide Remote
Mercor is seeking elite software engineers to support a leading AI lab in advancing code understanding and reasoning capabilities for next-generation machine learning models. This high-impact 24-hour sprint involves analyzing production-grade repositories, creating technically challenging coding questions, and evaluating how advanced AI systems reason about architecture and data flow.
About the Role
You'll engage in real-world engineering work: systematically exploring large codebases, connecting related functions across multiple files, and assessing AI model reasoning. Your evidence-based analysis—citing specific files, functions, and line numbers—will directly influence how these models learn to think like expert engineers.
You're a Great Fit If You:
Example Projects & Domains
You may work across diverse systems including web APIs, backend services, CLI tools, data pipelines, frontend applications, DevOps tooling, security, observability, and performance-critical architectures.
Engagement Details
About Mercor
Mercor connects elite technical talent with leading AI research labs. Based in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Upload resume → AI interview → Complete form → Submit application
Upload your best, up-to-date resume in English. Mercor will extract details and fill out your profile automatically. Review and adjust as needed.
SHOCKING FACT: Only ~20% of applicants complete their application! Take the 15-minute AI interview about your experience and you'll have MUCH HIGHER chances of getting hired!
AI Interview Tips: The interview focuses on your resume and work experience. Be ready to discuss specific projects and how you solved challenges.
Takes about 15 minutes | Dramatically improves your chances