
Senior Software Engineer – LLM Evaluation | Contractor | Remote (US Only)
Join Turing, the world's leading AI research accelerator, and play a key role in shaping the next generation of large language models. This contractor role focuses on evaluating and improving AI-generated code across a wide range of languages and systems-level domains — ideal for experienced engineers who thrive in fast-paced, high-impact environments.
About Turing
Headquartered in San Francisco, Turing partners with frontier AI labs and global enterprises to accelerate AI research and deploy reliable, production-grade AI systems. Our team specializes in software engineering, logical reasoning, STEM, multilinguality, multimodality, and AI agents.
Role Overview
As a Software Engineering Evaluator, you will create high-quality datasets used to train, benchmark, and advance large language models. You'll curate code examples, develop precise solutions, and evaluate AI-generated code for correctness, performance, and scalability — with a strong emphasis on systems-level and infrastructure code.
Key Responsibilities
Required Skills
Engagement Details
Application Process
The application takes approximately 15–30 minutes and includes an AI video interview. We welcome graduates from top CS programs (Stanford, MIT, CMU, UC Berkeley, Georgia Tech, etc.), though exceptional experience always takes precedence over pedigree.