
Senior Software Engineer – LLM Evaluation | Contractor | Remote (US Only)
Turing is seeking an experienced Senior Software Engineer to evaluate and improve large language model outputs, with a focus on code quality, software architecture, and AI-driven development tools. This is a flexible contractor engagement (10–40 hrs/week) ideal for engineers who thrive in fast-paced, high-impact environments.
About Turing
Based in San Francisco, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing helps frontier research teams build high-quality training data and supports enterprises in transforming AI from proof of concept into reliable, measurable intelligence.
Role Overview
As a Software Engineering Evaluator, you will create cutting-edge datasets used to train, benchmark, and advance large language models. You'll curate code examples, provide precise solutions, and refine AI-generated code, working primarily in Python, with additional work in JavaScript (ReactJS), C/C++, Java, Rust, and Go.
What You'll Do
Required Skills
Engagement Details
Application Process
The application takes approximately 15–30 minutes and includes an AI video interview. Apply today to contribute to the future of AI development at one of the most innovative companies in the space.