
Senior Software Engineer – LLM Evaluation | Contractor | Remote (US-based)
Turing is seeking an experienced Senior Software Engineer to evaluate and improve large language models through high-quality dataset curation, code assessment, and AI-driven solution refinement. This flexible contractor role is ideal for engineers who have worked at the frontier of AI and want to directly shape the future of intelligent systems.
About Turing
Based in San Francisco, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI. Turing accelerates frontier research with high-quality data, advanced training pipelines, and top AI researchers — and helps enterprises transform AI from proof of concept into measurable, lasting business impact.
Role Overview
As a Software Engineering Evaluator, you will create cutting-edge datasets used to train, benchmark, and advance large language models. You'll curate code examples, provide precise solutions, and refine AI-generated code — with a primary focus on Python, alongside JavaScript (ReactJS), C/C++, Java, Rust, and Go.
What You'll Do
Required Skills
Ideal Background
This role is a strong fit for engineers with experience at frontier AI organizations such as OpenAI, NVIDIA, Databricks, Palantir, or Snowflake. Graduates from programs with strong CS foundations — including UW, UIUC, UT Austin, University of Michigan, and Purdue — are especially encouraged to apply, though exceptional skill and experience always take precedence.
Engagement Details
Application Process
The application takes approximately 15–30 minutes and includes an AI video interview. Apply today to contribute to the cutting edge of AI development.