
Software Engineer – AI Research & Evaluation | Contractor | Remote (US-based) | 10–40 hrs/week
Turing is seeking experienced Software Engineers to join its AI Research & Evaluation team, helping shape the next generation of large language models by curating high-quality training data, evaluating AI-generated code, and building verification systems for frontier AI labs and global enterprises.
About Turing
Based in San Francisco, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for enterprises deploying advanced AI systems. Turing accelerates frontier research through high-quality data, advanced training pipelines, and top-tier AI researchers — and helps enterprises transform AI from proof of concept into measurable business impact.
Role Overview
As a Software Engineering Evaluator, you will create cutting-edge datasets used to train, benchmark, and advance large language models. You'll collaborate closely with researchers to curate code examples, provide precise solutions, and evaluate AI-generated code across multiple languages and domains — with a strong emphasis on systems-level programming, performance-critical applications, and infrastructure.
What You'll Do
Required Skills
Ideal Background
This role is a great fit for engineers who have shipped high-impact products at fast-moving companies such as Stripe, Airbnb, Cloudflare, Datadog, or Coinbase. Graduates from top CS programs (Stanford, MIT, CMU, UC Berkeley, Georgia Tech, etc.) are encouraged to apply — though exceptional experience always takes precedence over pedigree.
Engagement Details
Application Process
The application takes approximately 15–30 minutes and includes completion of an AI video interview.