
Senior Software Engineer – LLM Evaluation | Contractor | Remote (US Only)
Join Turing, the world's leading AI research accelerator, and help shape the future of large language models. In this role, you'll create high-quality datasets, evaluate AI-generated code, and collaborate with frontier AI researchers, all on a flexible contractor basis with a minimum commitment of 10 hours per week.
About Turing
Based in San Francisco, Turing partners with frontier AI labs and global enterprises to accelerate AI research and deploy reliable, high-impact AI systems. Our expertise spans software engineering, logical reasoning, STEM, multilinguality, multimodality, and autonomous agents.
Role Overview
As a Software Engineering Evaluator, you will curate and refine code datasets used to train and benchmark large language models. Your work will directly influence the quality and capability of next-generation AI systems, with a strong focus on systems-level programming, performance-critical applications, and infrastructure.
What You'll Do
Required Skills
Ideal Background
This role is a strong fit for engineers with experience at frontier AI or technology organizations such as OpenAI, NVIDIA, Databricks, Palantir, or Snowflake. Graduates from top-tier programs are welcome, though exceptional skill and experience always take precedence.
Engagement Details
Application Process
The application takes approximately 15–30 minutes and includes an AI video interview. Apply today to contribute to frontier AI research.