
Senior Software Engineer – LLM Evaluation | Contractor | Remote (US Only) | Flexible Hours (10–40 hrs/week)
Turing is seeking a Senior Software Engineer to help shape the future of large language models by building and evaluating high-quality AI training datasets. This is a flexible contractor engagement, ideal for engineers who thrive in fast-paced, high-impact environments.
About Turing
Headquartered in San Francisco, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing accelerates frontier research through high-quality data, advanced training pipelines, and top-tier AI researchers — and applies that expertise to help enterprises transform AI from proof of concept into measurable business impact.
Role Overview
As a Software Engineering Evaluator, you will create cutting-edge datasets used for training, benchmarking, and advancing large language models. You'll work across the full stack, using Python for backend and ML workflows and JavaScript (React, Node.js) for frontend and API layers, along with C/C++, Java, Rust, and Go. You'll evaluate and refine AI-generated code for efficiency, scalability, and reliability, collaborating closely with researchers and cross-functional teams.
What You'll Do
Required Skills
Engagement Details
Application Process
The application takes approximately 15–30 minutes and includes an AI video interview. We look forward to learning more about you!