
Senior Software Engineer – LLM Evaluation | Contractor | 10-40 hrs/week | US/Canada/Western Europe
Join Turing, the world's leading research accelerator for frontier AI labs based in San Francisco, as we advance the future of large language models through cutting-edge evaluation and dataset creation.
About the Role
As a Software Engineering Evaluator, you'll create high-quality datasets for training and benchmarking large language models. You'll curate code examples, provide precise solutions, and evaluate AI-generated code across multiple programming languages including Python, JavaScript/ReactJS, C/C++, Java, Rust, and Go.
Key Responsibilities
Required Qualifications
Engagement Details
Application Process