This job post expired on May 09, 2026. The position has likely been filled.

Software Engineer – AI Evaluation at Turing

posted 3 months ago

turing.com | Contractor | Remote (US) | Varies

Software Engineer – AI Research & Evaluation | Contractor | Remote (US-based) | 10–40 hrs/week

Turing is seeking experienced software engineers to evaluate and shape the next generation of AI models. In this role, you'll create high-quality datasets, assess AI-generated code, and collaborate with researchers to advance large language model capabilities — all from a flexible, remote contractor engagement.

About Turing

Based in San Francisco, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing accelerates frontier research through high-quality data, advanced training pipelines, and top-tier AI researchers specializing in software engineering, logical reasoning, STEM, multilinguality, multimodality, and agents.

What You'll Do

Curate code examples, build solutions, and correct code to support AI model training — primarily in Python, with additional work in JavaScript (ReactJS), C/C++, Java, Rust, and Go.
Evaluate and refine AI-generated code for efficiency, scalability, and reliability.
Build agents and automated verification tools in Python to assess code quality and identify error patterns.
Collaborate with cross-functional teams to benchmark and enhance AI-driven coding solutions.
Form hypotheses about each stage of the software engineering lifecycle, from prototyping and architecture design to production, monitoring, and maintenance, and evaluate model capabilities across those stages.
Design verification mechanisms to automatically validate solutions to software engineering tasks.

Required Skills

3+ years of professional software engineering experience.
Strong expertise in Python, including frameworks, tooling, and production-grade best practices.
Experience building full-stack applications and deploying scalable software with modern languages and tools.
Deep understanding of software architecture, design, debugging, and code quality assessment.
Excellent written and verbal communication skills for producing clear, structured evaluation rationales.

Ideal Background

This role is well-suited for engineers with experience at high-scale organizations such as Google, Microsoft, Apple, Amazon, or Meta, or graduates from top CS programs including Stanford, MIT, Carnegie Mellon, UC Berkeley, or Georgia Tech. Exceptional skill and experience always take precedence over pedigree.

Engagement Details

Type: Contractor (no medical/paid leave benefits)
Commitment: Flexible — minimum 10 hrs/week, up to 40 hrs/week
Duration: 1 month, with potential extensions based on performance
Location: Must be based in the United States

Application Process

The application takes approximately 15–30 minutes and includes completion of an AI video interview.
