This job post has expired on May 09, 2026. It is likely that the position has already been filled.

Senior Python Engineer – LLM Eval at Turing

posted 3 months ago

turing.com Contractor remote in US Varies 284 views

Senior Python Engineer – LLM Evaluation | Contractor | Remote (US Only)

Join Turing, the world's leading AI research accelerator, and help shape the future of large language models. In this role, you'll evaluate and refine AI-generated code, build high-quality training datasets, and collaborate with researchers pushing the boundaries of frontier AI. This is a flexible contractor engagement ideal for experienced engineers with a background in production-grade Python development and AI systems.

About Turing

Based in San Francisco, Turing partners with leading AI labs and global enterprises to accelerate frontier research and deploy advanced AI systems. Turing's expertise spans high-quality data, advanced training pipelines, and top-tier AI researchers specializing in software engineering, logical reasoning, STEM, multilinguality, multimodality, and agents.

What You'll Do

Curate code examples, build solutions, and correct AI-generated code — primarily in Python, with additional work in JavaScript (ReactJS), C/C++, Java, Rust, and Go.
Evaluate and refine AI-generated code for efficiency, scalability, and reliability.
Build agents and automated verification tools in Python to assess code quality and identify error patterns.
Design verification mechanisms to automatically validate software engineering task solutions.
Collaborate with cross-functional teams to benchmark and improve AI-driven coding solutions.
Analyze and hypothesize on software engineering lifecycle stages — from prototyping and architecture design to production, monitoring, and maintenance.

Required Skills

3+ years of professional software engineering experience.
Strong expertise in Python, including frameworks, tooling, and production-grade best practices.
Experience building full-stack applications and deploying scalable software systems.
Deep understanding of software architecture, design, debugging, and code quality assessment.
Excellent written and verbal communication skills for structured evaluation rationales.

Ideal Background

This role is a strong fit for engineers with experience at frontier AI organizations such as OpenAI, NVIDIA, Databricks, Palantir, or Snowflake. Graduates from top CS programs (UW, UIUC, UT Austin, Michigan, Purdue, etc.) are encouraged to apply — though exceptional skill and experience always take precedence.

Engagement Details

Type: Contractor (no medical/paid leave benefits)
Commitment: Flexible — minimum 10 hrs/week, up to 40 hrs/week
Duration: 1 month, with potential extensions based on performance
Location: Must be based in the United States

Application Process

The application takes approximately 15–30 minutes and includes completion of an AI video interview.

Apply on Turing Go back

Show all jobs of Turing

Senior Python Engineer – LLM Eval at Turing

About Turing

What You'll Do

Required Skills

Ideal Background

Engagement Details

Application Process

Related Jobs

Turing

TBD remote in US

Turing

TBD remote in US

Turing

Varies Remote

Turing

Varies remote

Turing

Varies Remote (ex-US)

Turing

TBD Remote (ex-US)

Turing

Varies Remote (ex-US)

Turing

Varies Remote (ex-US)

Turing

Varies Remote (Non-US)

Turing

Varies Remote (non-US)

Turing

TBD remote in US

Turing

TBD remote