This job post expired on May 09, 2026. The position has likely been filled.

Senior Python Engineer – LLM Eval at Turing

posted 3 months ago

turing.com | Contractor | Remote (US) | Varies

Senior Python Engineer – LLM Evaluation | Contractor | Remote (US Only)

Join Turing, the world's leading AI research accelerator based in San Francisco, as a Senior Python Engineer focused on LLM Evaluation. In this role, you'll help shape the future of large language models by building high-quality datasets, evaluating AI-generated code, and collaborating with top researchers on frontier AI systems.

About Turing

Turing partners with frontier AI labs and global enterprises to accelerate AI research and deploy advanced AI systems at scale. Our team specializes in software engineering, logical reasoning, STEM, multilinguality, multimodality, and agents — delivering measurable, lasting impact for our clients.

What You'll Do

Curate code examples, build solutions, and correct code across Python, C/C++, Rust, Go, Java, and JavaScript (including ReactJS).
Evaluate and refine AI-generated code with a focus on systems-level correctness, performance, and reliability.
Collaborate with cross-functional teams to benchmark and improve AI-driven coding solutions.
Build agents to verify the quality of systems-level and infrastructure code and identify error patterns.
Analyze stages of the software engineering lifecycle — from prototyping and architecture to production, monitoring, and maintenance — and evaluate model capabilities across them.
Design automated verification mechanisms for software engineering tasks.

Required Skills

3+ years of software engineering experience.
Strong expertise in systems programming, infrastructure, or backend development using Python, C/C++, Rust, and/or Go.
Proven experience building and deploying scalable, production-grade software.
Deep understanding of software architecture, design, debugging, and code quality assessment.
Excellent written and verbal communication skills for structured evaluation rationales.

Engagement Details

Type: Contractor (no medical/paid leave benefits)
Commitment: Flexible — minimum 10 hrs/week, up to 40 hrs/week
Duration: 1 month, with potential extensions based on performance
Location: Must be based in the United States

Application Process

The application takes approximately 15–30 minutes and includes an AI video interview. We welcome candidates from top engineering backgrounds and high-growth companies — though exceptional skill always takes precedence over pedigree.
