This job post has expired on May 09, 2026. It is likely that the position has already been filled.

Senior Software Engineer – LLM Eval at Turing

posted 3 months ago

turing.com Contractor remote in US Varies 352 views

Senior Software Engineer – LLM Evaluation | Contractor | Remote (US Only)

Join Turing, the world's leading AI research accelerator, as a Senior Software Engineer focused on LLM Evaluation. In this role, you'll help shape the next generation of large language models by creating high-quality datasets, evaluating AI-generated code, and collaborating with frontier AI researchers. This is a flexible contractor engagement (10–40 hrs/week) ideal for experienced engineers from top-tier tech companies or leading academic institutions.

About Turing

Based in San Francisco, Turing partners with frontier AI labs and global enterprises to accelerate AI research and deploy advanced AI systems. Turing's expertise spans software engineering, logical reasoning, STEM, multilinguality, multimodality, and AI agents — helping enterprises transform AI from proof of concept into measurable business impact.

What You'll Do

Curate code examples, build solutions, and correct code across Python, JavaScript (React, Node.js), C/C++, Java, Rust, and Go for AI model training initiatives.
Evaluate and refine AI-generated code across backend and frontend contexts for efficiency, scalability, and reliability.
Collaborate with cross-functional teams to benchmark and enhance AI-driven coding solutions.
Build agents that verify code quality and identify error patterns across full-stack applications.
Analyze software engineering lifecycle stages — from prototyping and architecture design to production, monitoring, and maintenance — and evaluate model capabilities at each step.
Design automated verification mechanisms to validate solutions to software engineering tasks.

Required Skills

3+ years of professional software engineering experience.
Strong full-stack expertise in Python and JavaScript (React, Node.js), with solid backend and frontend capabilities.
Proven experience deploying scalable, production-grade software using modern languages and tools.
Deep understanding of software architecture, design patterns, debugging, and code quality assessment.
Excellent written and verbal communication skills for producing clear, structured evaluation rationales.

Engagement Details

Type: Contractor (no medical/paid leave benefits)
Commitment: Flexible — minimum 10 hrs/week, up to 40 hrs/week
Duration: 1 month, with potential extensions based on performance
Location: Must be based in the United States

Application Process

The application takes approximately 15–30 minutes and includes completion of an AI video interview. We look forward to learning about your experience!

Apply on Turing Go back

Show all jobs of Turing

Senior Software Engineer – LLM Eval at Turing

Related Jobs

Turing

TBD remote in US

Turing

TBD remote in US

Turing

Varies Remote

Turing

Varies remote

Turing

Varies Remote (ex-US)

Turing

TBD Remote (ex-US)

Turing

Varies Remote (ex-US)

Turing

Varies Remote (ex-US)

Turing

Varies Remote (Non-US)

Turing

Varies Remote (non-US)

Turing

TBD remote in US

Turing

TBD remote