
Senior Python Engineer – LLM Eval at Turing

posted 1 hour ago
turing.com | Contractor | Remote in US | Varies

Senior Python Engineer – LLM Evaluation | Contractor | Remote (US Only)

Join Turing, the world's leading AI research accelerator, as a Senior Python Engineer focused on evaluating and improving large language models. This flexible contractor role involves curating high-quality datasets, assessing AI-generated code, and collaborating with researchers to push the frontier of AI capabilities.

About Turing

Based in San Francisco, Turing partners with frontier AI labs and global enterprises to accelerate AI research and deploy reliable, high-impact AI systems. Our team specializes in software engineering, logical reasoning, STEM, multilinguality, multimodality, and AI agents.

What You'll Do

  • Curate code examples, build solutions, and correct AI-generated code — primarily in Python, with additional work in JavaScript (ReactJS), C/C++, Java, Rust, and Go.
  • Evaluate and refine AI-generated code for efficiency, scalability, and reliability.
  • Build agents and automated verification tools in Python to assess code quality and identify error patterns.
  • Collaborate with cross-functional teams to benchmark AI-driven coding solutions against industry standards.
  • Form hypotheses about each stage of the software engineering lifecycle, from prototyping and architecture to production, monitoring, and maintenance, and evaluate model capabilities across those stages.
  • Design verification mechanisms to automatically validate solutions to software engineering tasks.

Required Skills

  • 3+ years of professional software engineering experience.
  • Strong expertise in Python, including frameworks, tooling, and production-grade best practices.
  • Experience building full-stack applications and deploying scalable software.
  • Deep understanding of software architecture, design, debugging, and code quality review.
  • Excellent written and verbal communication skills, particularly for writing structured evaluation rationales.

Engagement Details

  • Type: Contractor (no medical/paid leave benefits)
  • Commitment: Flexible — minimum 10 hrs/week, up to 40 hrs/week
  • Duration: 1 month, with potential extensions based on performance
  • Location: Must be based in the United States

Application Process

The application takes approximately 15–30 minutes and includes an AI video interview. We welcome candidates from top engineering backgrounds and leading academic institutions, though exceptional skill and experience always take precedence.
