Benture logo
next job →
Turing logo

Senior Software Engineer – LLM Eval at Turing

posted 1 hour ago
turing.com Contractor remote in US Varies 33 views

Senior Software Engineer – LLM Evaluation | Contractor | Remote (US Only)

Join Turing, the world's leading AI research accelerator, as a Senior Software Engineer focused on LLM Evaluation. In this role, you'll help shape the future of large language models by building high-quality training datasets, evaluating AI-generated code, and collaborating with top researchers and engineers. This is a flexible contractor engagement ideal for experienced engineers from high-scale tech organizations.

About Turing

Headquartered in San Francisco, Turing partners with frontier AI labs and global enterprises to accelerate AI research and deploy advanced AI systems. Turing's expertise spans software engineering, logical reasoning, STEM, multilinguality, multimodality, and AI agents.

What You'll Do

  • Curate code examples, build precise solutions, and correct code to support AI model training — primarily in Python, with additional work in JavaScript (ReactJS), C/C++, Java, Rust, and Go.
  • Evaluate and refine AI-generated code for efficiency, scalability, and reliability.
  • Build agents and automated verification tools in Python to assess code quality and identify error patterns.
  • Design verification mechanisms to automatically validate solutions to software engineering tasks.
  • Collaborate cross-functionally to benchmark and enhance AI-driven coding solutions.
  • Analyze and hypothesize across the full software engineering lifecycle — from prototyping and architecture design to production, monitoring, and maintenance.

Required Skills

  • 3+ years of professional software engineering experience.
  • Strong expertise in Python, including frameworks, tooling, and production-grade best practices.
  • Experience building full-stack applications and deploying scalable software.
  • Deep understanding of software architecture, design patterns, debugging, and code quality assessment.
  • Excellent written and verbal communication skills for structured evaluation rationales.

Engagement Details

  • Type: Contractor (no medical/paid leave benefits)
  • Commitment: Flexible — minimum 10 hrs/week, up to 40 hrs/week
  • Duration: 1 month, with potential extensions based on performance
  • Location: Must be based in the United States

Application Process

The application takes approximately 15–30 minutes and includes an AI video interview. We welcome graduates from top CS programs (Stanford, MIT, CMU, UC Berkeley, Georgia Tech, etc.), though exceptional experience always takes precedence over academic background.

Go back

Related Jobs

Benture logo
See All Jobs