Benture logo
 ←  next job →
Turing logo

Software Engineer – AI Evaluation at Turing

posted 1 hour ago
turing.com Contractor remote in US Varies 34 views

Software Engineer – AI Research & Evaluation | Contractor | Remote (US-based) | 10–40 hrs/week

Turing is seeking experienced US-based Software Engineers to evaluate and improve AI-generated code as part of cutting-edge research initiatives for frontier AI labs. This flexible contractor role is ideal for engineers with production-level experience who want to shape the future of large language models.

About Turing

Based in San Francisco, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing helps accelerate frontier research with high-quality data, advanced training pipelines, and top AI researchers — and applies that expertise to help enterprises transform AI from proof of concept into proprietary, measurable intelligence.

What You'll Do

  • Curate code examples, build precise solutions, and correct code across the full stack — Python, JavaScript (React, Node.js), C/C++, Java, Rust, and Go.
  • Evaluate and refine AI-generated code for efficiency, scalability, and reliability across backend and frontend contexts.
  • Build agents to verify code quality and identify error patterns in full-stack applications.
  • Design automated verification mechanisms for software engineering tasks.
  • Hypothesize on software engineering lifecycle stages — prototyping, architecture, API design, production, launch, monitoring — and evaluate model capabilities across them.
  • Collaborate with cross-functional teams to benchmark and enhance AI-driven coding solutions.

Required Skills

  • 3+ years of software engineering experience in production environments.
  • Strong full-stack expertise in Python and JavaScript (React, Node.js).
  • Experience deploying scalable, production-grade software with modern tools and languages.
  • Deep understanding of software architecture, design, debugging, and code quality assessment.
  • Excellent written and verbal communication skills for structured evaluation rationales.

Ideal Background

This role is especially well-suited for engineers with experience at high-scale organizations such as Google, Microsoft, Apple, Amazon, or Meta, or graduates from leading engineering programs. Exceptional skill and experience always take precedence over pedigree.

Engagement Details

  • Type: Contractor (no medical/paid leave benefits)
  • Commitment: Flexible — minimum 10 hrs/week, up to 40 hrs/week
  • Duration: 1 month, with potential extensions based on performance
  • Location: Must be based in the United States

Application Process

The application takes approximately 15–30 minutes and includes completion of an AI video interview.

Go back

Related Jobs

Benture logo
See All Jobs