Benture logo
 ←  next job →
Turing logo

Software Engineer – AI Evaluation at Turing

posted 1 hour ago
turing.com Contractor remote in US Varies 40 views

Software Engineer – AI Research & Evaluation | Contractor | Remote (US-based) | 10–40 hrs/week

Turing is seeking experienced software engineers to evaluate and improve AI-generated code as part of cutting-edge research initiatives for frontier AI labs. This flexible contractor role involves curating datasets, building solutions, and refining model outputs across the full stack.

About Turing

Based in San Francisco, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing helps accelerate frontier research with high-quality data, advanced training pipelines, and top AI researchers specializing in software engineering, logical reasoning, STEM, multilinguality, multimodality, and agents.

What You'll Do

  • Curate code examples, build solutions, and correct code across Python, JavaScript (React, Node.js), C/C++, Java, Rust, and Go for AI model training initiatives.
  • Evaluate and refine AI-generated code across backend and frontend contexts for efficiency, scalability, and reliability.
  • Collaborate with cross-functional teams to benchmark AI-driven coding solutions against industry performance standards.
  • Build agents that verify code quality and identify error patterns across full-stack applications.
  • Hypothesize on software engineering lifecycle stages — from prototyping and architecture design to production, launch, and monitoring — and evaluate model capabilities accordingly.
  • Design automated verification mechanisms for software engineering task solutions.

Required Skills

  • 3+ years of software engineering experience.
  • Strong expertise in full-stack development using Python and JavaScript (React, Node.js).
  • Experience deploying scalable, production-grade software with modern languages and tools.
  • Deep understanding of software architecture, design, debugging, and code quality review.
  • Excellent written and verbal communication skills for structured evaluation rationales.

Engagement Details

  • Commitment: Flexible — minimum 10 hrs/week, up to 40 hrs/week
  • Type: Contractor (no medical/paid leave benefits)
  • Duration: 1 month, with potential extensions based on performance
  • Location: Must be based in the United States

Evaluation Process

The application takes approximately 15–30 minutes and includes an AI video interview.

Go back

Related Jobs

Benture logo
See All Jobs