Software Engineer – AI Research & Eval at Turing

posted 1 hour ago
turing.com | Contractor | Remote in US | Varies

Software Engineer – AI Research & Evaluation | Contractor | Remote (US-based) | Flexible Hours (10–40 hrs/week)

Turing is seeking experienced Software Engineers to join its AI Research & Evaluation team, helping shape the next generation of large language models by creating high-quality training datasets, evaluating AI-generated code, and collaborating with frontier AI researchers.

About Turing

Based in San Francisco, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing accelerates frontier research through high-quality data, advanced training pipelines, and top-tier AI researchers specializing in software engineering, logical reasoning, STEM, multilinguality, multimodality, and agents.

Role Overview

As a Software Engineering Evaluator, you will curate and refine code examples used to train and benchmark large language models. You'll work across systems-level programming, infrastructure, and backend development — ensuring AI-generated code meets the highest standards of efficiency, scalability, and reliability.

What You'll Do

  • Curate code examples, build solutions, and correct code in Python, C/C++, Rust, Go, Java, and JavaScript (including ReactJS).
  • Evaluate and refine AI-generated code with a focus on systems-level correctness, performance, and reliability.
  • Collaborate with cross-functional teams to benchmark and improve AI-driven coding solutions.
  • Build agents to verify the quality of systems-level and infrastructure code and identify error patterns.
  • Analyze and evaluate model capabilities across the full software engineering lifecycle — from prototyping and architecture design to production, monitoring, and maintenance.
  • Design automated verification mechanisms for software engineering tasks.

Required Skills

  • 3+ years of professional software engineering experience.
  • Strong expertise in systems programming, infrastructure, or backend development using Python, C/C++, Rust, and/or Go.
  • Proven experience building and deploying scalable, production-grade software.
  • Deep understanding of software architecture, design patterns, debugging, and code quality assessment.
  • Excellent written and verbal communication skills for structured, clear evaluation rationales.

Ideal Background

This role is a strong fit for engineers with experience at frontier AI or technology organizations such as OpenAI, NVIDIA, Databricks, Palantir, or Snowflake. Graduates from leading engineering and computer science programs are encouraged to apply — though exceptional skill and experience always take precedence.

Engagement Details

  • Type: Contractor (no medical/paid leave benefits)
  • Commitment: Flexible — minimum 10 hrs/week, up to 40 hrs/week
  • Duration: 1 month, with potential extensions based on performance
  • Location: Must be based in the United States

Application Process

The application takes approximately 15–30 minutes and includes an AI video interview.
