Software Engineer – AI Research & Evaluation | Contractor | Remote (US-based)
Join Turing, the world's leading AI research accelerator, as a Software Engineering Evaluator. In this role, you'll help shape the future of large language models by creating high-quality training datasets, evaluating AI-generated code, and collaborating with researchers to push the boundaries of frontier AI. This is a flexible contractor engagement ideal for experienced engineers who thrive in fast-paced, high-impact environments.
About Turing
Based in San Francisco, Turing partners with leading AI labs and global enterprises to accelerate AI research and deploy advanced AI systems at scale. Our work spans high-quality data generation, advanced training pipelines, and enterprise AI transformation — turning proof-of-concept into measurable business impact.
What You'll Do
- Curate code examples, build solutions, and correct code for AI model training — primarily in Python, with additional work in JavaScript (ReactJS), C/C++, Java, Rust, and Go.
- Evaluate and refine AI-generated code for efficiency, scalability, and reliability.
- Build agents and automated verification tools in Python to assess code quality and identify error patterns.
- Collaborate with cross-functional teams to benchmark AI-driven coding solutions against industry standards.
- Hypothesize on software engineering lifecycle stages — from prototyping and architecture design to production, monitoring, and maintenance — and evaluate model capabilities across them.
- Design verification mechanisms to automatically validate solutions to software engineering tasks.
Required Skills
- 3+ years of professional software engineering experience.
- Strong expertise in Python, including frameworks, tooling, and production-grade best practices.
- Experience building full-stack applications and deploying scalable software with modern tools.
- Deep understanding of software architecture, design, debugging, and code quality review.
- Excellent written and verbal communication skills for structured, clear evaluation rationales.
Ideal Background
This role is a great fit for engineers with experience at high-growth companies like Stripe, Airbnb, Cloudflare, Datadog, or Coinbase. Graduates from top engineering programs are welcome, though exceptional skill and experience always take precedence over pedigree.
Engagement Details
- Type: Contractor (no medical/paid leave benefits)
- Commitment: Flexible — minimum 10 hrs/week, up to 40 hrs/week
- Duration: 1 month, with potential extensions based on performance
- Location: Must be based in the United States
Application Process
The application takes approximately 15–30 minutes and includes an AI video interview. Apply today to contribute to cutting-edge AI research at one of the most innovative companies in the space.