Senior Backend Engineer (Python/FastAPI) – AI Evaluation | Contractor | Remote (US Only)
Join Turing, the world's leading AI research accelerator, as a Senior Backend Engineer focused on AI model evaluation. In this role, you'll help shape the future of large language models by curating high-quality datasets, evaluating AI-generated code, and collaborating with top researchers. This is a flexible contractor engagement ideal for experienced engineers passionate about AI and software quality.
About Turing
Based in San Francisco, Turing partners with frontier AI labs and global enterprises to accelerate AI research and deploy reliable, high-impact AI systems. Our work spans training pipelines, advanced reasoning, software engineering, and enterprise AI transformation.
What You'll Do
- Curate code examples, build solutions, and correct code for AI model training — primarily in Python, with additional work in JavaScript (ReactJS), C/C++, Java, Rust, and Go.
- Evaluate and refine AI-generated code for efficiency, scalability, and reliability.
- Build agents and automated verification tools in Python to assess code quality and identify error patterns.
- Collaborate with cross-functional teams to benchmark and improve AI-driven coding solutions.
- Hypothesize on software engineering lifecycle stages — from prototyping and architecture design to production, monitoring, and maintenance — and evaluate model capabilities across them.
- Design automated verification mechanisms for software engineering tasks.
Required Skills
- 3+ years of professional software engineering experience.
- Strong expertise in Python with deep knowledge of frameworks, tooling, and production-grade best practices.
- Experience building full-stack applications and deploying scalable software.
- Deep understanding of software architecture, design, debugging, and code quality review.
- Excellent written and verbal communication skills for structured evaluation rationales.
Ideal Background
This role is a great fit for engineers with experience at high-growth companies such as Stripe, Airbnb, Cloudflare, Datadog, or Coinbase. Graduates from top engineering programs are welcome, though exceptional skill and experience always take precedence.
Engagement Details
- Type: Contractor (no medical/paid leave benefits)
- Commitment: Flexible — minimum 10 hrs/week, up to 40 hrs/week
- Duration: 1 month, with potential extensions based on performance
- Location: Must be based in the United States
Evaluation Process
- Application takes approximately 15–30 minutes.
- An AI video interview is required to complete the process.