
Senior Software Engineer – Ruby (LLM Evaluation) | Contractor | Remote – Select Countries
Join Turing, one of the world's fastest-growing AI companies, to help build LLM evaluation and training datasets for realistic software engineering tasks. This hands-on role blends practical Ruby engineering with cutting-edge AI research, working on verifiable software engineering tasks derived from real public repository histories.
We are constructing LLM evaluation and training datasets using a synthetic, human-in-the-loop approach based on public GitHub repository histories. The goal is to expand dataset coverage across programming languages, difficulty levels, and task types to better train LLMs on real-world software engineering problems.