Mathematics Specialist | Contractor | Worldwide Remote
Turing is seeking a skilled Mathematics Specialist to design advanced reasoning datasets that evaluate and enhance the capabilities of Large Language Models (LLMs). This is an 8-week remote contractor role requiring 4 hours of daily overlap with PST. Ideal for mathematicians or computer scientists passionate about frontier AI research.
About Turing
Based in San Francisco, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing helps accelerate frontier research with high-quality data, advanced training pipelines, and top AI researchers specializing in coding, reasoning, STEM, multilinguality, multimodality, and agents.
Role Overview
As a Mathematics Specialist (Mathematical Systems & Abstract Reasoning Engineer), you will design novel mathematical systems, define custom rules and axioms, and create rigorous reasoning challenges that push LLMs to interpret definitions, perform symbolic computations, derive properties, and construct logically sound proofs.
Day-to-Day Responsibilities
- Design and define novel mathematical systems, operations, and axiomatic frameworks for reasoning evaluations.
- Author multi-step reasoning tasks involving symbolic manipulation, expression evaluation, and proof-based problem solving.
- Develop deterministic solutions, detailed reasoning traces, and comprehensive scoring rubrics for model assessment.
- Identify and address edge cases, ambiguities, and logical inconsistencies to ensure task robustness and reproducibility.
- Collaborate with reviewers and AI/LLM teams to refine task definitions, evaluation criteria, and dataset quality standards.
Required Qualifications
- Strong foundation in abstract mathematics, formal logic, algebraic structures, discrete mathematics, or theoretical computer science.
- Minimum 2 years of experience in mathematics, computer science, data science, logic, or a related analytical discipline.
- Proven ability to create structured reasoning problems that assess both conceptual understanding and procedural accuracy.
- Excellent written communication skills with the ability to document rigorous definitions, proofs, and reasoning processes.
- Preferred (not required): Experience evaluating LLM reasoning performance or developing AI training and assessment datasets.
Engagement Details
- Type: Contractor / Freelancer (no medical or paid leave benefits)
- Duration: 8 weeks
- Overlap Required: 4 hours of daily overlap with PST
Perks of Freelancing With Turing
- Fully remote work environment.
- Opportunity to contribute to cutting-edge AI projects with leading LLM companies.
- Potential for contract extension based on performance and project needs.