This job post expired on May 20, 2026. The position has likely been filled.

AI Research Evaluator (QA Expert) at Turing

posted 2 months ago

turing.com | Contractor | Remote | Varies

AI Research Evaluator (QA) | Varies | Worldwide Remote | Contractor

Turing is seeking experienced Ph.D., Postdoctoral, or Master's-level professionals to evaluate AI-generated deep research reports. This remote freelance role sits at the intersection of content quality assurance and cutting-edge AI development, helping improve Large Language Models (LLMs) through structured, expert evaluation.

About Turing

Based in San Francisco, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing helps accelerate frontier research and supports enterprises in transforming AI from proof of concept into proprietary intelligence.

What You'll Do

Evaluate AI-generated deep research reports across multiple quality dimensions using a structured rubric, providing numerical ratings and written justifications.
Review and annotate video captions for contextual accuracy, grammatical correctness, and alignment with content guidelines.
Identify inconsistencies and areas for improvement, providing detailed, evidence-based feedback.
Review other annotators' work and offer constructive, actionable feedback.
Perform fact-checking and research to validate accuracy, particularly for non-fiction and technical content.
Collaborate with cross-functional teams to maintain high-quality standards in annotation and content accuracy.

Requirements

Strong command of written English, including familiarity with literature, scripts, and plays.
Excellent close-reading skills with the discipline to follow rubrics precisely.
Ability to write concise, evidence-based evaluations (~50 words per dimension).
Comfortable with structured data entry, including scoring in 0.5-point increments and averaging sub-dimension scores.
Self-motivated and able to work independently in a remote environment.
Reliable desktop or laptop setup with a strong internet connection.
Availability for a 4-hour overlap with the Pacific Time (PT) zone.
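
To illustrate the scoring mechanics mentioned above (0.5-increment scoring and sub-dimension averaging), here is a minimal Python sketch. The posting does not describe the actual rubric, so the 0 to 5 scale, the function names, and the choice to leave the average unrounded are all assumptions made purely for illustration.

```python
from statistics import mean

# Assumed scale bounds; the posting does not state the real range.
MIN_SCORE = 0.0
MAX_SCORE = 5.0


def validate_score(score):
    """Return the score as a float if it is in range and a multiple of 0.5."""
    if not (MIN_SCORE <= score <= MAX_SCORE):
        raise ValueError(f"score {score} is outside {MIN_SCORE}-{MAX_SCORE}")
    # A multiple of 0.5 becomes a whole number when doubled.
    if (score * 2) % 1 != 0:
        raise ValueError(f"score {score} is not a 0.5 increment")
    return float(score)


def dimension_score(sub_scores):
    """Average the validated sub-dimension scores for one dimension."""
    validated = [validate_score(s) for s in sub_scores]
    return mean(validated)


# Hypothetical sub-dimension ratings for a single quality dimension.
print(dimension_score([3.5, 4.0, 4.5]))
```

Whether a real rubric rounds the averaged value back to the nearest 0.5 is not stated in the posting, so this sketch leaves the average as computed.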

Ideal Backgrounds

Managing Editor, Copy Chief, or Content/Quality Editor
Senior Fact-Checker or Research Editor (non-fiction)
LQA or Content QA Lead, Academic Grader, or Teaching Assistant
Script/Story Analyst, Copy Editor, Book Reviewer, or Beta Reader
Journalist, Research Assistant, or a background in Creative Writing, English, or Comparative Literature

Engagement Details

Type: Contractor/Freelancer (potential for full-time)
Duration: 1 week per project, with the possibility of extension
Schedule: 4-hour overlap with Pacific Time required

Perks

Work on cutting-edge AI projects with leading LLM companies.
Competitive compensation (varies by project).
Potential for contract extension based on performance.
Fully remote work environment.

Note: Turing does not request confidential, proprietary, or trade secret information from any employer, university, or client. All work must comply with applicable NDAs and employment agreements.
