Benture logo
next job →
Turing logo

AI Research Evaluator (QA Expert) at Turing

posted 1 hour ago
turing.com Contractor remote Varies 32 views

AI Research Evaluator (QA) | Varies | Worldwide Remote | Contractor

Turing is seeking experienced Ph.D., Postdoctoral, or Master's-level professionals to evaluate AI-generated deep research reports. This remote freelance role sits at the intersection of content quality assurance and cutting-edge AI development, helping improve Large Language Models (LLMs) through structured, expert evaluation.

About Turing

Based in San Francisco, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing helps accelerate frontier research and supports enterprises in transforming AI from proof of concept into proprietary intelligence.

What You'll Do

  • Evaluate AI-generated deep research reports across multiple quality dimensions using a structured rubric, providing numerical ratings and written justifications.
  • Review and annotate video captions for contextual accuracy, grammatical correctness, and alignment with content guidelines.
  • Identify inconsistencies and areas for improvement, providing detailed, evidence-based feedback.
  • Review other annotators' work and offer constructive, actionable feedback.
  • Perform fact-checking and research to validate accuracy, particularly for non-fiction and technical content.
  • Collaborate with cross-functional teams to maintain high-quality standards in annotation and content accuracy.

Requirements

  • Strong command of English, including literature, scripts, and plays.
  • Excellent close-reading skills with the discipline to follow rubrics precisely.
  • Ability to write concise, evidence-based evaluations (~50 words per dimension).
  • Comfortable with structured data entry, including 0.5 increment scoring and sub-dimension averaging.
  • Self-motivated and able to work independently in a remote environment.
  • Reliable desktop or laptop setup with a strong internet connection.
  • Availability for a 4-hour overlap with the Pacific Time (PT) zone.

Ideal Backgrounds

  • Managing Editor, Copy Chief, or Content/Quality Editor
  • Senior Fact-Checker or Research Editor (non-fiction)
  • LQA or Content QA Lead, Academic Grader, or Teaching Assistant
  • Script/Story Analyst, Copy Editor, Book Reviewer, or Beta Reader
  • Journalism, Research Assistant, or background in Creative Writing, English, or Comparative Literature

Engagement Details

  • Type: Contractor/Freelancer (potential for full-time)
  • Duration: 1 week per project, with possibility of extension
  • Schedule: 4-hour overlap with Pacific Time required

Perks

  • Work on cutting-edge AI projects with leading LLM companies.
  • Competitive compensation (varies by project).
  • Potential for contract extension based on performance.
  • Fully remote work environment.

Note: Turing does not request confidential, proprietary, or trade secret information from any employer, university, or client. All work must comply with applicable NDAs and employment agreements.

Go back

Related Jobs

Benture logo
See All Jobs