Benture logo
next job →

Generalist - English & German at Mercor

posted 12 hours ago
mercor.com Contractor remote: Europe, US $39.78/hr 34 views

AI Evaluation Generalist | $39.78/hr | Remote (Europe & US)

Mercor is seeking bilingual English and German speakers to evaluate and improve conversational AI systems. This flexible contract role involves assessing LLM responses, fact-checking content, and providing high-quality human feedback to enhance AI reliability and user experience.

What You'll Do

  • Evaluate LLM-generated responses for accuracy, clarity, and helpfulness across diverse topics
  • Conduct fact-checking using trusted public sources and external tools
  • Generate high-quality evaluation data by annotating response strengths and identifying areas for improvement
  • Assess reasoning quality, tone, completeness, and alignment with conversational guidelines
  • Apply consistent annotations following detailed evaluation taxonomies and benchmarks
  • Ensure model responses meet expected standards and system requirements

Who You Are

  • Hold a Bachelor's degree
  • Native speaker or C2-level fluency in German
  • Significant experience using large language models (LLMs) and understanding their practical applications
  • Excellent writing skills with ability to articulate nuanced, detailed feedback
  • Strong attention to detail and ability to identify subtle issues
  • Adaptable across topics, domains, and varying project requirements
  • Background in structured analytical thinking (research, policy, analytics, linguistics, engineering)
  • Excellent college-level mathematics skills

Nice-to-Have Qualifications

  • Prior experience with RLHF, model evaluation, or data annotation
  • Experience writing or editing high-quality content
  • Familiarity with evaluation rubrics, benchmarks, or quality scoring systems
  • Experience making fine-grained qualitative judgments between multiple outputs

Why Join Mercor

Work at the frontier of human-in-the-loop AI development and directly shape how advanced language models behave in real-world applications. This flexible, remote contract role offers competitive rates and the opportunity to contribute meaningfully to AI systems used by millions globally.

Contract Type: Full-time or Part-time available
Geography: Restricted to Europe and USA

Benture is an independent job board and is not affiliated with or employed by Mercor.

Tips for Applying to Mercor Jobs from Benture

Increase your chances of success!
1
Four Simple Steps

Upload resumeAI interviewComplete formSubmit application

2
Perfect Your Resume

Upload your best, up-to-date resume in English. Mercor will extract details and fill out your profile automatically. Review and adjust as needed.

3
Complete = Win

SHOCKING FACT: Only ~20% of applicants complete their application! Take the 15-minute AI interview about your experience and you'll have MUCH HIGHER chances of getting hired!

AI Interview Tips: The interview focuses on your resume and work experience. Be ready to discuss specific projects and how you solved challenges.

Takes about 15 minutes | Dramatically improves your chances

Related Jobs

Benture logo
See All Jobs