Benture logo

This job post has expired on November 26, 2025. It is likely that the position has already been filled.

Mercor logo

AI Evaluation Generalist at Mercor

posted 5 months ago
mercor.com Contractor remote 30-150/hr 974 views

AI Evaluation Generalist | $30–150/hr | Worldwide Remote

Mercor is seeking versatile and detail-oriented professionals to collaborate on cutting-edge AI evaluation projects. As a Mercor Generalist, you'll help test, refine, and improve how advanced AI models understand and reason about real-world workflows across diverse domains.

Key Responsibilities

  • Evaluate AI-generated outputs for accuracy, clarity, and alignment with real-world reasoning
  • Contribute written assessments and structured feedback on model performance
  • Identify conceptual, logical, and stylistic strengths and weaknesses in AI responses
  • Collaborate asynchronously with research and operations teams to maintain quality and consistency
  • Apply strong analytical judgment and precise communication across all evaluations

Ideal Qualifications

  • Strong English fluency and excellent written communication skills
  • Sharp attention to detail with ability to identify subtle errors or inconsistencies
  • Analytical and critical thinking skills across a wide range of topics
  • No formal educational background required — curiosity, clarity, and reasoning skill are key

Timeline & Commitment

  • Start Date: Rolling (immediate opportunities available)
  • Duration: Varies by project (typically 1–3 months)
  • Hours: Flexible, part-time (~10–20 hours/week, with potential to increase)
  • Schedule: Fully remote and asynchronous

Compensation

  • Competitive pay ranging from $30–150 USD/hour, depending on domain expertise and task complexity
  • Independent contractor arrangement
  • Daily payments via Stripe Connect

Application Process

  1. Submit your resume or professional summary
  2. Complete a short, unpaid qualifying assessment introducing core task formats
  3. High performers will be considered for ongoing paid opportunities across AI evaluation and research projects

About Mercor

Mercor is a San Francisco-based company connecting exceptional professionals to frontier AI research and evaluation projects. Backed by Benchmark, General Catalyst, Adam D'Angelo, Larry Summers, and Jack Dorsey, we've partnered with thousands of professionals to help shape the next generation of intelligent systems.

Join a growing network of professionals working at the intersection of human expertise and artificial intelligence.

How to apply for this role
  • Upload your resume — keep it up-to-date and in English. Mercor will auto-fill your profile from it.
  • Complete the AI interview — a 15-minute conversation about your experience. Be ready to discuss specific projects and challenges you've solved.
  • Submit your application — only about 20% of applicants finish all the steps, so completing yours puts you well ahead.
Benture is an independent job board and is not affiliated with Mercor.

Related Jobs

Benture logo
See All Jobs