Benture logo
next job →

Data Scientist - AI Evaluation & Analysis at Mercor

posted 7 hours ago
mercor.com Contractor remote $100/hour 11 views

Data Scientist - AI Evaluation & Analysis | $100–120/hr | Remote Worldwide

We're seeking a data-driven analyst to conduct comprehensive failure analysis on AI agent performance across finance-sector tasks. You'll identify patterns, root causes, and systemic issues in our evaluation framework by analyzing task performance across multiple dimensions.

Key Responsibilities:

  • Statistical Failure Analysis: Identify patterns in AI agent failures across task components including prompts, rubrics, templates, file types, and tags
  • Root Cause Analysis: Determine whether failures stem from task design, rubric clarity, file complexity, or agent limitations
  • Dimension Analysis: Analyze performance variations across finance sub-domains, file types, and task categories
  • Reporting & Visualization: Create dashboards and reports highlighting failure clusters, edge cases, and improvement opportunities
  • Quality Framework: Recommend improvements to task design, rubric structure, and evaluation criteria based on statistical findings
  • Stakeholder Communication: Present insights to data labeling experts and technical teams

Required Qualifications:

  • Statistical Expertise: Strong foundation in statistical analysis, hypothesis testing, and pattern recognition
  • Programming: Proficiency in Python (pandas, scipy, matplotlib/seaborn) or R for data analysis
  • Data Analysis: Experience with exploratory data analysis and creating actionable insights from complex datasets
  • AI/ML Familiarity: Understanding of LLM evaluation methods and quality metrics
  • Tools: Comfortable working with Excel, data visualization tools (Tableau/Looker), and SQL

Preferred Qualifications:

  • Experience with AI/ML model evaluation or quality assurance
  • Background in finance or willingness to learn finance domain concepts
  • Experience with multi-dimensional failure analysis
  • Familiarity with benchmark datasets and evaluation frameworks
  • 2-4 years of relevant experience

Tips for Applying to Mercor Jobs from Benture

Increase your chances of success!
1
Four Simple Steps

Upload resumeAI interviewComplete formSubmit application

2
Perfect Your Resume

Upload your best, up-to-date resume in English. Mercor will extract details and fill out your profile automatically. Review and adjust as needed.

3
Complete = Win

SHOCKING FACT: Only ~20% of applicants complete their application! Take the 15-minute AI interview about your experience and you'll have MUCH HIGHER chances of getting hired!

AI Interview Tips: The interview focuses on your resume and work experience. Be ready to discuss specific projects and how you solved challenges.

Takes about 15 minutes | Dramatically improves your chances

Related Jobs

Benture logo
See All Jobs