This job post expired on October 21, 2025. The position has likely been filled.

Technical Reviewer - RL Benchmarking at Mercor

posted 6 months ago
mercor.com | Contractor | Remote in US | $80–100/hr

Technical Reviewer - RL Benchmarking | $80–100/hr | Remote in US

Join a leading AI research lab as a Technical Reviewer, where you'll play a vital role in evaluating and refining benchmarking pipelines for reinforcement learning (RL) environments and agentic AI systems. This is an exciting opportunity to ensure the accuracy, reproducibility, and fairness of experimental benchmarks that shape the future of intelligent agents.

Key Responsibilities:

  • Review RL environments and assess terminal conditions for correctness and consistency.
  • Evaluate benchmarking pipelines for fairness, reproducibility, and research alignment.
  • Provide structured technical feedback on code implementations and documentation.
  • Collaborate with researchers to refine evaluation metrics and methodologies.
  • Validate results across different runs, seeds, and hardware setups to ensure reproducibility.
  • Document findings and recommend improvements for environment design and benchmarking standards.

Ideal Candidate:

  • Background in reinforcement learning, computer science, or applied AI research.
  • Experience with RL environments and benchmarking methodologies.
  • Proficient in Python (PyTorch/TensorFlow a plus).
  • Strong critical thinking and technical feedback skills.
  • Detail-oriented, with a passion for experimental rigor and reproducibility.

Work Structure: Full-time hourly contractor, 40 hours/week, paid weekly via Stripe Connect. Flexible and remote work style.

Shape the standards of agentic AI research and collaborate with top researchers in a fast-moving field. Apply your expertise to advance the reliability and impact of RL benchmarking.

How to apply for this role
  • Upload your resume — keep it up-to-date and in English. Mercor will auto-fill your profile from it.
  • Complete the AI interview — a 15-minute conversation about your experience. Be ready to discuss specific projects and challenges you've solved.
  • Submit your application — only about 20% of applicants finish all the steps, so completing yours puts you well ahead.
Benture is an independent job board and is not affiliated with Mercor.
