
Cybersecurity Labeling Expert | $100–150/hr | Worldwide Remote
Join Mercor's AI safety initiative as a Cybersecurity Labeling Expert, where your offensive security expertise directly shapes the classifiers that determine what AI systems will and won't assist with. This is a high-impact contractor role for seasoned security professionals who thrive on nuanced judgment in ambiguous situations.
Review and analyze flagged AI conversations — spanning plain text to code-heavy exchanges — and apply your security knowledge to assess intent and potential harm across four critical domains:
Your ground-truth labels will directly improve the AI safety systems that distinguish legitimate security research from genuine malicious intent — a distinction automated systems consistently struggle to make.
Cyberattacks cause billions in damages annually — ransomware cripples hospitals, data breaches expose millions of individuals. The line between a security researcher and a threat actor often comes down to context, specificity, and intent. Your expert judgment helps ensure AI remains a tool for defenders, not attackers.
You're a strong fit if you have experience in red team consulting, threat intelligence analysis, vulnerability research, or AI safety labeling where nuanced judgment under ambiguity is routine.