QA Specialist – Audio Annotation & Diarization (Arabic) | Contractor | Fully Remote | 15-Week Engagement
Turing is seeking a detail-oriented QA Specialist with native Arabic proficiency to serve as the final quality checkpoint on an evaluation-grade, multilingual audio annotation dataset. You'll review multi-channel recordings, transcriptions, and diarization outputs to ensure they meet rigorous accuracy standards for frontier AI research.
About Turing
Based in San Francisco, Turing is the world's leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing delivers high-quality training data, advanced pipelines, and top AI researchers specializing in coding, reasoning, STEM, multilinguality, and more.
Role Overview
This project involves building a highly accurate dataset of transcribed, multi-channel audio recordings used to evaluate multilingual, multi-speaker AI systems. Contributors record unscripted group conversations, which are then transcribed and diarized before human validation. As a QA Specialist, you are the final line of defense — reviewing both audio quality and human-verified annotations end to end.
Key Responsibilities
- Audio Quality Assurance: Evaluate multi-channel recordings for technical fidelity, verify channel isolation (no audio bleed), and confirm recordings are free from background noise, clipping, or low gain.
- Transcription & Diarization Verification: Review human-validated transcripts to ensure exceptionally high accuracy and low Word Error Rate (WER). Confirm natural speech dynamics — overlaps, interruptions, and false starts — are accurately captured.
- Timestamp & Speaker Validation: Validate turn-level and word-level timestamps and speaker identification, including complex overlapping dialogue. Comfortable reading and validating JSON-formatted data for metadata and timestamp logic.
- Metadata & Content Review: Verify demographic markers, contextual domains, and conversational tags. Audit sessions to ensure no PII, toxic, or sensitive content is present.
- Reporting: Assign clear pass/fail or agree/disagree statuses and provide detailed, actionable feedback when disagreeing with annotator work.
Requirements
- Native proficiency in Arabic
- Exceptional ear for audio fidelity — able to detect subtle noise, channel bleed, or clipping
- Meticulous attention to detail for word-level timestamps and verbatim transcription
- Ability to assess complex multi-speaker conversational dynamics
Ideal Backgrounds
- Linguists / Phonetics Experts: Deep understanding of natural, unnormalized speech patterns and complex conversational annotation
- Language Teachers: Native-level mastery with strict adherence to verbatim transcription guidelines
- Professional Transcriptionists: Rigorous timestamping skills and experience meeting strict accuracy targets
Engagement Details
- Commitment: Minimum 4 hours/day, 40 hours/week, with 4 hours of overlap with PST
- Type: Contractor/Freelancer (no medical or paid leave benefits)
- Duration: 15 weeks
Perks
- Fully remote work environment
- Opportunity to contribute to cutting-edge AI research with leading LLM companies
- Potential for contract extension based on performance