This job post has expired on March 20, 2026. It is likely that the position has already been filled.

Bilingual Italian AI Evaluator Expert at Mercor

posted 4 months ago

mercor.com Contractor remote in Italy $25-30/hr 398 views

Bilingual Italian AI Evaluator | $25–30/hr | Remote in Italy

Mercor is seeking native Italian speakers with exceptional writing skills to contribute to a high-impact AI research project with a leading lab. This short-term, flexible opportunity is ideal for professionals who combine language mastery, strong critical thinking, and instructional clarity to help train and evaluate advanced language models.

What You'll Do:

Multilingual Prompt Design & Optimization: Create detailed prompts in Italian and/or English with multiple constraints and instructions, ensuring natural phrasing and real-world relevance for Italian-speaking users.
Define and Document Evaluation Standards: Establish high-level expectations for correct responses in Italian consumer contexts, and develop comprehensive rubrics that account for linguistic nuance, tone, and cultural conventions.
Model Testing and Grading (Bilingual): Run prompts through models and assess preliminary outputs for accuracy, fluency, and cultural fit in Italian, comparing results against English where needed.
Benchmarking & Quality Assurance: Collaborate in QA review processes to ensure prompt tasks and rubrics meet rigor—maintaining consistency and reliability across Italian-language benchmarks before integration into official evaluations.

Minimum Qualifications:

Native-level fluency in Italian (written) with strong reading/writing ability in English
BS or BA from a reputable institution (completed or in progress)
Strong writing and critical thinking skills
Ability to work independently and meet deadlines
Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests
Based in Italy (or able to reliably produce Italy-specific, culturally accurate Italian)

Preferred Qualifications:

Experience in teaching, research, editing, or academic writing
Experience creating evaluation criteria, rubrics, or grading guidelines
Familiarity with LLMs, prompting, or model evaluation (helpful but not required)

Application & Onboarding Process:

Complete an AI-led interview (approximately 15 minutes)
Complete a 45-minute written assessment focused on writing and rubric creation
If selected, you will be invited to work on the project

Project Details:

Expect to contribute at least 20 hours per week
Commitment of approximately 2+ months
Structured project environment with clear goals and tools
We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request

Apply on Mercor Go back

Show all jobs of Mercor

How to apply for this role

Upload your resume — keep it up-to-date and in English. Mercor will auto-fill your profile from it.
Complete the AI interview — a 15-minute conversation about your experience. Be ready to discuss specific projects and challenges you've solved.
Submit your application — only about 20% of applicants finish all the steps, so completing yours puts you well ahead.

Benture is an independent job board and is not affiliated with Mercor.

Bilingual Italian AI Evaluator Expert at Mercor

How to apply for this role

Related Jobs

Mercor

45-70/hr remote in US

Mercor

150-200/h remote in UK

Mercor

150-220/h remote

Mercor

80-150/hr remote

Mercor

80-150/hr remote

Mercor

100-150/h remote

Mercor

130-160/h remote

Mercor

120-170/h remote

Mercor

$50-70/hr remote

Mercor

$40-60/hr remote in US

Mercor

60-80/hr remote in US

Mercor

60-80/hr remote in US