
Government Backoffice — Visual Document Understanding Expert | $50/hr | Remote (US/Canada)
Join an invite-only benchmark dataset project that evaluates AI models on visual document understanding and instruction following in the Government Backoffice domain. Domain experts on this project directly shape how these models are evaluated.
As a contributing expert, you will author complex, grounded tasks designed to rigorously test AI model capabilities. Each task must include a clear ground-truth output and an objective evaluation rubric, so that the resulting benchmark is high-quality and reproducible.
Ideal candidates have hands-on experience with government document processing, backoffice operations, or AI/ML data annotation and evaluation.