
Construction Estimating AI Evaluator | $50/hr | Remote (US & Canada)
Join a cutting-edge benchmark dataset project focused on evaluating AI models for visual document understanding and instruction-following within the Construction Estimating domain. This is an exciting opportunity for industry experts to directly shape the future of AI capabilities in a specialized field.
As a subject matter expert, you will author complex, grounded tasks designed to test AI model performance. Each task must include a clear ground-truth output and an objective evaluation rubric, ensuring rigorous and meaningful benchmarks.
This is a private shortlist opportunity and is not publicly listed on the general job board.