
Structural & Mechanical Engineering AI Evaluator | $70–100/hr | Worldwide Remote
Join a cutting-edge project building large-scale evaluation benchmarks for advanced AI reasoning in scientific and engineering domains. As a task designer, you'll craft graduate-level computational problems that challenge AI systems to use real scientific software tools — from querying simulations and interpreting outputs to designing experimental strategies and recovering hidden information from data.
This is not a typical annotation or labeling role. You'll be designing original, research-grade problems, calibrating them against frontier AI models, and iterating until the difficulty is precisely right.
Structural & Mechanical Engineering: Working with scikit-fem or similar finite element libraries for beam analysis, elasticity problems, and computational mechanics. Experience with Timoshenko beam theory, mesh convergence studies, or variational formulations is highly valued.
Strong candidates will think like puzzle designers — building problems where difficulty stems from reasoning strategy, not brute computation, and where surface-level pattern matching won't suffice.