
Computational Chemistry AI Evaluator | $70–100/hr | Worldwide Remote
Join a cutting-edge project building large-scale evaluation benchmarks for advanced AI reasoning in scientific and engineering domains. As a task designer, you'll create graduate-level computational problems that challenge AI systems to use real scientific software tools — from running simulations and interpreting outputs to designing experimental strategies and recovering hidden information from data.
This is not a typical annotation or labeling role. You'll be crafting original, research-grade problems grounded in real scientific workflows, calibrating them against frontier AI models, and iterating until the difficulty is precisely right.
We're seeking experts with deep, hands-on experience using PySCF for quantum chemistry calculations, including Hartree-Fock, DFT, TDDFT, CASSCF, and post-HF methods. Ideal candidates can design problems around excited-state analysis, orbital diagnostics, method selection for complex electronic structures, and interpreting computational artifacts from method limitations. Experience with other specialized software in this domain is also considered.
Strong candidates will think like puzzle designers — constructing problems where difficulty stems from reasoning strategy, not brute computation, and where surface-level pattern matching won't suffice.