RL Research Scientist (Evaluator)
Remote
$50–$75/hr
Contract
About Contrario:
Contrario is a leading data provider building high-quality human-in-the-loop datasets for foundational AI labs and robotics companies.
We’re backed by Y Combinator, Nexus Venture Partners, senior researchers from OpenAI and DeepMind, former partners at Hudson River Trading, and Stanford affiliates.
Job Description:
Design evaluation protocols, sanity-check benchmarks, and label subtle RL failure cases.
Required Qualifications:
PhD-level ML/RL expertise.
Strong understanding of reward models.
Critical evaluation mindset.
Preferred Qualifications:
Published RL research.
Benchmark design experience.