Go Back

RL Research Scientist (Evaluator)

Remote

$50–$75/hr

Contract

About Contrario:
Contrario is a leading data provider building high-quality human-in-the-loop datasets for foundational AI labs and robotics companies.
We’re backed by Y Combinator, Nexus Venture Partners, senior researchers from OpenAI and DeepMind, and former partners at Hudson River Trading, alongside Stanford affiliates.

Job Description:
Design evaluation protocols, sanity-check benchmarks, and label subtle RL failure cases.

Required Qualifications:

  • PhD-level ML/RL expertise.

  • Strong understanding of reward models.

  • Critical evaluation mindset.

Preferred Qualifications:

  • Published RL research.

  • Benchmark design experience.

Apply Now