Home

I’m an ELLIS PhD student at the Max Planck Institute for Intelligent Systems, advised by Prof. Moritz Hardt. My research focuses broadly on LLM post-training and AI evals.

These days, I am most excited about:

  • synthetic data (problem) generation pipelines for RL.
  • pushing models' capabilities on hard exploration problems (where pass@k is ~0) via curriculum learning techniques. The end goal here would be to approach scientific discovery problems like open math conjectures.
  • agentic training of LLMs for long-horizon tasks with tool use.

Past Work: My most recent project was on synthetically scaling data for training language models with RL to become better at forecasting future events. I also like working on meaningful evaluations and have pushed for shifting the QA evals ecosystem from multiple-choice questions (MCQ) to more open-ended benchmarks.

News

Collaborate?

If you are working on a project where you feel I could be worth collaborating with, or on a challenging problem that might interest me, you can reach out to me here.