
large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models
Bio
Michal is the Founding Researcher at Isara Labs, tenured researcher at Inria, and a lecturer at MVA at ENS Paris-Saclay. Michal is primarily interested in designing algorithms that would require as little human supervision as possible. He works on methods and settings that are able to deal with minimal feedback, such as deep reinforcement learning, bandit algorithms, self-supervised learning, or self play. Michal has recently worked on representation learning, world models and deep (reinforcement) learning algorithms that have some theoretical underpinning. In the past he has also worked on sequential algorithms with structured decisions where exploiting the structure leads to provably faster learning. Michal is now working on a new generation of large language models (LLMs), in addition to providing algorithmic solutions for their scalable test-time inference, fine-tuning and alignment. He received his PhD in 2011 from the University of Pittsburgh, before getting a tenure at Inria in 2012 and co-creating Google DeepMind Paris with R. Munos. In 2024, he became a Principal Llama Scientist at Meta, building online reinforcement learning stack and research for Llama 3. In 2025, he joined Isara Labs as a founding researcher.
Current Students
Past Students and Postdocs
Contact
Paris, France
40 avenue Halley
59650 Villeneuve d'Ascq, France
+33 3 59 57 78 01
4, avenue des Sciences
91190 Gif-sur-Yvette, France




































