Michal Valko - Founding Researcher at Isara Labs

, Founding Researcher at Isara Labs, researcher at Inria, and a lecturer at MVA/ENS PS.

large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models

News

News: new Accepted at ICML 2026: Language Generation with Replay: A Learning-Theoretic View of Model Collapse with G. Racca and A. Sanyal!
News: new New preprint on optimal last-iterate convergence in matrix games with bandit feedback: Log-barrier for matrix games with C. Fiegel, P. Ménard, T. Kozuno, and V. Perchet!
News: new Panel at GLOBSEC Forum 2026 in Prague (May 21-23)!
News: new Keynote at the European Commission JRC High-Level Event on Democracy in Brussels!
News: new Invited talk at the Armenia LLM Summer School in Yerevan, Armenia (Aug 3-7)!
News: Talk at the Global AI Show: AI 2030 in Riyadh, Saudi Arabia (June 29-30)!
News: Lecturing at the International Summer School on Generative AI at Sapienza Università di Roma (June 22-26)!
News: Keynote at the Critical Ventures Ecosystem Summit 2026 in Lisbon (June 21-22)!
News: Invited talk at Jarná ITAPA 2026 in Bratislava!
News: Talk at the Orange Business Conference in Bratislava!
News: Online talk for AI Day 2 at ZŠ Valaliky, Slovakia!
News: Lecturing at the Reinforcement Learning Summer School (RLSS) in Milan, Italy!
News: Lecturing at the Advanced Course on Data Science & Machine Learning in Castiglione della Pescaia, Tuscany (June 8-12)!
News: Talk on "How Large Language Models Learn to Reason" at the Giorgi Nikoladze Scientific Research Center Seminar, Kutaisi International University, Georgia!
News: Talk at Rhizome Séminaire in Paris!
News: Talk at the Artificial Intelligence + Quantum Conference in Košice (May 11-12)!
News: Talk at GHOST Day: Applied Machine Learning Conference in Poznań, Poland (May 8-9)!

News: Invited talk at the Slovak Oxford Science Conference 2026 in Payerbach, Austria (April 29 - May 2)!
News: Speaking at IT/AI Night organized by Slovak Global Network at STU Bratislava (~500 attendees)!
News: Invited talk at KInIT (Kempelen Institute, Bratislava) on what is new in our research!
News: Public conversation at KInIT in Bratislava: Where Are Humans Still Better Than AI? Michal Valko on Uncertainty and the Future of Technology!
News: Keynote at AI Day at FMUK Bratislava!
News: Podcast interviews in April: Digitálna Strelka, Veda na dosah, Veda odpovedá!
News: The Wall Street Journal writes about Isara: OpenAI Backs New AI Startup Seeking Bot Army Breakthroughs
News: New preprint on model collapse from a learning-theoretic perspective: Language Generation with Replay with G. Racca and A. Sanyal!
News: BreakingAI News interview, Machines Can Think!
News: Forbes: Who to watch in 2026 — These Slovaks are playing in the world's top league! Forbes article
News: Congrats to Daniil Tiapkin for defending his thesis in December 2025 and joining DeepMind! Daniil Tiapkin, thesis
News: Congrats to Lisa Bedin for defending her thesis in 2025! Lisa Bedin, thesis
News: Congrats to Côme Fiegel for defending his thesis in 2025! Côme Fiegel
News: New PhD student at University of Copenhagen (2025-2028) Giorgio Racca, co-supervised with A. Sanyal!
News: New Master student Andrej Špitalský (2025), co-supervised with D. Tiapkin!
News: Game theory paper accepted to ICML 2025!
News: I became an Ambassador for FR8!
News: Became a Venture Partner at Sparkle Ventures, Formula VC, KAYA VC!
News: Angel investor in Entalpic, Forgent AI!
News: Advising Mistral AI, Fastino, Mercor, HaizeLabs, Atla and many more AI startups!
News: Became an Envoy for ESET Science Award!
News: Advisory Board member for GHOST Day: Applied Machine Learning Conference (Poland 2025, 2026).
News: I served as an area chair for NeurIPS 2024.
News: Teaching at MVA/ENS Paris-Saclay for 2024-2025! Graphs in Machine Learning
News: Paper on online RLHF accepted to NeurIPS 2024!
News: I served as a senior area chair for ICML 2024.
News: I gave an invited talk on LLM alignment at ICML 2024.
News: I co-organized the ICML 2024 workshop on LLM alignment.
News: I became the Founding Researcher at Isara Labs!
News: New postdoc joined the team in 2024! Pierre Ménard
News: I gave an invited talk on reinforcement learning from human feedback at DeepMind Paris
News: I gave a keynote talk on alignment at the European AI Summit 2024.
News: I gave an invited talk on online RLHF at Google Research in 2024.
News: Four LLM alignment papers accepted to ICML 2024!
News: We released Llama 3, check it out! Llama 3 in 2024.
News: I became a Principal Llama Scientist of Meta's GenAI in Paris! Principal Llama Scientist
News: Paper on scalable fine-tuning accepted to AISTATS 2024!
News: IPO paper accepted to IPO, AISTATS 2024!
News: Fast-forward ⏩ alignment research with Nash learning from human feedback!
News: Paper on test-time computation for LLMs accepted to ICLR 2024!
News: One RLHF theory and one exploration bonus paper at ICLR 2024!
News: I served as a senior area chair for NeurIPS 2023.
News: I co-organized the NeurIPS 2023 workshop on representation learning.
News: Another RLHF result, this time includes learning rates for RLHF! RLHF result
News: A new RLHF paper from our group! RLHF paper
News: Paper on representation learning for world models accepted to NeurIPS 2023!
News: Teaching at MVA/ENS Paris-Saclay for 2023-2024! Graphs in Machine Learning
News: An RL paper on learning rate randomization accepted to NeurIPS 2023!
News: I gave a tutorial on reinforcement learning from human feedback at ICML 2023.
News: I served as an area chair for ICML 2023.
News: Paper on self-supervised learning for video accepted to ICCV 2023!
News: Co-organized EEML 2023 in Slovakia!
News: I gave an invited talk on game-theoretic approaches to alignment at Stanford in 2023.
News: I gave an invited talk on world models at MIT in 2023.
News: I gave an invited talk on BYOL and self-supervised learning at Facebook AI Research in 2023.
News: Apply for DeepMind Paris internships for Summer 2023. DeepMind Paris internships
News: BOLD (ANR) project accepted for 2019-2023 (PI: V. Perchet).
News: Paper on efficient exploration in RL accepted to ICML 2023!
News: BIG NEWS: Outstanding paper award at ICML 2023 for our game theory work! ICML 2023, game theory work
News: Nine papers including two orals accepted to ICML 2023!
News: Paper on bandit algorithms with graph structure accepted to AISTATS 2023!
News: Teaching at MVA/ENS Paris-Saclay for 2022-2023! Graphs in Machine Learning
News: BYOL-Hindsight led by Dan Jarret accepted to NeurIPS 2022 DeepRL Workshop!
News: Two papers on RL accepted to NeurIPS 2022!
News: Congrats to Jean Tarbouriech for defending his thesis on July 6th, 2022! Jean Tarbouriech
News: BYOL-Explore that performs representation learning and exploration together is out! BYOL-Explore
News: Apply for DeepMind Paris internships for Summer 2022. DeepMind Paris internships
News: Our paper From Dirichlet to Rubin received a long oral at ICML 2022! (< 2%) From Dirichlet to Rubin
News: Three papers accepted to ICML 2022!
News: Congrats to Omar D. Domingues for defending his thesis on March 18th, 2022! Omar D. Domingues
News: Two RL papers accepted to AISTATS 2022!
News: Joined the editorial boards of TMLR, JMLR, MLJ.
News: BGRL accepted to ICLR 2022!
News: Congrats to Xuedong Shang for defending his thesis on September 29th, 2021! Xuedong Shang
News: Teaching at MVA/ENS Paris-Saclay for 2021-2022! Graphs in Machine Learning
News: Five papers accepted to NeurIPS 2021, including 1 oral and 2 spotlights! NeurIPS 2021
News: BraVe, self-supervised learning framework for video, accepted to ICCV 2021!
News: IXOMD was invited to be presented as RL Theory seminar!
News: Our minimax SSP work was invited to be presented as RL Theory seminar! minimax SSP
News: Six papers accepted to ICML 2021, including the long talk on UCBMQ! ICML 2021
News: I served as an area chair for ICLR 2021.
News: KeRNS algorithm for non-stationary kernel RL accepted to AISTATS 2021!
News: The Graphs in ML MVA course started on January 5th, 2021 and was taught by Daniele Calandriello.
News: Promoted to Senior Staff Research Scientist at DeepMind Paris!
News: Three papers on reinforcement learning theory accepted to ALT 2021!
News: Congrats to Julien Seznec for defending his thesis on December 15th, 2020! Julien Seznec
News: I served as an area chair for NeurIPS 2020.
News: Congrats to Pierre Perrault for defending his thesis on November 30th, 2020! Pierre Perrault
News: After 4 years, the full Spectral Bandits paper with the lower bound is now online. now online
News: Five papers accepted to NeurIPS 2020 including two oral talks for BYOL and DISCO and 1 spotlight! NeurIPS 2020
News: Yannic Kilcher made a YouTube video about our BYOL work! YouTube video
News: Three months of lockdown led to our three months intense self-supervised learning :-). BYOL is out! BYOL is out!
News: I gave 3 online talks on BYOL in November and December 2020. BYOL
News: Congrats to Guillaume Gautier for defending his thesis on May 19th, 2020! Guillaume Gautier
News: Eight papers accepted to ICML 2020. "See" you in Vienna! ICML 2020
News: Covariance-adapting semi-bandits paper accepted to COLT 2020. "See" you in Graz! COLT 2020
News: I gave an invited course on reinforcement learning at Math of Machine Learning Winter School during February 19th-22nd, 2020 in Sochi, Russia. Math of Machine Learning Winter School
News: Four papers accepted to AISTATS 2020. "See" you in Palermo! AISTATS 2020
News: Joined DeepMind Paris as a Staff Research Scientist!
News: Congrats to Jean-Bastien Grill for defending his thesis in 2019 and joining DeepMind! Jean-Bastien Grill, thesis
News: Congrats to Omar and Guillaume for their NeurIPS 2019 travel awards!
News: I served as an area chair for NeurIPS 2019.
News: I gave an invited talk during October 16th-18th, 2019 at GIF 2019, Yerevan, Armenia. GIF 2019
News: I gave an invited talk during September 26-27, 2019 at Lancaster and DeepMind Bandit Workshop, London, UK. Lancaster and DeepMind Bandit Workshop
News: I gave a talk during September 25-26, 2019, at Recent developments in kernel methods, UCL, London, UK. Recent developments in kernel methods
News: Four papers accepted to NeurIPS 2019. See you in Vancouver and Whistler! NeurIPS 2019
News: I gave an invited talk on July 23rd, 2019 for Cisco in Kraków, Poland. Cisco
News: I gave an invited talk at Yandex HQ on July 5th, 2019 in Moscow, Russia. Yandex
News: I gave an invited talk during July 3rd-8th, 2019 at RAAI Summer School 2019, Moscow Institute of Physics and Technology. RAAI Summer School 2019
News: I was on the program committee for COLT 2019.
News: I gave an invited talk on June 14th-15th, 2019 at ICML workshop on negative dependence.
News: On June 3rd-4th, 2019 we organized The power of graphs workshop. The power of graphs, Laura Toni
News: We organized Reinforcement Learning Summer SCOOL on 1-12 July 2019 in Lille, France. Reinforcement Learning Summer SCOOL
News: We organized Optimizing Human Learning 2019 workshop. Optimizing Human Learning 2019
News: I gave two invited talks on January 7th and 8th, 2019 at Verimag, CNRS Grenoble, France. Verimag
News: I became a reelected member of Inria Evaluation Committee for 2015-2019. Inria Evaluation Committee
News: I gave an invited talk on May 28th, 2019 at Theoretical Computer Science seminar at CU in Bratislava. Theoretical Computer Science seminar
News: Two papers accepted to ICML 2019. See you in Long Beach! ICML 2019
News: A GP-UCB sparsification paper accepted to COLT 2019. See you in Phoenix! COLT 2019
News: I gave an invited talk on February 20th, 2019 at Data Analytics Meetings at UPJŠ in Košice. Data Analytics Meetings
News: Three papers accepted to AISTATS 2019. See you in Okinawa! AISTATS 2019
News: I gave an invited talk on January 25th, 2019 at DPMMS, U. Cambridge, UK. DPMMS
News: P. Ménard joined as a postdoc! P. Ménard
News: Two papers on black-box optimization accepted to ALT 2019! ALT 2019
News: I served as an area chair for NeurIPS 2018.
News: Pierre Perrault gave an invited talk on Stochastic multi-arm bandit problem at Lambda seminar, Université de Bordeaux.
News: DPPy: Sampling determinantal point processes with Python released!
News: Starting October 1st, 2018, I taught Graphs in Machine Learning in MVA Master at ENS Paris-Saclay! Graphs in Machine Learning, MVA Master, ENS Paris-Saclay
News: I gave an invited talk on September 10th-13th, 2018 at International Workshop on Optimization and Machine Learning at CIMI, Toulouse. Optimization and Machine Learning
News: Brownian motion optimization accepted to NeurIPS 2018! See you in Montréal! NeurIPS 2018
News: A paper on optimistic optimization accepted to EWRL 2018. EWRL 2018
News: ICML Top 10 Reviewer Award at ICML 2018!
News: A paper on scattering for deep learning accepted to ECCV 2018. ECCV 2018
News: I co-organized CNRS Summer school on Networks, Graphs, and Machine Learning (RESCOM 2018) in Porquerolles, June 18-22, 2018. RESCOM 2018
News: Congrats to Daniele Calandriello for winning the prize for the Best AI Thesis from France in 2018. Inria press release, Inria article, CNRS article
News: We organized Optimizing Human Learning workshop. Optimizing Human Learning
News: I gave an invited talk for Workshop on Graph Learning, LINCS, Paris, on May 14th, 2018. Workshop on Graph Learning
News: A paper on distributed graph sparsification accepted to ICML 2018. See you in Stockholm! ICML 2018
News: A bandit paper on best of both worlds accepted to COLT 2018. See you in Stockholm! COLT 2018
News: I gave an invited talk at Journée Big data, Polytech'Lille on March 22nd, 2018, in Lille. Journée Big data, Polytech'Lille
News: I gave an invited talk for GDR ISIS on February 8th, 2018 at Télécom ParisTech.
News: I gave an invited talk on January 7th, 2018 at MIST 2018.
News: DELTA (EU CHIST-ERA) project accepted for 2018-2022 (Project Coordinator, PI: A. Jonsson). DELTA
News: Received Inria award for scientific excellence for 2018-2021: Prime d'excellence scientifique.
News: Congrats to Daniele Calandriello for defending his thesis on December 18th, 2017! Daniele Calandriello
News: Adobe research highlights our work on online influence maximization presented at NIPS 2017. online influence maximization
News: I gave a talk on November 9th, 2017 at Plateau Inria Euratechnologies.
News: Starting October 2nd, 2017, I taught Graphs in Machine Learning in MVA Master at ENS Paris-Saclay! Graphs in Machine Learning, MVA Master, ENS Paris-Saclay
News: I gave a talk on September 19th, 2017 at DeepMind, London.
News: I gave a talk on September 18th, 2017 at Decision Theory and Network Science, Lancaster, UK (STOR-i 2017) Decision Theory and Network Science
News: Two papers accepted to NIPS 2017. See you in California! NIPS 2017
News: Équipe associée Nord-Européenne accepted to work with A. Carpentier on Adaptive allocation of resources for recommender systems.
News: I gave a talk on July 11th, 2017 at ICML 2017 workshop on Picky Learners. ICML 2017 workshop on Picky Learners
News: Congrats to Guillaume and Daniele for their travel grants to ICML 2017. ICML 2017
News: I gave a talk on June 28th, 2017 for L'Institut de Mathématiques de Toulouse.
News: I gave a talk on June 14th, 2017 at Journées Scientifiques Inria 2017 in Nice, France.
News: Extra-Learn (ANR) project accepted for 2014-2017 (PI: A. Lazaric).
News: Two papers accepted to ICML 2017. See you in Australia! ICML 2017
News: I gave a popularization talk on maximizing influencer detection on social networks at Inria Lille.
News: Congrats to Daniele for receiving travel grants to AISTATS 2017. AISTATS 2017
News: I gave a talk on March 22nd, 2017 at Universität Potsdam at Amazon in Berlin.
News: Spectral Bandits accepted for publication to JMLR. JMLR
News: Two papers accepted to AISTATS 2017. See you in Florida! AISTATS 2017
News: New projects starting: LeLivreScolaire.fr - Sequential Learning for Educational Systems (PI, 2017-2020), Allocate with U. Potsdam (PI, 2017-2019), BoB (ANR, PI: R. Bardenet, 2016-2020).
News: I gave an invited talk on December 21st, 2016 at Textkernel talks series in Amsterdam. Textkernel talks series
News: Congrats to Tomáš Kocák for defending his thesis in 2016! Tomáš Kocák, thesis
News: Starting October 3rd, 2016, I taught Graphs in Machine Learning in MVA Master at ENS Paris-Saclay! Graphs in Machine Learning, MVA Master, ENS Paris-Saclay
News: I gave an invited talk on September 22nd, 2016 at Theoretical Computer Science seminar in Bratislava. Theoretical Computer Science seminar
News: I gave an invited talk on September 15th-19th, 2016 at ITAT 2016 conference at High Tatras, Slovakia. ITAT 2016
News: My habilitation thesis, Bandits on graphs and structures, is now online. Bandits on graphs and structures
News: TrailBlazer paper on sample-efficient Monte-Carlo planning accepted as oral presentation to NeurIPS 2016! NeurIPS 2016
News: I gave an invited talk on June 16th, 2016 at Graph-based Learning and Graph Mining in Lille. Graph-based Learning and Graph Mining
News: On June 15th, 3:30 PM at ENS Paris-Saclay, I defended my HdR thesis on Bandits on Graphs and Structures! Bandits on Graphs and Structures
News: I gave an invited talk on May 13th, 2016 at Network Science Thematic Semester, at ENS Lyon. Network Science Thematic Semester
News: One paper accepted to ICML 2016 and two to UAI 2016. See you in NYC! ICML 2016, UAI 2016
News: Bayesian policy gradient and actor-critic algorithms accepted to JMLR. JMLR
News: Two graph bandit papers accepted to AISTATS 2016! AISTATS 2016
News: I gave an invited talk at Multi-armed Bandit Workshop in Lancaster, UK. Multi-armed Bandit Workshop
News: Starting September 28th, 2015, I taught Graphs in Machine Learning in MVA Master at ENS Paris-Saclay! Graphs in Machine Learning, MVA Master, ENS Paris-Saclay
News: Parallel Optimistic Optimization paper accepted to NIPS 2015! NIPS 2015
News: ICML Reviewer Award at ICML 2015!
News: SequeL hosted ICML 2015 in Lille! ICML 2015
News: Polymatroid Bandits accepted for publication to JMLR. JMLR
News: Received Comenius University Distinguished Alumni award!
News: I was elected to be a member of Inria Evaluation Committee for 2014-2015. Inria Evaluation Committee
News: Two bandit papers accepted to ICML 2015 in Lille, France. ICML 2015
News: MESSI (MaxEnt SSIRL) paper accepted to IJCAI 2015 in Argentina. IJCAI 2015
News: Inria's press interview with N. Vayatis and myself about MVA's Graphs in ML. French, English
News: Intel face recognition advertising video.
News: EduBand, associated team project with Carnegie Mellon, accepted for 2015-2018 (with A. Lazaric and E. Brunskill).
News: Starting in January 2015, I taught Graphs in Machine Learning in MVA Master at ENS Paris-Saclay! Graphs in Machine Learning, MVA Master, ENS Paris-Saclay
News: Three papers on practical bandit settings accepted to NeurIPS 2014! NeurIPS 2014
News: Ford and Intel Mobii project using Face Recognition at Engadget. Engadget
News: Ford prototype using Face Recognition at Intel. Intel
News: We established an Erasmus agreement between École Centrale, Lille and U. Comenius, Bratislava for Computer Science. École Centrale, Lille, U. Comenius, Bratislava
News: I became CR1, an experienced junior scientist, at Inria in 2014.
News: Elected member of Inria Evaluation Committee (CE Inria, 2014-2019).
News: Two Spectral Bandit papers accepted to ICML 2014, AAAI 2014!
News: Bandits attack function optimization paper accepted to CEC 2014. CEC 2014
News: I organized a plenary Inria talk by Jennifer Healey on Transportation Futures.
News: Received Inria award for scientific excellence for 2014-2017: Prime d'excellence scientifique.
News: Recorded talk from ICML 2013 on StoSOO is online. online
News: INTEL/Inria collaboration: Signed INTEL funded project on Internet of Things research.
News: Kernelised contextual bandits paper accepted to UAI 2013, see you in Bellevue, WA! UAI 2013
News: Inria publishes an article about our work at INTEL on Face Recognition. Face Recognition
News: I was on the organizing committee of JFPDA 2013. JFPDA 2013
News: Promoted to tenured experienced scientist at Inria Lille with SequeL team!
News: StoSOO paper accepted to ICML 2013! ICML 2013
News: FG 2013 paper accepted. See you in April in Shanghai! FG 2013
News: I was at ICML, EWRL 2012 in Edinburgh in July 2012.
News: I gave a talk at Large-Scale Online Learning and Decision-Making Workshop in Windsor, UK.
News: Our article was accepted to the JMLR post-proceedings of EWRL 2012. EWRL 2012
News: EWRL 2012 paper accepted for presentation. EWRL 2012
News: An article accepted to JBI 2012. JBI 2012
News: I was the opening speaker at Slovak Oxford Science 2012.
News: An article accepted to International Journal of Dentistry. article
News: I became a junior researcher at Inria Lille - Nord Europe with SequeL team.
News: Joined Inria Lille as a postdoc with Rémi Munos in team SequeL!
News: CompLACS (EU FP7) project accepted for 2011-2015 (PI: J. Shawe-Taylor). CompLACS
News: I submitted the final version of my dissertation on August 18th, 2011. final version
News: ICDM 2011 paper accepted. paper
News: I received my PhD in Machine Learning from University of Pittsburgh.
News: I defended my dissertation on August 1st, 2011. dissertation
News: I gave a talk at Microsoft Research on July 6th, 2011.
News: I was at ICML 2011 from June 24th until July 2nd.
News: Compunetix Best Research Award from the CS Department at University of Pittsburgh (also received in 2008)!
News: I won the Computer Science Research Competition 2011. Computer Science Research Competition 2011
News: ICML 2011 - Global paper accepted: Conditional Anomaly Detection Using Soft Harmonic Functions. paper
News: I received the award for Runner-Up for Best Research Poster, Elevator Pitch and Scavenger Hunt Award on CS Day, March 24th, 2011. Runner-Up for Best Research Poster, Elevator Pitch
News: Paper accepted to the Grad Expo Conference, February 7th, 2011. Paper
News: I passed my proposal defense on December 20th, 2010 at University of Pittsburgh.
News: highlight: Homer Warner Best Paper Award at AMIA 2010.
News: AMIA 2010 paper accepted. paper
News: I received Academic Entrepreneurship Certificate. Certificate
News: UAI 2010 paper accepted. paper
News: I was an intern at Intel Research, Santa Clara, CA (Summer 2010).
News: highlight: Google Best Paper Award at OLCV - CVPR 2010. Award
News: University of Pittsburgh Honors Convocation Recognition (2009)!
News: 2008 awards: Andrew Mellon Fellowship, CS Research Competition, and CS Day Best Poster! Mellon Fellowship, research competition, poster winner, poster runner-up
News: Started as Research Assistant on Conditional Anomaly Detection project at University of Pittsburgh (2007-2011).
News: Started PhD in Machine Learning at University of Pittsburgh with Miloš Hauskrecht!
News: Released PlaSyn — Plastic Synapses simulator for spiking neural networks.
News: First publication: Evolutionary Feature Selection for Spiking Neural Network Pattern Classifiers at EPIA 2005.
News: MSc. summa cum laude in Computer Science from Comenius University, Bratislava! thesis
News: European Erasmus Scholarship in Lisbon, Portugal (Spring 2005).
News: Slovak Academy of Sciences Fellowship (2003-2005).
News: Elected Academic Senate Member at Comenius University, Bratislava (2003-2005).
News: Started organizing correspondence math seminars (KMS, STROM, SKMS) — continued until 2005.
News: 9th place at Programming Contest Zenit (national final).
News: First place at Slovak Mathematical Olympiad, regional final (also in 1993 and 1994).

older news

Bio

Michal is the Founding Researcher at Isara Labs, tenured researcher at Inria, and a lecturer at MVA at ENS Paris-Saclay. Michal is primarily interested in designing algorithms that would require as little human supervision as possible. He works on methods and settings that are able to deal with minimal feedback, such as deep reinforcement learning, bandit algorithms, self-supervised learning, or self play. Michal has recently worked on representation learning, world models and deep (reinforcement) learning algorithms that have some theoretical underpinning. In the past he has also worked on sequential algorithms with structured decisions where exploiting the structure leads to provably faster learning. Michal is now working on a new generation of large language models (LLMs), in addition to providing algorithmic solutions for their scalable test-time inference, fine-tuning and alignment. He received his PhD in 2011 from the University of Pittsburgh, before getting a tenure at Inria in 2012 and co-creating Google DeepMind Paris with R. Munos. In 2024, he became a Principal Llama Scientist at Meta, building online reinforcement learning stack and research for Llama 3. In 2025, he joined Isara Labs as a founding researcher.

📋long CV

📄1-page resume

🤝work with me and invitations