Michal Valko : Research

Michal Valko, machine learning scientist DeepMind, Inria, and a lecturer at MVA/ENS PS.

  deep reinforcement learning, Monte-Carlo tree search, representation learning, active learning, graphs, bandit theory


News: new Six papers accepted to ICML 2021, including the long talk on UCBMQ!
News: new KeRNS algorithm for non-stationary kernel RL accepted to AISTATS 2021!
News: new Three papers on reinforcement learning theory accepted to ALT 2021!
News: new After 4 years, the full Spectral Bandits paper with the lower bound and the comprehensive set of experiment is now online.
News: new I will be giving 3 online talks on BYOL in Novemebr and December 2020..
News: new Congrats to Julien Seznec for defending his thesis on Dec 15th, 2020!
News: new Congrats to Pierre Perrault for defending his thesis on Nov 30th, 2020!
News: new The Graphs in ML MVA course will start on January 5th, 2021 and will be taught by Daniele Calandriello.
News: new Five papers accepted to NeurIPS 2020 including two oral talks for BYOL and DISCO and 1 spotlight!.
News: new I am serving as an area chair for ICLR 2021.
News: very hot news Yannic Kilcher made a youtube video about our BYOL work!
News: very hot news Three months of lockdown lead to our three months intense self-supervised learning :-). BYOL is out!.
News: Eight papers accepted to ICML 2020. "See" you in Vienna!
News: Covariance-adapting semi-bandits paper accepted to COLT 2020. "See" you in Graz!
News: Congrats to Guillaume Gautier for defending his thesis on May 19th, 2020!
News: I am serving as an area chair for NeurIPS 2020.
News: Four papers accepted to AISTATS 2020. "See" you in Palermo!
News: I am giving an invited course on reinforcement learning at Math of Machine Learning Winter School during February 19-22th, 2020 in Sochi, Russia.

older news


Michal is a machine learning scientist in DeepMind Paris, SequeL team at Inria, and the lecturer of the master course Graphs in Machine Learning at l'ENS Paris-Saclay. Michal is primarily interested in designing algorithms that would require as little human supervision as possible. This means 1) reducing the “intelligence” that humans need to input into the system and 2) minimizing the data that humans need to spend inspecting, classifying, or “tuning” the algorithms. Another important feature of machine learning algorithms should be the ability to adapt to changing environments. That is why he is working in domains that are able to deal with minimal feedback, such as online learning, bandit algorithms, semi-supervised learning, and anomaly detection. Most recently he has worked on sequential algorithms with structured decisions where exploiting the structure leads to provably faster learning. Structured learning requires more time and space resources and therefore the most recent work of Michal includes efficient approximations such as graph and matrix sketching with learning guarantees. In past, the common thread of Michal's work has been adaptive graph-based learning and its application to real-world applications such as recommender systems, medical error detection, and face recognition. His industrial collaborators include Adobe, Intel, Technicolor, and Microsoft Research. He received his Ph.D. in 2011 from the University of Pittsburgh under the supervision of Miloš Hauskrecht and after was a postdoc of Rémi Munos before taking a permanent position at Inria in 2012.

Collaborative Projects

previous projects

Students and postdocs

  • David Cheikhi, 2020 - 2021, Columbia Universitu, NYC/École Polytechnique, Paris, with Pierre Ménard
  • Robert Müller, 2020, Technical University of Munich, M2 student, with Pierre Ménard
  • Ahmed Choukarah, 2020, ENS Ulm, L3 student, with Pierre Ménard
  • Côme Fiegel, 2019, ENS Ulm, L3 student, with Victor Gabillon
  • Axel Elaldi, 2017-2018, master student, École Centrale de Lille ↝ ENS Paris-Saclay/MVA
  • Xuedong Shang, 2017, master student, ENS Rennes, with Emilie Kaufmann ↝ Inria
  • Guillaume Gautier, 2016, master student, École Normale Supérieure, Paris-Saclay, with Rémi Bardenet ↝ Inria/CNRS
  • Andrea Locatelli, 2015-2016, ENSAM/ENS Paris-Saclay, with Alexandra Carpentier ↝ Universität Potsdam
  • Souhail Toumdi, 2015 - 2016, master student, École Centrale de Lille, with Rémi Bardenet ↝ ENS Paris-Saclay/MVA
  • Akram Erraqabi, 2015, master student, École Polytechnique, Paris ↝ Université de Montréal
  • Mastane Achab, 2015, master student, École Polytechnique, Paris, with G. Neu ↝ l'ENS Paris-Saclay ↝ Télécom ParisTech
  • Jean-Bastien Grill, 2014, master student, École Normale Supérieure, Paris, with Rémi Munos ↝ Inria
  • Alexandre Dubus, 2012-2013, master student, Université Lille1 - Sciences et Technologies ↝ Inria
  • Karim Jedda, 2012-2013, master student, École Centrale de Lille ↝ ProSiebenSat.1
  • Alexis Wehrli, 2012-2013, master student, École Centrale de Lille ↝ ERDF


  • DeepMind Paris (bureau: FR-PAR-14L-6-623E)
  • 14 Rue de Londres
  • 75009 Paris
  • Inria Lille - Nord Europe, equipe SequeL (bureau: A05)
  • Parc Scientifique de la Haute Borne
  • 40 avenue Halley
  • 59650 Villeneuve d'Ascq, France
  • office phone: +33 3 59 57 7801
  • CMLA, ENS Paris-Saclay (bureau: vacataires)
  • 61 avenue du président Wilson
  • 40 avenue Halley
  • 94235 Cachan cedex