Michal Valko : Research
My Google Scholar profile, ArXiv profile, and HAL profile.

preprints

  • Yunhao Tang, Taco Cohen, David W. Zhang, Michal Valko, Rémi Munos: RL-finetuning LLMs from on- and off-policy data with a single algorithm, arXiv preprint
  • Chaoqi Wang, Zhuokai Zhao, Chen Zhu, Karthik Abinav Sankararaman, Michal Valko, Xuefei Cao, Zhaorun Chen, Madian Khabsa, Yuxin Chen, Hao Ma, Sinong Wang: Preference optimization with multi-sample comparisons, arXiv preprint
  • Antoine Scheid, Étienne Boursier, Alain Durmus, Michael I Jordan, Pierre Ménard, Éric Moulines, Michal Valko: Optimal design for reward modeling in RLHF, arXiv preprint
  • Pierre Perrault, Denis Belomestny, Pierre Ménard, Éric Moulines, Alexey Naumov, Daniil Tiapkin, Michal Valko: A new bound on the cumulant generating function of Dirichlet processes, arXiv preprint
  • Yunhao Tang, Daniel Zhaohan Guo, Zeyu Zheng, Daniele Calandriello, Yuan Cao, Eugene Tarassov, Rémi Munos, Bernardo Ávila Pires, Michal Valko, Yong Cheng, Will Dabney: Understanding the performance gap between online and offline alignment algorithms, arXiv preprint
  • Denis Belomestny, Pierre Ménard, Alexey Naumov, Daniil Tiapkin, Michal Valko: Sharp deviations bounds for Dirichlet weighted sums with application to analysis of Bayesian algorithms, arXiv preprint
  • Tadashi Kozuno, Wenhao Yang, Nino Vieillard, Toshinori Kitamura, Yunhao Tang, Jincheng Mei, Pierre Ménard, Mohammad Gheshlaghi Azar, Michal Valko, Rémi Munos, Olivier Pietquin, Matthieu Geist, Csaba Szepesvári: KL-entropy-regularized RL with a generative model is minimax optimal, arXiv preprint

2025

  • Côme Fiegel, Pierre Ménard, Tadashi Kozuno, Michal Valko, Vianney Perchet: The Harder Path: Last iterate convergence for uncoupled Learning in zero-sum games with bandit feedback, in International Conference on Machine Learning (ICML 2025)

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2008

2007

2006

2005

  • Michal Valko, Nuno C. Marques, Marco Castelani: Evolutionary Feature Selection for Spiking Neural Network Pattern Classifiers in Proceedings of Portuguese Conference on Artificial Intelligence (EPIA 2005), eds. Bento et al., IEEE, pages 24-32. bibtex
  • Michal Valko Evolving Neural Networks for Statistical Decision Theory, Comenius University, Bratislava, 2005 (master thesis) (2005) Advisor: Radoslav Harman bibtex talk

older preprints

  • Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Alaa Saade, Shantanu Thakoor, Bilal Piot, Bernardo Ávila Pires, Michal Valko, Thomas Mesnard, Tor Lattimore, Rémi Munos: Geometric entropic exploration, arXiv preprint
  • Pierre Perrault, Jennifer Healey, Zheng Wen, Michal Valko Michal Valko: On the approximation relationship between optimizing ratio of submodular (RS) and difference of submodular (DS) functions, arXiv preprint
  • Branislav Kveton, Zheng Wen, Azin Ashkan, Michal Valko: Learning to Act Greedily: Polymatroid Semi-Bandits, accepted for publication to Journal of Machine Learning Research (JMLR) bibtex arXiv preprint

Presentations

  • Michal Valko: Graph-Based Anomaly Detection with Soft Harmonic Functions: Presented at CS Department Research Competition (Research 2011) [#1st place] talk also at (Grad Expo 2011) and (CS DAY 2011) poster
  • Branislav Kveton, Michal Valko, Matthai Philiposse: Real-Time Adaptive Face Recognition, Presented at 23rd Neural Information Processing Systems conference (NeurIPS 2009), Video: Adaptation, Video: OfficeSpace, poster #1, poster #2
  • Michal Valko:, Branislav Kveton, Matthai Philiposse: Robust Face Recognition Using Online Learning, Presented at 9th University of Pittsburgh Science conference (SCIENCE 2009)Grad Expo 2010) talk and (CS Day 2010) poster
  • Michal Valko: Conditional anomaly detection with adaptive similarity metric: Presented at CS Department Research Competition (Research 2008) [#1st place] talk
  • Michal Valko, Milos Hauskrecht, G. Cooper, S. Visweswaran, M. Saul, A. Seybert, J. Harrison, A. Post: Conditional Anomaly Detection, Presented at (CS Day 2008) [#1st by people, #2nd by faculty] also at University of Pittsburgh, Arts & Sciences (Grad Expo 2008) poster

References

mv