Links & Groups
Links to leading researchers and research groups working on multi-armed bandit algorithms, along with related web resources.
Web Links
Leading Researchers
Tor Lattimore - DeepMind - Theoretical foundations; co-author of the Bandit Algorithms textbook
Csaba Szepesvári - DeepMind & University of Alberta - Theory and algorithms; co-author of the Bandit Algorithms textbook
Sébastien Bubeck - OpenAI (formerly Microsoft Research) - Theoretical advances
Benjamin Van Roy - Stanford University - Thompson Sampling and posterior sampling
Dylan Foster - Microsoft Research - Contextual bandits and online learning
Daniel Hsu - Columbia University - Statistical learning and contextual bandits
Alekh Agarwal - Microsoft Research - Contextual bandits at scale
Stuart Russell - UC Berkeley - Applications to AI safety and decision making
Research Teams
SIERRA Team - INRIA & ENS Paris - Sequential learning and RL
SCOOL Team - INRIA Lille - Statistical online learning
DeepMind - London, UK - Large-scale applications and theory
MSR ML Group - Microsoft Research - Contextual bandits and Vowpal Wabbit
BAIR - UC Berkeley - Deep RL and bandits
Stanford AI Lab - Stanford University - Theory and applications