Volume 51, October 2015Modélisation Aléatoire et Statistique - Journées MAS 2014
|Page(s)||246 - 271|
|Published online||12 October 2015|
Online Learning and Game Theory. A Quick Overview with recent results and applications*
1 Aix-Marseille University, CNRS and
EHESS, Centre de la vieille
Charité, 2 rue de la Charité, 13002
2 EDF R&D, Clamart, France & GREGHEC (HEC Paris, CNRS), Jouy-en-Josas, France
3 INRIA Grenoble Rhône-Alpes, LIG laboratoire d’Informatique de Grenoble, MESCAL
4 Laboratoire de Probabiliés et Modèles Aléatoires, Université Paris Diderot, Paris, France & Equipe Sierra, INRIA - ENS Paris, Paris, France
We study one of the main concept of online learning and sequential decision problem known as regret minimization. We investigate three different frameworks, whether data are generated accordingly to some i.i.d. process, or when no assumption whatsoever are made on their generation and, finally, when they are the consequences of some sequential interactions between players.
The overall objective is to provide a comprehensive introduction to this domain. In each of these main setups, we define and analyze classical algorithms and we analyze their performances. Finally, we also show that some concepts of equilibria that emerged in game theory are learnable by players using online learning schemes while some other concepts are not learnable.
© EDP Sciences, SMAI 2015
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.