Learning the distribution with largest mean: two bandit frameworks
and
ESAIM: Procs, 60 (2017) 114-131
Published online: 14 December 2017
DOI: 10.1051/proc/201760114
