Dorian Baudry
Dorian Baudry
Home
Publications
CV
Contact
Light
Dark
Automatic
Source Themes
Sub-Sampling Algorithms for Efficient Non-Parametric Bandit Exploration
In this paper we propose the first multi-armed bandit algorithm based on re-sampling that achieves asymptotically optimal regret simultaneously for different families of arms (namely Bernoulli, Gaussian and Poisson distributions). Unlike Thompson …
«
Cite
×