A minimax and asymptotically optimal algorithm for stochastic bandits (Q4645646)
scientific article; zbMATH DE number 6999898
Language | Label | Description | Also known as |
---|---|---|---|
English | A minimax and asymptotically optimal algorithm for stochastic bandits | scientific article; zbMATH DE number 6999898 | |
Statements
10 January 2019
stochastic multi-armed bandits
regret analysis
upper confidence bound (UCB)
minimax optimality
asymptotic optimality
stat.ML
cs.LG
math.ST
stat.TH