Exploration and exploitation of scratch games
From MaRDI portal
(Redirected from Publication:374139)
Recommendations
Cites work
- Asymptotically efficient adaptive allocation rules
- Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
- Finite-time analysis of the multiarmed bandit problem
- Prediction, Learning, and Games
- Probability inequalities for the sum in sampling without replacement
- Sample mean based index policies by O(log n) regret for the multi-armed bandit problem
- The Nonstochastic Multiarmed Bandit Problem
This page was built for publication: Exploration and exploitation of scratch games
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q374139)