Exploration and exploitation of scratch games
DOI10.1007/S10994-013-5359-2zbMATH Open1273.68298OpenAlexW1973688704MaRDI QIDQ374139FDOQ374139
Authors: Raphaël Féraud, Tanguy Urvoy
Publication date: 22 October 2013
Published in: Machine Learning (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/s10994-013-5359-2
Recommendations
Learning and adaptive systems in artificial intelligence (68T05) Inequalities; stochastic orderings (60E15) Applications of game theory (91A80) Stochastic games, stochastic differential games (91A15) Probabilistic games; gambling (91A60)
Cites Work
- Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
- Prediction, Learning, and Games
- Asymptotically efficient adaptive allocation rules
- The Nonstochastic Multiarmed Bandit Problem
- Finite-time analysis of the multiarmed bandit problem
- Sample mean based index policies by O(log n) regret for the multi-armed bandit problem
- Probability inequalities for the sum in sampling without replacement
Cited In (1)
This page was built for publication: Exploration and exploitation of scratch games
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q374139)