Exploration and exploitation of scratch games
From MaRDI portal
Publication:374139
DOI10.1007/s10994-013-5359-2zbMath1273.68298OpenAlexW1973688704MaRDI QIDQ374139
Publication date: 22 October 2013
Published in: Machine Learning (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/s10994-013-5359-2
Inequalities; stochastic orderings (60E15) Learning and adaptive systems in artificial intelligence (68T05) Applications of game theory (91A80) Stochastic games, stochastic differential games (91A15) Probabilistic games; gambling (91A60)
Related Items (1)
Cites Work
- Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
- Asymptotically efficient adaptive allocation rules
- Probability inequalities for the sum in sampling without replacement
- The Nonstochastic Multiarmed Bandit Problem
- Sample mean based index policies by O(log n) regret for the multi-armed bandit problem
- Prediction, Learning, and Games
- Finite-time analysis of the multiarmed bandit problem
This page was built for publication: Exploration and exploitation of scratch games