Publication:2908837
From MaRDI portal
DOI10.4230/LIPIcs.FSTTCS.2010.65zbMath1245.91004MaRDI QIDQ2908837
Publication date: 29 August 2012
Full work available at URL: http://subs.emis.de/LIPIcs/frontdoor_7f00.html
multi-armed bandit; two-player zero-sum game; memoryless deterministic strategy; one-player zero-sum game
91A05: 2-person games
91A80: Applications of game theory
60J10: Markov chains (discrete-time Markov processes on discrete state spaces)
91A15: Stochastic games, stochastic differential games