An asymptotically optimal policy for finite support models in the multiarmed bandit problem

From MaRDI portal
Revision as of 03:42, 30 January 2024 by Import240129110155 (talk | contribs) (Created automatically from import240129110155)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:415624

DOI10.1007/S10994-011-5257-4zbMath1237.91037arXiv0905.2776OpenAlexW2131958277WikidataQ56675674 ScholiaQ56675674MaRDI QIDQ415624

Junya Honda, Akimichi Takemura

Publication date: 8 May 2012

Published in: Machine Learning (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/0905.2776





Related Items (9)




Cites Work




This page was built for publication: An asymptotically optimal policy for finite support models in the multiarmed bandit problem