An asymptotically optimal policy for finite support models in the multiarmed bandit problem

From MaRDI portal
Publication:415624

DOI: 10.1007/s10994-011-5257-4; zbMath: 1237.91037; arXiv: 0905.2776; OpenAlex: W2131958277; Wikidata: Q56675674; Scholia: Q56675674; MaRDI QID: Q415624

Junya Honda, Akimichi Takemura

Publication date: 8 May 2012

Published in: Machine Learning

Full work available at URL: https://arxiv.org/abs/0905.2776




