Good arm identification via bandit feedback
DOI10.1007/S10994-019-05784-4zbMATH Open1491.68160arXiv1710.06360OpenAlexW2962902250WikidataQ128264264 ScholiaQ128264264MaRDI QIDQ2425222FDOQ2425222
Authors: Hideaki Kano, Junya Honda, Kentaro Sakamaki, Kentaro Matsuura, Atsuyoshi Nakamura, Masashi Sugiyama
Publication date: 26 June 2019
Published in: Machine Learning (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1710.06360
Recommendations
- On the complexity of best-arm identification in multi-armed bandit models
- Lower bounds on the sample complexity of exploration in the multi-armed bandit problem.
- Pure exploration in infinitely-armed bandit models with fixed-confidence
- The sample complexity of exploration in the multi-armed bandit problem
- Multi-armed bandits with simple arms
Learning and adaptive systems in artificial intelligence (68T05) Stopping times; optimal stopping problems; gambling theory (60G40)
Cites Work
- On the complexity of best-arm identification in multi-armed bandit models
- Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems
- Asymptotically efficient adaptive allocation rules
- Finite-time analysis of the multiarmed bandit problem
- Kullback-Leibler upper confidence bounds for optimal sequential allocation
- Reinforcement learning. An introduction
- A procedure for selecting a subset of size m containing the l best of k independent normal populations, with applications to simulation
Cited In (7)
- On the complexity of best-arm identification in multi-armed bandit models
- Pure exploration in infinitely-armed bandit models with fixed-confidence
- Best arm identification for contaminated bandits
- Best arm identification in generalized linear bandits
- Secure best arm identification in multi-armed bandits
- A bad arm existence checking problem: how to utilize asymmetric problem structure?
- A PAC algorithm in relative precision for bandit problem with costly sampling
This page was built for publication: Good arm identification via bandit feedback
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2425222)