Normal bandits of unknown means and variances
Publication:4558474
zbMATH Open: 1471.62441; MaRDI QID: Q4558474; FDO: Q4558474
Authors: Wesley Cowan, Junya Honda, Michael N. Katehakis
Publication date: 22 November 2018
Full work available at URL: http://jmlr.csail.mit.edu/papers/v18/15-154.html
Recommendations
- Optimal adaptive policies for sequential allocation problems
- scientific article; zbMATH DE number 4059270
- Optimal sequential sampling from two populations
- Sample mean based index policies by O(log n) regret for the multi-armed bandit problem
- Adaptive treatment allocation and the multi-armed bandit problem
Cites Work
- Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
- Asymptotically efficient adaptive allocation rules
- On the Gittins index for multiarmed bandits
- Multi-armed bandit allocation indices. With a foreword by Peter Whittle.
- Some aspects of the sequential design of experiments
- Finite-time analysis of the multiarmed bandit problem
- Kullback-Leibler upper confidence bounds for optimal sequential allocation
- UCB revisited: improved regret bounds for the stochastic multi-armed bandit problem
- 10.1162/1532443041827907
- An asymptotically optimal policy for finite support models in the multiarmed bandit problem
- Optimal Adaptive Policies for Markov Decision Processes
- Optimal adaptive policies for sequential allocation problems
- Asymptotic Bayes analysis for the finite-horizon one-armed-bandit problem
- Asymptotically optimal Bayesian sequential change detection and identification rules
- Non-asymptotic analysis of a new bandit algorithm for semi-bounded rewards
- Title not available
- Online Learning of Rested and Restless Bandits
- Multi-armed bandits under general depreciation and commitment
- On large deviations properties of sequential allocation problems
- Title not available
Cited In (4)