Customization of J. Bather's UCB strategy for a Gaussian multiarmed bandit
Publication:2689638
DOI: 10.1134/S00051179220110108
MaRDI QID: Q2689638
Alex V. Kolnogorov, S. V. Garbar
Publication date: 13 March 2023
Published in: Automation and Remote Control
Keywords: dynamic programming; minimax approach; Monte-Carlo simulation; multiarmed bandit problem; invariant description; Gaussian multiarmed bandit; UCB rule
Cites Work
- Batched bandit problems
- Adaptive treatment allocation and the multi-armed bandit problem
- On Bayesian index policies for sequential resource allocation
- Gaussian two-armed bandit and optimization of batch data processing
- Gaussian two-armed bandit: limiting description
- An Asymptotic Minimax Theorem for the Two Armed Bandit Problem
- Sequential medical trials
- DOI: 10.1162/153244303321897663
- Bandit Algorithms
- Prediction, Learning, and Games
- Finite-time analysis of the multiarmed bandit problem