An index-based deterministic convergent optimal algorithm for constrained multi-armed bandit problems
From MaRDI portal
Publication:2665165
DOI10.1016/J.AUTOMATICA.2021.109673OpenAlexW3159888357MaRDI QIDQ2665165
Publication date: 18 November 2021
Published in: Automatica (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/2007.14550
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- An asymptotically optimal strategy for constrained multi-armed bandit problems
- Sample average approximation of expected value constrained stochastic programs
- Asymptotically efficient adaptive allocation rules
- The Sample Average Approximation Method for Stochastic Discrete Optimization
- Randomised allocation of treatments in sequential trials
- Simple Bayesian Algorithms for Best-Arm Identification
- Stochastically Constrained Ranking and Selection via SCORE
- Probability Inequalities for Sums of Bounded Random Variables
- Prediction, Learning, and Games
- Some aspects of the sequential design of experiments
- Finite-time analysis of the multiarmed bandit problem
This page was built for publication: An index-based deterministic convergent optimal algorithm for constrained multi-armed bandit problems