Combining multiple strategies for multiarmed bandit problems and asymptotic optimality
From MaRDI portal
(Redirected from Publication:892592)
Recommendations
- An asymptotically optimal strategy for constrained multi-armed bandit problems
- Asymptotically optimal multi-armed bandit policies under a cost constraint
- scientific article; zbMATH DE number 4064878
- Asymptotically optimal algorithms for budgeted multiple play bandits
- On nearly selfoptimizing strategies for multiarmed bandit problems with controlled arms
- Combinatorial multi-armed bandit and its extension to probabilistically triggered arms
- Sequential Multi-Hypothesis Testing in Multi-Armed Bandit Problems: An Approach for Asymptotic Optimality
- An asymptotically optimal heuristic for general nonstationary finite-horizon restless multi-armed, multi-action bandits
- Multi-armed bandits in discrete and continuous time
Cites work
- scientific article; zbMATH DE number 3861091 (Why is no real title available?)
- scientific article; zbMATH DE number 3474804 (Why is no real title available?)
- Combining expert advice in reactive environments
- Finite-time analysis of the multiarmed bandit problem
- Online learning methods for networking
- Prediction, Learning, and Games
- Randomised allocation of treatments in sequential trials
- Regret analysis of stochastic and nonstochastic multi-armed bandit problems
- Some aspects of the sequential design of experiments
- The Nonstochastic Multiarmed Bandit Problem
Cited in
(3)
This page was built for publication: Combining multiple strategies for multiarmed bandit problems and asymptotic optimality
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q892592)