Ballooning multi-armed bandits
From MaRDI portal
Publication:2238588
Recommendations
Cites work
- 10.1162/153244303321897663
- A Tutorial on Thompson Sampling
- A minimax and asymptotically optimal algorithm for stochastic bandits
- A quality assuring, cost optimal multi-armed bandit mechanism for expertsourcing
- Arm-acquiring bandits
- Asymptotically efficient adaptive allocation rules
- Bandit algorithms
- Bandit problems with infinitely many arms
- Finite-time analysis of the multiarmed bandit problem
- Inequalities on the Lambert \(W\) function and hyperpower function
- Introduction to multi-armed bandits
- Learning and incentives in user-generated content: multi-armed bandits with endogenous arms
- Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges
- On the Distribution of the Number of Successes in Independent Trials
- On the Lambert \(w\) function
- Regret analysis of stochastic and nonstochastic multi-armed bandit problems
- Regret bounds and minimax policies under partial monitoring
- Regret bounds for sleeping experts and bandits
- Thompson sampling: an asymptotically optimal finite-time analysis
- UCB revisited: improved regret bounds for the stochastic multi-armed bandit problem
This page was built for publication: Ballooning multi-armed bandits
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q2238588)