Multi-armed bandit with sub-exponential rewards
From MaRDI portal
Publication:2060366
Recommendations
- Optimal exploration-exploitation in a multi-armed bandit problem with non-stationary rewards
- Multi-armed bandits in discrete and continuous time
- Independently Expiring Multiarmed Bandits
- Multi-armed bandit problem revisited
- scientific article; zbMATH DE number 4084786
- The Multi-Armed Bandit With Stochastic Plays
- The Nonstochastic Multiarmed Bandit Problem
Cites work
- An Introduction to Heavy-Tailed and Subexponential Distributions
- Bandit algorithms
- Bandits With Heavy Tail
- Concentration inequalities. A nonasymptotic theory of independence
- Introduction to multi-armed bandits
- Inventory rebalancing and vehicle routing in bike sharing systems
- Learning to optimize via posterior sampling
- Online learning and online convex optimization
- Pricing of reusable resources under ambiguous distributions of demand and service time with emerging applications
- Regret analysis of stochastic and nonstochastic multi-armed bandit problems
- Some aspects of the sequential design of experiments
- The concentration of measure phenomenon
Cited in (3)