Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems
From MaRDI portal
Publication:6167036
DOI10.1016/j.csda.2022.107610OpenAlexW4295358291MaRDI QIDQ6167036
No author found.
Publication date: 7 July 2023
Published in: Computational Statistics and Data Analysis (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.csda.2022.107610
reinforcement learningGittins indexmulti-armed bandit problemempirical Gittins indexrewarded Markov process
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges
- Gittins indices in the dynamic allocation problem for diffusion processes
- Dynamic priority allocation via restless bandit marginal productivity indices
- Asymptotically efficient adaptive allocation rules
- Adaptive treatment allocation and the multi-armed bandit problem
- Arm-acquiring bandits
- On the Gittins index for multiarmed bandits
- Multi-armed bandits in discrete and continuous time
- A short proof of the Gittins index theorem
- Dynamic allocation problems in continuous time
- Multi-armed bandit problem revisited
- Synchronization and optimality for multi-armed bandit problems in continuous time
- On Bayesian index policies for sequential resource allocation
- Lévy bandits: Multi-armed bandits driven by Lévy processes
- On the Whittle index of Markov modulated restless bandits
- Optimal stochastic scheduling
- On Gittins' index theorem in continuous time
- ON THE OPTIMALITY OF AN INDEX RULE IN MULTICHANNEL ALLOCATION FOR SINGLE-HOP MOBILE NETWORKS WITH MULTIPLE SERVICE CLASSES
- Consistency of Sequential Bayesian Sampling Policies
- Multi‐Armed Bandit Allocation Indices
- Linearly Parameterized Bandits
- Branching Bandit Processes
- Extensions of the multiarmed bandit problem: The discounted case
- A Survey of Some Results in Stochastic Adaptive Control
- Continuous-time allocation indices and their discrete-time approximation
- Open bandit processes and optimal scheduling of queueing networks
- On Bayesian models in stochastic scheduling
- A minimax and asymptotically optimal algorithm for stochastic bandits
- The Multi-Armed Bandit With Stochastic Plays
- On the Whittle Index for Restless Multiarmed Hidden Markov Bandits
- General Gittins index processes in discrete time.
- Sample mean based index policies by O(log n) regret for the multi-armed bandit problem
- A General Theory of MultiArmed Bandit Processes with Constrained Arm Switches
- Bandit Algorithms
- Methods and Applications of Statistics in Clinical Trials
- Open Bandit Processes with Uncountable States and Time-Backward Effects
- INDEXABILITY AND OPTIMAL INDEX POLICIES FOR A CLASS OF REINITIALISING RESTLESS BANDITS
- Some aspects of the sequential design of experiments
- Finite-time analysis of the multiarmed bandit problem