An asymptotically optimal heuristic for general nonstationary finite-horizon restless multi-armed, multi-action bandits
From MaRDI portal
Publication:5203955
DOI10.1017/apr.2019.29zbMath1427.90288OpenAlexW2711678697WikidataQ127308402 ScholiaQ127308402MaRDI QIDQ5203955
Gabriel Zayas-Cabán, Stefanus Jasin, Gui-Hua Wang
Publication date: 9 December 2019
Published in: Unnamed Author (Search for Journal in Brave)
Full work available at URL: http://hdl.handle.net/2027.42/138941
Stochastic scheduling theory in operations research (90B36) Markov and semi-Markov decision processes (90C40) Performance evaluation, queueing, and scheduling in the context of computer systems (68M20)
Related Items (4)
Dynamic Programs with Shared Resources and Signals: Dynamic Fluid Policies and Asymptotic Optimality ⋮ An asymptotically optimal heuristic for general nonstationary finite-horizon restless multi-armed, multi-action bandits: Corrigendum ⋮ Exponential asymptotic optimality of Whittle index policy ⋮ An asymptotically optimal heuristic for general nonstationary finite-horizon restless multi-armed, multi-action bandits
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Asymptotically optimal priority policies for indexable and nonindexable restless bandits
- Multi-armed bandits with discount factor near one: The Bernoulli case
- The Complexity of Optimal Queuing Network Control
- Computing a Classic Index for Finite-Horizon Bandits
- Optimal priority assignment with hard constraint
- Multi‐Armed Bandit Allocation Indices
- Dynamic Assortment with Demand Learning for Seasonal Consumer Goods
- On Sequential Designs for Maximizing the Sum of $n$ Observations
- On an index policy for restless bandits
- Restless Bandits, Linear Programming Relaxations, and a Primal-Dual Index Heuristic
- Optimality of Myopic Sensing in Multichannel Opportunistic Access
- Improving Health Outcomes Through Better Capacity Allocation in a Community-Based Chronic Care Model
- An asymptotically optimal heuristic for general nonstationary finite-horizon restless multi-armed, multi-action bandits
- Some aspects of the sequential design of experiments
This page was built for publication: An asymptotically optimal heuristic for general nonstationary finite-horizon restless multi-armed, multi-action bandits