A Verification Theorem for Threshold-Indexability of Real-State Discounted Restless Bandits
Publication:5119843
DOI: 10.1287/moor.2019.0998; zbMath: 1455.90147; arXiv: 1512.04403; OpenAlex: W2982074554; Wikidata: Q127024107; Scholia: Q127024107; MaRDI QID: Q5119843
Publication date: 1 September 2020
Published in: Mathematics of Operations Research
Full work available at URL: https://arxiv.org/abs/1512.04403
Markov decision processes; discrete time; Whittle index; discounted criterion; index policies; threshold policies; indexability
Stochastic scheduling theory in operations research (90B36); Dynamic programming (90C39); Programming in abstract spaces (90C48); Markov and semi-Markov decision processes (90C40)
Cites Work
- Admission and routing of soft real-time jobs to multiclusters: design and comparison of index policies
- Asymptotically optimal priority policies for indexable and nonindexable restless bandits
- Dynamic priority allocation via restless bandit marginal productivity indices
- Markov programming by successive approximations with respect to weighted supremum norms
- Whittle's index policy for a multi-class queueing system with convex holding costs
- Dynamic allocation indices for restless projects and queueing admission control: a polyhedral approach
- Resource allocation and routing in parallel multi-server queues with abandonments for cloud profit maximization
- Optimality of monotonic policies for two-action Markovian decision processes, with applications to control of queues with delayed information
- On the substitution rule for Lebesgue-Stieltjes integrals
- Restless bandits, partial conservation laws and indexability
- The Complexity of Optimal Queuing Network Control
- Partially Observed Markov Decision Processes
- Exploiting Channel Memory for Joint Estimation and Scheduling in Downlink Networks—a Whittle’s Indexability Analysis
- Indexability of Bandit Problems with Response Delays
- A Restless Bandit Marginal Productivity Index for Opportunistic Spectrum Access with Sensing Errors
- The Multi-Armed Bandit Problem: Decomposition and Computation
- A Characterization of Waiting Time Performance Realizable by Single-Server Queues
- On an index policy for restless bandits
- Multiclass Queueing Systems: Polymatroidal Structure and Optimal Scheduling Control
- On Dynamic Programming with Unbounded Rewards
- Solving a general discounted dynamic program by linear programming
- The Lebesgue-Stieltjes Integral
- Whittle Index Policy for Crawling Ephemeral Content
- Whittle’s Index Policy for Multi-Target Tracking with Jamming and Nondetections
- Wireless Channel Selection with Restless Bandits
- On the Whittle Index for Restless Multiarmed Hidden Markov Bandits
- Conservation Laws, Extended Polymatroids and Multiarmed Bandit Problems; A Polyhedral Approach to Indexable Systems
- Indexability of Restless Bandit Problems and Optimality of Whittle Index for Dynamic Multichannel Access
- Scheduling Continuous-Time Kalman Filters
- Multi-Armed Bandits under General Depreciation and Commitment
- Restless Bandit Marginal Productivity Indices, Diminishing Returns, and Optimal Control of Make-to-Order/Make-to-Stock M/G/1 Queues
- Scheduling a Make-To-Stock Queue: Index Policies and Hedging Points
- Convex functions and their applications. A contemporary approach