A Verification Theorem for Threshold-Indexability of Real-State Discounted Restless Bandits

DOI10.1287/moor.2019.0998zbMath1455.90147arXiv1512.04403OpenAlexW2982074554WikidataQ127024107 ScholiaQ127024107MaRDI QIDQ5119843

José Niño-Mora

Publication date: 1 September 2020

Published in: Mathematics of Operations Research (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1512.04403

zbMATH Keywords

Markov decision processes discrete time Whittle index discounted criterion index policies threshold policies indexability

Mathematics Subject Classification ID

Stochastic scheduling theory in operations research (90B36) Dynamic programming (90C39) Programming in abstract spaces (90C48) Markov and semi-Markov decision processes (90C40)

Cites Work

Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Unnamed Item
Admission and routing of soft real-time jobs to multiclusters: design and comparison of index policies
Asymptotically optimal priority policies for indexable and nonindexable restless bandits
Dynamic priority allocation via restless bandit marginal productivity indices
Markov programming by successive approximations with respect to weighted supremum norms
Whittle's index policy for a multi-class queueing system with convex holding costs
Dynamic allocation indices for restless projects and queueing admission control: a polyhedral approach
Resource allocation and routing in parallel multi-server queues with abandonments for cloud profit maximization
Optimality of monotonic policies for two-action Markovian decision processes, with applications to control of queues with delayed information
On the substitution rule for Lebesgue-Stieltjes integrals
Restless bandits, partial conservation laws and indexability
The Complexity of Optimal Queuing Network Control
Partially Observed Markov Decision Processes
Exploiting Channel Memory for Joint Estimation and Scheduling in Downlink Networks—a Whittle’s Indexability Analysis
INDEXABILITY OF BANDIT PROBLEMS WITH RESPONSE DELAYS
A Restless Bandit Marginal Productivity Index for Opportunistic Spectrum Access with Sensing Errors
The Multi-Armed Bandit Problem: Decomposition and Computation
A Characterization of Waiting Time Performance Realizable by Single-Server Queues
On an index policy for restless bandits
Multiclass Queueing Systems: Polymatroidal Structure and Optimal Scheduling Control
On Dynamic Programming with Unbounded Rewards
Solving a general discounted dynamic program by linear programming
The Lebesgue-Stieltjes Integral
Whittle Index Policy for Crawling Ephemeral Content
Whittle’s Index Policy for Multi-Target Tracking with Jamming and Nondetections
Wireless Channel Selection with Restless Bandits
On the Whittle Index for Restless Multiarmed Hidden Markov Bandits
Conservation Laws, Extended Polymatroids and Multiarmed Bandit Problems; A Polyhedral Approach to Indexable Systems
Indexability of Restless Bandit Problems and Optimality of Whittle Index for Dynamic Multichannel Access
Scheduling Continuous-Time Kalman Filters
MULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENT
Restless Bandit Marginal Productivity Indices, Diminishing Returns, and Optimal Control of Make-to-Order/Make-to-Stock M/G/1 Queues
Scheduling a Make-To-Stock Queue: Index Policies and Hedging Points
Convex functions and their applications. A contemporary approach

This page was built for publication: A Verification Theorem for Threshold-Indexability of Real-State Discounted Restless Bandits