On an index policy for restless bandits

Publication:3970272


DOI: 10.2307/3214547, zbMath: 0735.90072, MaRDI QID: Q3970272

Gideon Weiss, Richard R. Weber

Publication date: 25 June 1992

Published in: Journal of Applied Probability

Full work available at URL: https://doi.org/10.2307/3214547


90C40: Markov and semi-Markov decision processes

91B32: Resource and cost allocation (including fair division, apportionment, etc.)


Related Items

Unnamed Item
MYOPIC POLICIES FOR NON-PREEMPTIVE SCHEDULING OF JOBS WITH DECAYING VALUE
BANDIT STRATEGIES EVALUATED IN THE CONTEXT OF CLINICAL TRIALS IN RARE LIFE-THREATENING DISEASES
On the optimal allocation of service to impatient tasks
Monotone Policies and Indexability for Bidirectional Restless Bandits
Independently Expiring Multiarmed Bandits
Resource competition in virtual network embedding
A Restless Bandit Model for Resource Allocation, Competition, and Reservation
Conditions for indexability of restless bandits and an algorithm to compute Whittle index
Dynamic Programs with Shared Resources and Signals: Dynamic Fluid Policies and Asymptotic Optimality
A Verification Theorem for Threshold-Indexability of Real-State Discounted Restless Bandits
Group Maintenance: A Restless Bandits Approach
An asymptotically optimal heuristic for general nonstationary finite-horizon restless multi-armed, multi-action bandits
Two-Armed Restless Bandits with Imperfect Information: Stochastic Control and Indexability
MULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENT
INDEXABILITY AND OPTIMAL INDEX POLICIES FOR A CLASS OF REINITIALISING RESTLESS BANDITS
Some indexable families of restless bandit problems
Spinning plates and squad systems: policies for bi-directional restless bandits
Testing indexability and computing Whittle and Gittins index in subcubic time
Exponential asymptotic optimality of Whittle index policy
Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges
Resource capacity allocation to stochastic dynamic competitors: knapsack problem for perishable items and index-knapsack heuristic
Four proofs of Gittins' multiarmed bandit theorem
Stochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocation
Dynamic resource allocation in a multi-product make-to-stock production system
General notions of indexability for queueing control and asset management
A mean field approach for optimization in discrete time
Attack allocation on remote state estimation in multi-systems: structural results and asymptotic solution
Asymptotically optimal index policies for an abandonment queue with convex holding cost
Dynamic priority allocation via restless bandit marginal productivity indices
Stochastic scheduling and forwards induction
Index policies for the maintenance of a collection of machines by a set of repairmen
On the Whittle index of Markov modulated restless bandits
Whittle index based Q-learning for restless bandits with average reward
Multi-machine preventive maintenance scheduling with imperfect interventions: a restless bandit approach
Dynamic routing in distinguishable parallel queues: an application of product returns for remanufacturing
Scheduling of multi-class multi-server queueing systems with abandonments
On the computation of Whittle's index for Markovian restless bandits
Generalized Restless Bandits and the Knapsack Problem for Perishable Inventories
r-extreme signalling for congestion control
Index policies for discounted bandit problems with availability constraints
INDEXABILITY OF BANDIT PROBLEMS WITH RESPONSE DELAYS
Grid Brokering for Batch Allocation Using Indexes