On an index policy for restless bandits

DOI: 10.2307/3214547, zbMath: 0735.90072, OpenAlex: W2044502527, MaRDI QID: Q3970272

Publication date: 25 June 1992

Published in: Journal of Applied Probability

Full work available at URL: https://doi.org/10.2307/3214547

zbMATH Keywords

multi-armed bandit problem; expected time-average reward; Markov rule

Mathematics Subject Classification ID

Markov and semi-Markov decision processes (90C40) ⋮ Resource and cost allocation (including fair division, apportionment, etc.) (91B32)

Related Items (44)

Stochastic scheduling and forwards induction ⋮ Conditions for indexability of restless bandits and an algorithm to compute Whittle index ⋮ Dynamic Programs with Shared Resources and Signals: Dynamic Fluid Policies and Asymptotic Optimality ⋮ Resource capacity allocation to stochastic dynamic competitors: knapsack problem for perishable items and index-knapsack heuristic ⋮ Four proofs of Gittins' multiarmed bandit theorem ⋮ On the optimal allocation of service to impatient tasks ⋮ Dynamic routing in distinguishable parallel queues: an application of product returns for remanufacturing ⋮ Scheduling of multi-class multi-server queueing systems with abandonments ⋮ Generalized Restless Bandits and the Knapsack Problem for Perishable Inventories ⋮ On the computation of Whittle's index for Markovian restless bandits ⋮ Multi-machine preventive maintenance scheduling with imperfect interventions: a restless bandit approach ⋮ Testing indexability and computing Whittle and Gittins index in subcubic time ⋮ MULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENT ⋮ INDEXABILITY AND OPTIMAL INDEX POLICIES FOR A CLASS OF REINITIALISING RESTLESS BANDITS ⋮ Exponential asymptotic optimality of Whittle index policy ⋮ A confirmation of a conjecture on Feldman’s two-armed bandit problem ⋮ Index policies for discounted bandit problems with availability constraints ⋮ A mean field approach for optimization in discrete time ⋮ A Verification Theorem for Threshold-Indexability of Real-State Discounted Restless Bandits ⋮ r-extreme signalling for congestion control ⋮ Dynamic priority allocation via restless bandit marginal productivity indices ⋮ MYOPIC POLICIES FOR NON-PREEMPTIVE SCHEDULING OF JOBS WITH DECAYING VALUE ⋮ Group Maintenance: A Restless Bandits Approach ⋮ BANDIT STRATEGIES EVALUATED IN THE CONTEXT OF CLINICAL TRIALS IN RARE LIFE-THREATENING DISEASES ⋮ Some indexable families of restless bandit problems ⋮ Stochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocation ⋮ Attack allocation on remote state estimation in multi-systems: structural results and asymptotic solution ⋮ Monotone Policies and Indexability for Bidirectional Restless Bandits ⋮ Dynamic resource allocation in a multi-product make-to-stock production system ⋮ INDEXABILITY OF BANDIT PROBLEMS WITH RESPONSE DELAYS ⋮ General notions of indexability for queueing control and asset management ⋮ Index policies for the maintenance of a collection of machines by a set of repairmen ⋮ Independently Expiring Multiarmed Bandits ⋮ An asymptotically optimal heuristic for general nonstationary finite-horizon restless multi-armed, multi-action bandits ⋮ Grid Brokering for Batch Allocation Using Indexes ⋮ Asymptotically optimal index policies for an abandonment queue with convex holding cost ⋮ Two-Armed Restless Bandits with Imperfect Information: Stochastic Control and Indexability ⋮ Spinning plates and squad systems: policies for bi-directional restless bandits ⋮ Resource competition in virtual network embedding ⋮ On the Whittle index of Markov modulated restless bandits ⋮ Unnamed Item ⋮ Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges ⋮ Whittle index based Q-learning for restless bandits with average reward ⋮ A Restless Bandit Model for Resource Allocation, Competition, and Reservation
