A short proof of the Gittins index theorem
From MaRDI portal
Publication:1327612
DOI: 10.1214/aoap/1177005207 ⋮ zbMath: 0816.60040 ⋮ OpenAlex: W2159471113 ⋮ MaRDI QID: Q1327612
Publication date: 6 July 1995
Published in: The Annals of Applied Probability
Full work available at URL: https://doi.org/10.1214/aoap/1177005207
Deterministic scheduling theory in operations research (90B35) ⋮ Stopping times; optimal stopping problems; gambling theory (60G40) ⋮ Markov and semi-Markov decision processes (90C40) ⋮ Sequential statistical design (62L05)
Related Items
Open Bandit Processes with Uncountable States and Time-Backward Effects ⋮ Optimistic Gittins Indices ⋮ Four proofs of Gittins' multiarmed bandit theorem ⋮ Continue, quit, restart probability model ⋮ The multi-armed bandit, with constraints ⋮ The achievable region method in the optimal control of queueing systems; formulations, bounds and policies ⋮ Index policy for multiarmed bandit problem with dynamic risk measures ⋮ Unnamed Item ⋮ A perpetual search for talents across overlapping generations: a learning process ⋮ MULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENT ⋮ Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems ⋮ Index policies for discounted bandit problems with availability constraints ⋮ Stochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocation ⋮ A generalized Gittins index for a Markov chain and its recursive calculation ⋮ Gittins Index for Simple Family of Markov Bandit Processes with Switching Cost and No Discounting ⋮ Dynamic matching with teams