A short proof of the Gittins index theorem

Publication:1327612

DOI: 10.1214/aoap/1177005207 ⋮ zbMath: 0816.60040 ⋮ OpenAlex: W2159471113 ⋮ MaRDI QID: Q1327612

John N. Tsitsiklis

Publication date: 6 July 1995

Published in: The Annals of Applied Probability

Full work available at URL: https://doi.org/10.1214/aoap/1177005207

zbMATH Keywords

induction ⋮ multi-armed bandits ⋮ interchange ⋮ Gittins index theorem

Mathematics Subject Classification ID

Deterministic scheduling theory in operations research (90B35) ⋮ Stopping times; optimal stopping problems; gambling theory (60G40) ⋮ Markov and semi-Markov decision processes (90C40) ⋮ Sequential statistical design (62L05)

Related Items

Open Bandit Processes with Uncountable States and Time-Backward Effects ⋮ Optimistic Gittins Indices ⋮ Four proofs of Gittins' multiarmed bandit theorem ⋮ Continue, quit, restart probability model ⋮ The multi-armed bandit, with constraints ⋮ The achievable region method in the optimal control of queueing systems; formulations, bounds and policies ⋮ Index policy for multiarmed bandit problem with dynamic risk measures ⋮ Unnamed Item ⋮ A perpetual search for talents across overlapping generations: a learning process ⋮ MULTI-ARMED BANDITS UNDER GENERAL DEPRECIATION AND COMMITMENT ⋮ Empirical Gittins index strategies with \(\varepsilon\)-explorations for multi-armed bandit problems ⋮ Index policies for discounted bandit problems with availability constraints ⋮ Stochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocation ⋮ A generalized Gittins index for a Markov chain and its recursive calculation ⋮ Gittins Index for Simple Family of Markov Bandit Processes with Switching Cost and No Discounting ⋮ Dynamic matching with teams
