Four proofs of Gittins' multiarmed bandit theorem

DOI: 10.1007/S10479-013-1523-0; MaRDI QID: Q333080

Publication date 9 November 2016

Published in Annals of Operations Research

Full work available at URL https://doi.org/10.1007/s10479-013-1523-0

linear programming; dynamic programming; Gittins index; bandit problems

Dynamic programming (90C39); Queueing theory (aspects of probability theory) (60K25); Stopping times; optimal stopping problems; gambling theory (60G40); Markov and semi-Markov decision processes (90C40)

Recommendations

A short proof of the Gittins index theorem
On the Gittins index for multiarmed bandits
On Gittins' index theorem in continuous time
General Gittins index processes in discrete time.
Multi-armed bandit problem revisited

Cites work

scientific article; zbMATH DE number 4131489
scientific article; zbMATH DE number 3125136
scientific article; zbMATH DE number 3906232
scientific article; zbMATH DE number 4029251
scientific article; zbMATH DE number 4087408
scientific article; zbMATH DE number 3687126
scientific article; zbMATH DE number 48691
scientific article; zbMATH DE number 3474804
scientific article; zbMATH DE number 3638998
scientific article; zbMATH DE number 194374
scientific article; zbMATH DE number 1860211
scientific article; zbMATH DE number 3238721
scientific article; zbMATH DE number 3313523
scientific article; zbMATH DE number 3422402
A \((2/3)n^{3}\) fast-pivoting algorithm for the Gittins index and optimal stopping of a Markov chain
A generalized Gittins index for a Markov chain and its recursive calculation
A short proof of the Gittins index theorem
Addendum to ‘On an index policy for restless bandits’
Almost optimal policies for stochastic systems which almost satisfy conservation laws
Arm-acquiring bandits
Branching Bandit Processes
Characterization and Optimization of Achievable Performance in General Queueing Systems
Conservation Laws, Extended Polymatroids and Multiarmed Bandit Problems; A Polyhedral Approach to Indexable Systems
Discrete multiarmed bandits and multiparameter processes
Dynamic Scheduling of a Multiclass Queue: Discount Optimality
Dynamic allocation indices for restless projects and queueing admission control: a polyhedral approach
Extensions of the multiarmed bandit problem: The discounted case
Finite state multi-armed bandit problems: Sensitive-discount, average-reward and average-overtaking optimality
Multi-armed bandit allocation indices. With a foreword by Peter Whittle.
Multi-armed bandit problem revisited
Multi-armed bandits in discrete and continuous time
Multiclass Queueing Systems: Polymatroidal Structure and Optimal Scheduling Control
Multiple feedback at a single-server station
On an index policy for restless bandits
On duality theory of conic linear problems.
On the Gittins index for multiarmed bandits
Optimal Control of Single-Server Queuing Networks and Multi-Class M/G/1 Queues with Feedback
Parallel Scheduling of Multiclass M/M/m Queues: Approximate and Heavy-Traffic Optimization of Achievable Performance
Restless Bandit Marginal Productivity Indices, Diminishing Returns, and Optimal Control of Make-to-Order/Make-to-Stock M/G/1 Queues
Restless bandits, partial conservation laws and indexability
Risk-Sensitive and Risk-Neutral Multiarmed Bandits
Scheduling for Minimum Total Loss Using Service Time Distributions
The Achievable Region Approach to the Optimal Control of Stochastic Systems
The Multi-Armed Bandit Problem: Decomposition and Computation
The multi-armed bandit, with constraints
Time-Sharing Service Systems. I

Cited in (12)

Optimal strategies for families of alternative bandit processes
A Bayesian two-armed bandit model
scientific article; zbMATH DE number 775000
Approximation algorithms for stochastic combinatorial optimization problems
Robust control of the multi-armed bandit problem
Technical note -- A note on the equivalence of upper confidence bounds and Gittins indices for patient agents
A short proof of the Gittins index theorem
Multi-armed bandit problem revisited
Gittins' theorem under uncertainty
Decomposing risk in an exploitation-exploration problem with endogenous termination time
Multi-armed bandits under general depreciation and commitment
A lemma on the multiarmed bandit problem
