Linear Programming for Finite State Multi-Armed Bandit Problems
From MaRDI portal
Publication:3722294
DOI10.1287/moor.11.1.180zbMath0592.90095DBLPjournals/mor/ChenK86OpenAlexW2104336565WikidataQ56001616 ScholiaQ56001616MaRDI QIDQ3722294
Michael N. Katehakis, Yih Ren Chen
Publication date: 1986
Published in: Mathematics of Operations Research (Search for Journal in Brave)
Full work available at URL: https://semanticscholar.org/paper/26372b839c3c7266edb3a5b67d76e74463469a73
Related Items (8)
A bisection/successive approximation method for computing Gittins indices ⋮ Information Acquisition Under Resource Limitations in a Noisy Environment ⋮ Derman's book as inspiration: some results on LP for MDPs ⋮ Testing indexability and computing Whittle and Gittins index in subcubic time ⋮ Survey of linear programming for standard and nonstandard Markovian control problems. Part II: Applications ⋮ Branching Bandit Processes ⋮ Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges ⋮ Optimal stopping problems for multiarmed bandit processes with arms' independence
This page was built for publication: Linear Programming for Finite State Multi-Armed Bandit Problems