Algorithms for evaluating the dynamic allocation index
From MaRDI portal
Publication:1166434
DOI10.1016/0167-6377(82)90050-5zbMath0488.90074MaRDI QIDQ1166434
Publication date: 1982
Published in: Operations Research Letters (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/0167-6377(82)90050-5
algorithms; two-armed bandit problem; optimal policies; multi-armed bandit problem; dynamic allocation indices; alternative bandit processes; calculation of indices; Markov decision chain
90C40: Markov and semi-Markov decision processes