Algorithms for evaluating the dynamic allocation index
From MaRDI portal
Publication:1166434
DOI10.1016/0167-6377(82)90050-5zbMath0488.90074OpenAlexW1989992646MaRDI QIDQ1166434
Publication date: 1982
Published in: Operations Research Letters (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/0167-6377(82)90050-5
algorithmstwo-armed bandit problemoptimal policiesmulti-armed bandit problemdynamic allocation indicesalternative bandit processescalculation of indicesMarkov decision chain
Related Items (8)
Stochastic scheduling and forwards induction ⋮ Index policy for multiarmed bandit problem with dynamic risk measures ⋮ Algorithms for evaluating the dynamic allocation index ⋮ Stochastic scheduling: a short history of index policies and new approaches to index generation for dynamic resource allocation ⋮ A comparative study of ad hoc techniques and evolutionary methods for multi-armed bandit problems ⋮ Un ordonnancement dynamique de tâches stochastiques sur un seul processeur ⋮ Robust control of the multi-armed bandit problem ⋮ Multi-armed bandit models for the optimal design of clinical trials: benefits and challenges
Cites Work
This page was built for publication: Algorithms for evaluating the dynamic allocation index