The learning component of dynamic allocation indices
DOI10.1214/aos/1176348788zbMath0760.62080OpenAlexW2063853624MaRDI QIDQ1206729
Publication date: 1 April 1993
Published in: The Annals of Statistics (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1214/aos/1176348788
Gittins indexexponential discountingnormal distributionsmultiarmed bandit problemBernoulli reward processdynamical allocation indexexpected immediate rewardlearning componentoptimal allocation ruletarget processesupward adjustment
Bayesian problems; characterization of Bayes procedures (62C10) Sequential statistical methods (62L99) Optimal stochastic control (93E20) Markov and semi-Markov decision processes (90C40) Sequential statistical design (62L05)
Related Items (9)
This page was built for publication: The learning component of dynamic allocation indices