One-armed bandit problems with covariates
From MaRDI portal
Publication:1184219
DOI10.1214/aos/1176348382zbMath0757.62038OpenAlexW2004971231MaRDI QIDQ1184219
Publication date: 28 June 1992
Published in: The Annals of Statistics (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1214/aos/1176348382
clinical trialsasymptotically optimalmyopic policyregretcovariateBayesian sequential allocationinfinite population of patientsone-armed bandit problemtotal discounted expected reward
Applications of statistics to biology and medical sciences; meta analysis (62P10) Sequential statistical analysis (62L10)
Related Items
Bayesian adaptive bandit-based designs using the Gittins index for multi-armed trials with normally distributed endpoints ⋮ Isotonic smoothing splines under sequential designs ⋮ Woodroofe's one-armed bandit problem revisited ⋮ Bandit and covariate processes, with finite or non-denumerable set of arms ⋮ A linear response bandit problem ⋮ MULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONS ⋮ One-armed bandit process with a covariate ⋮ Optimal allocations in sequential tests involving two populations with covariates ⋮ Optimal Bayesian strategies for the infinite-armed Bernoulli bandit ⋮ Modeling item-item similarities for personalized recommendations on Yahoo! front page ⋮ Arbitrary side observations in bandit problems ⋮ Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards ⋮ Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates