One-armed bandit problems with covariates

From MaRDI portal

Publication:1184219

Jump to:navigation, search

DOI10.1214/aos/1176348382zbMath0757.62038OpenAlexW2004971231MaRDI QIDQ1184219

Jyotirmoy Sarkar

Publication date: 28 June 1992

Published in: The Annals of Statistics (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1214/aos/1176348382

zbMATH Keywords

clinical trials asymptotically optimal myopic policy regret covariate Bayesian sequential allocation infinite population of patients one-armed bandit problem total discounted expected reward

Mathematics Subject Classification ID

Applications of statistics to biology and medical sciences; meta analysis (62P10) Sequential statistical analysis (62L10)

Related Items

Bayesian adaptive bandit-based designs using the Gittins index for multi-armed trials with normally distributed endpoints ⋮ Isotonic smoothing splines under sequential designs ⋮ Woodroofe's one-armed bandit problem revisited ⋮ Bandit and covariate processes, with finite or non-denumerable set of arms ⋮ A linear response bandit problem ⋮ MULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONS ⋮ One-armed bandit process with a covariate ⋮ Optimal allocations in sequential tests involving two populations with covariates ⋮ Optimal Bayesian strategies for the infinite-armed Bernoulli bandit ⋮ Modeling item-item similarities for personalized recommendations on Yahoo! front page ⋮ Arbitrary side observations in bandit problems ⋮ Randomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewards ⋮ Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:1184219&oldid=12049276"