Woodroofe's one-armed bandit problem revisited

From MaRDI portal

Publication:835072

Jump to:navigation, search

DOI10.1214/08-AAP589zbMath1168.62071arXiv0909.0119OpenAlexW3106279838MaRDI QIDQ835072

Alexander Goldenshluger, Assaf J. Zeevi

Publication date: 27 August 2009

Published in: The Annals of Applied Probability (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/0909.0119

zbMATH Keywords

minimax estimation online learning regret bandit problems sequential allocation inferior sampling rate rate-optimal policy

Mathematics Subject Classification ID

Minimax procedures in statistical decision theory (62C20) Stopping times; optimal stopping problems; gambling theory (60G40) Sequential statistical design (62L05)

Related Items

A non-parametric solution to the multi-armed bandit problem with covariates, Bandit and covariate processes, with finite or non-denumerable set of arms, A linear response bandit problem, Smooth Contextual Bandits: Bridging the Parametric and Nondifferentiable Regret Regimes, MULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONS, The multi-armed bandit problem with covariates, One-armed bandit process with a covariate, Unnamed Item, Transfer learning for contextual multi-armed bandits, Randomized allocation with arm elimination in a bandit problem with covariates, Regret lower bound and optimal algorithm for high-dimensional contextual linear bandit, Nonparametric Pricing Analytics with Customer Covariates

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:835072&oldid=12773054"