The multi-armed bandit problem with covariates
DOI10.1214/13-AOS1101zbMath1360.62436arXiv1110.6084OpenAlexW3100895096WikidataQ56675681 ScholiaQ56675681MaRDI QIDQ355096
Vianney Perchet, Philippe Rigollet
Publication date: 24 July 2013
Published in: The Annals of Statistics (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1110.6084
nonparametric banditmulti-armed banditregret boundscontextual banditadaptive partitionsequential allocationsuccessive elimination
Nonparametric regression and quantile regression (62G08) Classification and discrimination; cluster analysis (statistical aspects) (62H30) Learning and adaptive systems in artificial intelligence (68T05) Sequential statistical design (62L05) Sequential estimation (62L12)
Related Items (26)
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- UCB revisited: improved regret bounds for the stochastic multi-armed bandit problem
- Woodroofe's one-armed bandit problem revisited
- Fast learning rates for plug-in classifiers
- Asymptotically efficient adaptive allocation rules
- Smooth discrimination analysis
- Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates
- Optimal aggregation of classifiers in statistical learning.
- An Asymptotic Minimax Theorem for the Two Armed Bandit Problem
- A One-Armed Bandit Problem with a Concomitant Variable
- A Note on Performance Limitations in Bandit Problems With Side Information
- Online Learning with Prior Knowledge
- Prediction, Learning, and Games
- Some aspects of the sequential design of experiments
- Finite-time analysis of the multiarmed bandit problem
This page was built for publication: The multi-armed bandit problem with covariates