The multi-armed bandit problem with covariates

From MaRDI portal
Revision as of 03:47, 30 January 2024 by Import240129110155 (talk | contribs) (Created automatically from import240129110155)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:355096

DOI10.1214/13-AOS1101zbMath1360.62436arXiv1110.6084OpenAlexW3100895096WikidataQ56675681 ScholiaQ56675681MaRDI QIDQ355096

Vianney Perchet, Philippe Rigollet

Publication date: 24 July 2013

Published in: The Annals of Statistics (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1110.6084




Related Items (26)

Bayesian adaptive bandit-based designs using the Gittins index for multi-armed trials with normally distributed endpointsA non-parametric solution to the multi-armed bandit problem with covariatesBatched bandit problemsBandit and covariate processes, with finite or non-denumerable set of armsSmoothness-Adaptive Contextual BanditsSmooth Contextual Bandits: Bridging the Parametric and Nondifferentiable Regret RegimesA Single-Index Model With a Surface-Link for Optimizing Individualized Dose RulesRanking and Selection with Covariates for Personalized Decision MakingTechnical note—Knowledge gradient for selection with covariates: Consistency and computationNonstochastic Multi-Armed Bandits with Graph-Structured FeedbackUnnamed ItemAn adaptive multiclass nearest neighbor classifierTreatment recommendation with distributional targetsTransfer learning for contextual multi-armed banditsLearning the distribution with largest mean: two bandit frameworksGaussian process bandits with adaptive discretizationOnline Decision Making with High-Dimensional CovariatesRandomized allocation with nonparametric estimation for contextual multi-armed bandits with delayed rewardsUnnamed ItemInfinite Arms Bandit: Optimality via Confidence BoundsRandomized allocation with arm elimination in a bandit problem with covariatesDynamic Assortment Personalization in High DimensionsNonparametric Pricing Analytics with Customer CovariatesOnline Learning of Nash Equilibria in Congestion GamesStatistical Inference for Online Decision Making: In a Contextual Bandit SettingLearning in Repeated Auctions



Cites Work


This page was built for publication: The multi-armed bandit problem with covariates