Bandit problems with side observations
From MaRDI portal
Publication:5274013
DOI10.1109/TAC.2005.844079zbMath1366.91063arXivcs/0501063MaRDI QIDQ5274013
Sanjeev R. Kulkarni, Chih-Chun Wang, H. Vincent Poor
Publication date: 12 July 2017
Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/cs/0501063
62C10: Bayesian problems; characterization of Bayes procedures
91B06: Decision theory
62C05: General considerations in statistical decision theory
Related Items
Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback, Infinite Arms Bandit: Optimality via Confidence Bounds, Bounded Regret for Finitely Parameterized Multi-Armed Bandits, Smoothness-Adaptive Contextual Bandits, Smooth Contextual Bandits: Bridging the Parametric and Nondifferentiable Regret Regimes, MULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONS, Dealing with expert bias in collective decision-making, A non-parametric solution to the multi-armed bandit problem with covariates, Multiclass classification with bandit feedback using adaptive regularization, Bandit and covariate processes, with finite or non-denumerable set of arms, Arbitrary side observations in bandit problems, Bayesian Incentive-Compatible Bandit Exploration