Bandit problems with side observations
From MaRDI portal
Publication:5274013
DOI10.1109/TAC.2005.844079zbMath1366.91063arXivcs/0501063OpenAlexW2138859735MaRDI QIDQ5274013
Sanjeev R. Kulkarni, Chih-Chun Wang, H. Vincent Poor
Publication date: 12 July 2017
Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/cs/0501063
Bayesian problems; characterization of Bayes procedures (62C10) Decision theory (91B06) General considerations in statistical decision theory (62C05)
Related Items (12)
A non-parametric solution to the multi-armed bandit problem with covariates ⋮ Bounded Regret for Finitely Parameterized Multi-Armed Bandits ⋮ Bandit and covariate processes, with finite or non-denumerable set of arms ⋮ Smoothness-Adaptive Contextual Bandits ⋮ Smooth Contextual Bandits: Bridging the Parametric and Nondifferentiable Regret Regimes ⋮ MULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONS ⋮ Dealing with expert bias in collective decision-making ⋮ Nonstochastic Multi-Armed Bandits with Graph-Structured Feedback ⋮ Multiclass classification with bandit feedback using adaptive regularization ⋮ Arbitrary side observations in bandit problems ⋮ Infinite Arms Bandit: Optimality via Confidence Bounds ⋮ Bayesian Incentive-Compatible Bandit Exploration
This page was built for publication: Bandit problems with side observations