A linear response bandit problem
Publication:5168867
DOI: 10.1214/11-SSY032, zbMath: 1352.91009, OpenAlex: W2069129115, MaRDI QID: Q5168867
Assaf J. Zeevi, Alexander Goldenshluger
Publication date: 21 July 2014
Full work available at URL: https://doi.org/10.1214/11-ssy032
Stopping times; optimal stopping problems; gambling theory (60G40)
Probabilistic games; gambling (91A60)
Related Items
- Smoothness-Adaptive Contextual Bandits
- Smooth Contextual Bandits: Bridging the Parametric and Nondifferentiable Regret Regimes
- Bandit Theory: Applications to Learning Healthcare Systems and Clinical Trials
- Ranking and Selection with Covariates for Personalized Decision Making
- Optimal designs for the development of personalized treatment rules
- A general characterization of optimal tie-breaker designs
- Nearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset Selection
- Transfer learning for contextual multi-armed bandits
- Online Decision Making with High-Dimensional Covariates
- Infinite Arms Bandit: Optimality via Confidence Bounds
- Randomized allocation with arm elimination in a bandit problem with covariates
- Dynamic Assortment Personalization in High Dimensions
- Regret lower bound and optimal algorithm for high-dimensional contextual linear bandit
- Statistical Inference for Online Decision Making via Stochastic Gradient Descent
- Nonparametric Pricing Analytics with Customer Covariates
- Statistical Inference for Online Decision Making: In a Contextual Bandit Setting
- Unnamed Item
Cites Work
- Woodroofe's one-armed bandit problem revisited
- Asymptotically efficient adaptive allocation rules
- Adaptive treatment allocation and the multi-armed bandit problem
- One-armed bandit problems with covariates
- Stochastic differential systems, stochastic control theory and applications. (Proceedings of a workshop, held at IMA, Minnesota University, Minneapolis, June 9-19, 1986)
- Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates
- Optimal aggregation of classifiers in statistical learning
- Applications of the van Trees inequality: A Bayesian Cramér-Rao bound
- Arbitrary side observations in bandit problems
- Linearly Parameterized Bandits
- A One-Armed Bandit Problem with a Concomitant Variable
- The Nonstochastic Multiarmed Bandit Problem
- DOI: 10.1162/153244303321897663
- Machine learning and nonparametric bandit theory
- A Structured Multiarmed Bandit Problem and the Greedy Policy
- A Note on Performance Limitations in Bandit Problems With Side Information
- Prediction, Learning, and Games
- Some aspects of the sequential design of experiments
- Finite-time analysis of the multiarmed bandit problem