MULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONS

DOI10.5705/SS.202020.0454MaRDI QIDQ5072149zbMATH OpenOpenAlexFDO

Authors Tze Leung Lai, Huanzhong Xu, Dongwoo Kim

Publication date 25 April 2022

Published in STATISTICA SINICA (Search for Journal in Brave)

Full work available at URL https://doi.org/10.5705/ss.202020.0454

personalized medicine reinforcement learning recommender system contextual multi-armed bandits \(\epsilon\)-greedy randomization

Mathematics Subject Classification ID

Statistics (62-XX)

Recommendations

A non-parametric solution to the multi-armed bandit problem with covariates
Randomized allocation with arm elimination in a bandit problem with covariates
The multi-armed bandit problem with covariates
Randomized allocation with nonparametric estimation for a multi-armed bandit problem with covariates
Kernel estimation and model combination in a bandit problem with covariates

Cites work

scientific article; zbMATH DE number 991833 (Why is no real title available?)
scientific article; zbMATH DE number 3124347 (Why is no real title available?)
scientific article; zbMATH DE number 3126094 (Why is no real title available?)
scientific article; zbMATH DE number 3687126 (Why is no real title available?)
scientific article; zbMATH DE number 3733065 (Why is no real title available?)
scientific article; zbMATH DE number 3474804 (Why is no real title available?)
scientific article; zbMATH DE number 3638998 (Why is no real title available?)
A One-Armed Bandit Problem with a Concomitant Variable
Adaptive treatment allocation and the multi-armed bandit problem
Asymptotically efficient adaptive allocation rules
Bandit problems with side observations
Convergence rate of sieve estimates
Finite-time analysis of the multiarmed bandit problem
Information and asymptotic efficiency in parametric-nonparametric models
Information-theoretic determination of minimax rates of convergence
Local linear regression smoothers and their minimax efficiencies
Machine learning and nonparametric bandit theory
Minimax-optimal nonparametric regression in high dimensions
Multivariate locally weighted least squares regression
Nonparametric bandit methods
On adaptive estimation
One-armed bandit problems with covariates
Optimal stopping and dynamic allocation
Rates of convergence for minimum contrast estimators
Some aspects of the sequential design of experiments
Statistical science in information technology and precision medicine
Woodroofe's one-armed bandit problem revisited

Cited in

(6)

Matrices -- compensating the loss of anschauung
One-armed bandit problems with covariates
Bandit and covariate processes, with finite or non-denumerable set of arms
Transfer learning for contextual multi-armed bandits
A revised approach for risk-averse multi-armed bandits under CVaR criterion
Encounters with Martingales in Statistics and Stochastic Optimization

This page was built for publication: MULTI-ARMED BANDITS WITH COVARIATES:THEORY AND APPLICATIONS

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5072149)