Penalized Q-learning for dynamic treatment regimens
DOI10.5705/ss.2012.364zbMath1415.62054arXiv1108.5338OpenAlexW2015687733WikidataQ40642766 ScholiaQ40642766MaRDI QIDQ2950196
Michael R. Kosorok, Donglin Zeng, Rui Song, Wei-wei Wang
Publication date: 8 October 2015
Published in: Statistica Sinica (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1108.5338
shrinkageQ-learningtwo-stage proceduremulti-stageindividual selectiondynamic treatment regimenpenalized Q-learning
Asymptotic properties of parametric estimators (62F12) Ridge regression; shrinkage estimators (Lasso) (62J07) Applications of statistics to biology and medical sciences; meta analysis (62P10)
Related Items (24)
This page was built for publication: Penalized Q-learning for dynamic treatment regimens