Proper Inference for Value Function in High-Dimensional Q-Learning for Dynamic Treatment Regimes
DOI: 10.1080/01621459.2018.1506341 · zbMath: 1428.62246 · OpenAlex: W2885521542 · Wikidata: Q92590693 · MaRDI QID: Q5242485
Wensheng Zhu, Rui Song, Donglin Zeng
Publication date: 12 November 2019
Published in: Journal of the American Statistical Association
Full work available at URL: http://europepmc.org/articles/pmc6953729
MSC classification:
- Asymptotic properties of parametric estimators (62F12)
- Estimation in multivariate analysis (62H12)
- Ridge regression; shrinkage estimators (Lasso) (62J07)
- General considerations in statistical decision theory (62C05)
Cites Work
- Nearly unbiased variable selection under minimax concave penalty
- A unified approach to model selection and sparse recovery using regularized least squares
- Statistical inference for the mean outcome under a possibly non-unique optimal treatment strategy
- Statistical methods for dynamic treatment regimes. Reinforcement learning, causal inference, and personalized medicine
- Dynamic treatment regimes: technical challenges and applications
- Performance guarantees for individualized treatment rules
- Inference for Optimal Dynamic Treatment Regimes Using an Adaptive m-Out-of-n Bootstrap Scheme
- Penalized Q-learning for dynamic treatment regimens
- Estimating Optimal Dynamic Regimes: Correcting Bias under the Null
- Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties
- Inference for non-regular parameters in optimal dynamic treatment regimes
- Nonconcave Penalized Likelihood With NP-Dimensionality
- Optimal Structural Nested Models for Optimal Sequential Decisions