Penalized Q-learning for dynamic treatment regimens

From MaRDI portal
Publication:2950196

DOI10.5705/ss.2012.364zbMath1415.62054arXiv1108.5338OpenAlexW2015687733WikidataQ40642766 ScholiaQ40642766MaRDI QIDQ2950196

Michael R. Kosorok, Donglin Zeng, Rui Song, Wei-wei Wang

Publication date: 8 October 2015

Published in: Statistica Sinica (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1108.5338




Related Items (24)

Quantile-Optimal Treatment RegimesHigh-dimensional inference for personalized treatment decisionAdaptive treatment and robust controlDynamic treatment regimes: technical challenges and applicationsComment on ``Dynamic treatment regimes: technical challenges and applicationsFairness-Oriented Learning for Optimal Individualized Treatment RulesDynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning FrameworkModel-Assisted Uniformly Honest Inference for Optimal Treatment Regimes in High DimensionOptimal Treatment Regimes: A Review and Empirical ComparisonA multiagent reinforcement learning framework for off-policy evaluation in two-sided marketsTransformation-Invariant Learning of Optimal Individualized Decision Rules with Time-to-Event OutcomesStatistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite HorizonsNearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset SelectionFlexible inference of optimal individualized treatment strategy in covariate adjusted randomization with multiple covariatesLearning Non-monotone Optimal Individualized Treatment RegimesDynamic treatment regimes using Bayesian additive regression trees for censored outcomesResampling‐based confidence intervals for model‐free robust inference on optimal treatment regimesA Sequential Significance Test for Treatment by Covariate InteractionsUnnamed ItemRegularized outcome weighted subgroup identification for differential treatment effectsSequential Advantage Selection for Optimal Treatment RegimesProper Inference for Value Function in High-Dimensional Q-Learning for Dynamic Treatment RegimesGeneralization error bounds of dynamic treatment regimes in penalized regression-based learningPersonalized Policy Learning Using Longitudinal Mobile Health Data







This page was built for publication: Penalized Q-learning for dynamic treatment regimens