Estimating Dynamic Treatment Regimes in Mobile Health Using V-Learning

From MaRDI portal

Publication:5130615

Jump to:navigation, search

DOI10.1080/01621459.2018.1537919zbMath1445.62279arXiv1611.03531OpenAlexW2963735256WikidataQ99578499 ScholiaQ99578499MaRDI QIDQ5130615

David M. Maahs, Anna R. Kahkoska, Michael R. Kosorok, Elizabeth Mayer-Davis, Eric B. Laber, Daniel J. Luckett

Publication date: 28 October 2020

Published in: Journal of the American Statistical Association (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1611.03531

zbMATH Keywords

Markov decision processes reinforcement learning type 1 diabetes precision medicine

Mathematics Subject Classification ID

Applications of statistics to biology and medical sciences; meta analysis (62P10) Nonparametric estimation (62G05) Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20) Sequential estimation (62L12) General considerations in statistical decision theory (62C05)

Related Items (24)

Data-guided Treatment Recommendation with Feature Scores ⋮ Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning ⋮ Estimation of Optimal Individualized Treatment Rules Using a Covariate-Specific Treatment Effect Curve With High-Dimensional Covariates ⋮ Dynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning Framework ⋮ Personalized dynamic treatment regimes in continuous time: a Bayesian approach for optimizing clinical decisions with timing ⋮ Optimal Treatment Regimes: A Review and Empirical Comparison ⋮ A multiagent reinforcement learning framework for off-policy evaluation in two-sided markets ⋮ Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons ⋮ Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process ⋮ Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning ⋮ Statistical Learning for Individualized Asset Allocation ⋮ Off-policy evaluation in partially observed Markov decision processes under sequential ignorability ⋮ Projected state-action balancing weights for offline reinforcement learning ⋮ Online Bootstrap Inference For Policy Evaluation In Reinforcement Learning ⋮ Off-policy evaluation for tabular reinforcement learning with synthetic trajectories ⋮ Stochastic approximation: from statistical origin to big-data, multidisciplinary applications ⋮ A Bayesian time-varying effect model for behavioral mHealth data ⋮ Unnamed Item ⋮ Generalization error bounds of dynamic treatment regimes in penalized regression-based learning ⋮ A Semiparametric Instrumental Variable Approach to Optimal Treatment Regimes Under Endogeneity ⋮ Learning When-to-Treat Policies ⋮ Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health ⋮ Personalized Policy Learning Using Longitudinal Mobile Health Data ⋮ Batch policy learning in average reward Markov decision processes

Uses Software

Cites Work

This page was built for publication: Estimating Dynamic Treatment Regimes in Mobile Health Using V-Learning

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5130615&oldid=19663393"