Estimating Dynamic Treatment Regimes in Mobile Health Using V-Learning

From MaRDI portal
Publication:5130615

DOI10.1080/01621459.2018.1537919zbMath1445.62279arXiv1611.03531OpenAlexW2963735256WikidataQ99578499 ScholiaQ99578499MaRDI QIDQ5130615

David M. Maahs, Anna R. Kahkoska, Michael R. Kosorok, Elizabeth Mayer-Davis, Eric B. Laber, Daniel J. Luckett

Publication date: 28 October 2020

Published in: Journal of the American Statistical Association (Search for Journal in Brave)

Full work available at URL: https://arxiv.org/abs/1611.03531




Related Items (24)

Data-guided Treatment Recommendation with Feature ScoresEfficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement LearningEstimation of Optimal Individualized Treatment Rules Using a Covariate-Specific Treatment Effect Curve With High-Dimensional CovariatesDynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning FrameworkPersonalized dynamic treatment regimes in continuous time: a Bayesian approach for optimizing clinical decisions with timingOptimal Treatment Regimes: A Review and Empirical ComparisonA multiagent reinforcement learning framework for off-policy evaluation in two-sided marketsStatistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite HorizonsOff-Policy Confidence Interval Estimation with Confounded Markov Decision ProcessEstimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-LearningStatistical Learning for Individualized Asset AllocationOff-policy evaluation in partially observed Markov decision processes under sequential ignorabilityProjected state-action balancing weights for offline reinforcement learningOnline Bootstrap Inference For Policy Evaluation In Reinforcement LearningOff-policy evaluation for tabular reinforcement learning with synthetic trajectoriesStochastic approximation: from statistical origin to big-data, multidisciplinary applicationsA Bayesian time-varying effect model for behavioral mHealth dataUnnamed ItemGeneralization error bounds of dynamic treatment regimes in penalized regression-based learningA Semiparametric Instrumental Variable Approach to Optimal Treatment Regimes Under EndogeneityLearning When-to-Treat PoliciesOff-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile HealthPersonalized Policy Learning Using Longitudinal Mobile Health DataBatch policy learning in average reward Markov decision processes


Uses Software


Cites Work


This page was built for publication: Estimating Dynamic Treatment Regimes in Mobile Health Using V-Learning