Interactive model building for Q-learning
Publication:2934764
DOI: 10.1093/BIOMET/ASU043
zbMATH Open: 1306.62235
OpenAlex: W2059101466
Wikidata: Q34760298
Scholia: Q34760298
MaRDI QID: Q2934764
FDO: Q2934764
Authors: Eric B. Laber, Kristin A. Linn, Leonard Stefanski
Publication date: 22 December 2014
Published in: Biometrika
Full work available at URL: http://europepmc.org/articles/pmc4274394
Recommendations
- \(Q\)- and \(A\)-learning methods for estimating optimal dynamic treatment regimes
- Robust Q-learning
- Q-learning for estimating optimal dynamic treatment rules from observational data
- A smoothed Q‐learning algorithm for estimating optimal dynamic treatment regimes
- Penalized Q-learning for dynamic treatment regimens
Mathematics Subject Classification
- Applications of statistics to biology and medical sciences; meta analysis (62P10)
- Sequential estimation (62L12)
- General considerations in statistical decision theory (62C05)
Cited In (22)
- Efficient augmentation and relaxation learning for individualized treatment rules using observational data
- The QLBS Q-Learner goes NuQLear: fitted Q iteration, inverse RL, and option portfolios
- High-dimensional inference for personalized treatment decision
- Dynamic treatment regimes: technical challenges and applications
- A semiparametric instrumental variable approach to optimal treatment regimes under endogeneity
- Relative contrast estimation and inference for treatment recommendation
- Improved doubly robust estimation in learning optimal individualized treatment rules
- Adaptive treatment and robust control
- Title not available
- Using decision lists to construct interpretable and parsimonious treatment regimes
- Q-learning with censored data
- Incorporating patient preferences into estimation of optimal individualized treatment rules
- Interpretable dynamic treatment regimes
- Estimating dynamic treatment regimes in mobile health using V-learning
- Constructing dynamic treatment regimes with shared parameters for censored data
- Ascertaining properties of weighting in the estimation of optimal treatment regimes under monotone missingness
- A cure-rate model for Q-learning: estimating an adaptive immunosuppressant treatment strategy for allogeneic hematopoietic cell transplant patients
- Optimal dynamic treatment regimes with survival endpoints: introducing DWSurv in the R package DTRreg
- The optimal dynamic treatment rule superlearner: considerations, performance, and application to criminal justice interventions
- Personalized Policy Learning Using Longitudinal Mobile Health Data
- Robust Q-learning
- Optimal Treatment Regimes: A Review and Empirical Comparison