Pages that link to "Item:Q5130615"
From MaRDI portal
The following pages link to Estimating Dynamic Treatment Regimes in Mobile Health Using V-Learning (Q5130615):
Displayed 24 items.
- Stochastic approximation: from statistical origin to big-data, multidisciplinary applications (Q2038304) (← links)
- A Bayesian time-varying effect model for behavioral mHealth data (Q2078764) (← links)
- Generalization error bounds of dynamic treatment regimes in penalized regression-based learning (Q2091828) (← links)
- Batch policy learning in average reward Markov decision processes (Q2112817) (← links)
- Data-guided Treatment Recommendation with Feature Scores (Q5040488) (← links)
- Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning (Q5060503) (← links)
- (Q5148951) (← links)
- A Semiparametric Instrumental Variable Approach to Optimal Treatment Regimes Under Endogeneity (Q5857107) (← links)
- Learning When-to-Treat Policies (Q5857115) (← links)
- Estimation of Optimal Individualized Treatment Rules Using a Covariate-Specific Treatment Effect Curve With High-Dimensional Covariates (Q5857150) (← links)
- Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health (Q5857153) (← links)
- Personalized Policy Learning Using Longitudinal Mobile Health Data (Q5857154) (← links)
- Dynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning Framework (Q6077592) (← links)
- Personalized dynamic treatment regimes in continuous time: a Bayesian approach for optimizing clinical decisions with timing (Q6121782) (← links)
- Optimal Treatment Regimes: A Review and Empirical Comparison (Q6131429) (← links)
- A multiagent reinforcement learning framework for off-policy evaluation in two-sided markets (Q6138596) (← links)
- Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons (Q6153987) (← links)
- Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process (Q6153991) (← links)
- Estimating Optimal Infinite Horizon Dynamic Treatment Regimes via pT-Learning (Q6154019) (← links)
- Statistical Learning for Individualized Asset Allocation (Q6154020) (← links)
- Off-policy evaluation in partially observed Markov decision processes under sequential ignorability (Q6183750) (← links)
- Projected state-action balancing weights for offline reinforcement learning (Q6183753) (← links)
- Online Bootstrap Inference For Policy Evaluation In Reinforcement Learning (Q6185586) (← links)
- Off-policy evaluation for tabular reinforcement learning with synthetic trajectories (Q6190662) (← links)