scientific article; zbMATH DE number 7306868
From MaRDI portal
Publication:5148951
Nathan Kallus, Masatoshi Uehara
Publication date: 5 February 2021
Full work available at URL: https://arxiv.org/abs/1908.08526
Title: zbMATH Open Web Interface contents unavailable due to conflicting licenses.
Related Items
Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning, Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons, Off-policy evaluation in partially observed Markov decision processes under sequential ignorability, Projected state-action balancing weights for offline reinforcement learning, Settling the sample complexity of model-based offline reinforcement learning, Unnamed Item, Toward theoretical understandings of robust Markov decision processes: sample complexity and asymptotics, Batch policy learning in average reward Markov decision processes
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Large Sample Properties of Generalized Method of Moments Estimators
- Doubly robust policy evaluation and optimization
- Statistical methods for dynamic treatment regimes. Reinforcement learning, causal inference, and personalized medicine
- The semiparametric efficiency bound for models of sequential moment restrictions containing unknown functions
- On differentiable functionals
- Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
- Consistent estimation of the influence function of locally asymptotically linear estimators
- A matrix extension of the Cauchy-Schwarz inequality
- The use of polynomial splines and their tensor products in multivariate function estimation. (With discussion)
- Efficient estimation of panel data models with sequential moment restrictions
- On methods of sieves and penalization
- Unified methods for censored longitudinal data and causality
- Introduction to empirical processes and semiparametric inference
- Semiparametric theory and missing data.
- Local Rademacher complexities
- Q( $$\lambda $$ ) with Off-Policy Corrections
- Bias and Variance Approximation in Value Function Estimates
- Asymptotic Statistics
- Marginal Mean Models for Dynamic Regimes
- On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects
- Adjusting for Nonignorable Drop-Out Using Semiparametric Nonresponse Models
- Constructing dynamic treatment regimes over indefinite time horizons
- Optimal Dynamic Treatment Regimes
- A new approach to causal inference in mortality studies with a sustained exposure period—application to control of the healthy worker survivor effect
- 10.1162/1532443041827907
- Double/debiased machine learning for treatment and structural parameters
- Robust inference on the average treatment effect using the outcome highly adaptive lasso
- Estimating Dynamic Treatment Regimes in Mobile Health Using V-Learning
- Robust estimation of optimal dynamic treatment regimes for sequential treatment decisions
- Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score
- Efficient Estimation of Models with Conditional Moment Restrictions Containing Unknown Functions