Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health (Q5857153): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Changed an Item
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path / rank
 
Normal rank
Property / cites work
 
Property / cites work: Linear least-squares algorithms for temporal difference learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Statistical methods for dynamic treatment regimes. Reinforcement learning, causal inference, and personalized medicine / rank
 
Normal rank
Property / cites work
 
Property / cites work: The stratified micro-randomized trial design: sample size considerations for testing nested causal effects of time-varying treatments / rank
 
Normal rank
Property / cites work
 
Property / cites work: Series estimation of semilinear models / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2834459 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Model selection in reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4255598 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3266141 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Estimating Dynamic Treatment Regimes in Mobile Health Using V-Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5477863 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Marginal Mean Models for Dynamic Regimes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Support Vector Machines / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4626283 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3655724 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A partially linear framework for massive heterogeneous data / rank
 
Normal rank

Latest revision as of 22:26, 24 July 2024

scientific article; zbMATH DE number 7329750
Language Label Description Also known as
English
Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health
scientific article; zbMATH DE number 7329750

    Statements

    Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health (English)
    0 references
    0 references
    0 references
    0 references
    30 March 2021
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    Markov decision process
    0 references
    policy evaluation
    0 references
    reinforcement learning
    0 references
    sequential decision making
    0 references
    0 references
    0 references