Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health (Q5857153): Difference between revisions

From MaRDI portal
RedirectionBot
Removed claim: author (P16): Item:Q548553
ReferenceBot
Changed an Item
 
(5 intermediate revisions by 5 users not shown)
Property / author: Susan A. Murphy / rank: Normal rank
Property / MaRDI profile type: MaRDI publication profile / rank: Normal rank
Property / OpenAlex ID: W3081008408 / rank: Normal rank
Property / arXiv ID: 1912.13088 / rank: Normal rank
Property / cites work: Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path / rank: Normal rank
Property / cites work: Linear least-squares algorithms for temporal difference learning / rank: Normal rank
Property / cites work: Statistical methods for dynamic treatment regimes. Reinforcement learning, causal inference, and personalized medicine / rank: Normal rank
Property / cites work: The stratified micro-randomized trial design: sample size considerations for testing nested causal effects of time-varying treatments / rank: Normal rank
Property / cites work: Series estimation of semilinear models / rank: Normal rank
Property / cites work: Q2834459 / rank: Normal rank
Property / cites work: Model selection in reinforcement learning / rank: Normal rank
Property / cites work: Q4255598 / rank: Normal rank
Property / cites work: Q3266141 / rank: Normal rank
Property / cites work: Estimating Dynamic Treatment Regimes in Mobile Health Using V-Learning / rank: Normal rank
Property / cites work: Q5477863 / rank: Normal rank
Property / cites work: Marginal Mean Models for Dynamic Regimes / rank: Normal rank
Property / cites work: Q4315289 / rank: Normal rank
Property / cites work: Support Vector Machines / rank: Normal rank
Property / cites work: Q4626283 / rank: Normal rank
Property / cites work: Q3655724 / rank: Normal rank
Property / cites work: A partially linear framework for massive heterogeneous data / rank: Normal rank

Latest revision as of 21:26, 24 July 2024

scientific article; zbMATH DE number 7329750
Language: English
Label: Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health
Description: scientific article; zbMATH DE number 7329750

    Statements

    Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health (English)
    30 March 2021
    Markov decision process
    policy evaluation
    reinforcement learning
    sequential decision making
