Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning (Q5060503): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
Changed an Item
Property / describes a project that uses
 
Property / describes a project that uses: OpenAI Gym / rank
 
Normal rank

Revision as of 16:11, 29 February 2024

scientific article; zbMATH DE number 7640294
Language Label Description Also known as
English
Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning
scientific article; zbMATH DE number 7640294

    Statements

    Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning (English)
    0 references
    0 references
    0 references
    10 January 2023
    0 references
    0 references
    off-policy evaluation
    0 references
    Markov decision processes
    0 references
    infinite horizon
    0 references
    semiparametric efficiency
    0 references