Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning


DOI: 10.1287/OPRE.2021.2249
OpenAlex: W2994709386
MaRDI QID: Q5060503
FDO: Q5060503


Authors: Nathan Kallus, Masatoshi Uehara


Publication date: 10 January 2023

Published in: Operations Research

Full work available at URL: https://arxiv.org/abs/1909.05850




Cited In (9)
