Double reinforcement learning for efficient off-policy evaluation in Markov decision processes

From MaRDI portal
Publication:5148951

MaRDI QIDQ5148951FDOQ5148951


Authors: Nathan Kallus, Masatoshi Uehara Edit this on Wikidata


Publication date: 5 February 2021


Full work available at URL: https://arxiv.org/abs/1908.08526




Recommendations




Cites Work


Cited In (12)





This page was built for publication: Double reinforcement learning for efficient off-policy evaluation in Markov decision processes

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5148951)