An incremental off-policy search in a model-free Markov decision process using a single sample path


Publication:1621868

DOI: 10.1007/s10994-018-5697-1
zbMath: 1465.90116
arXiv: 1801.10287
OpenAlex: W2963057120
MaRDI QID: Q1621868

Ajin George Joseph, Shalabh Bhatnagar

Publication date: 12 November 2018

Published in: Machine Learning

Full work available at URL: https://arxiv.org/abs/1801.10287






