An analysis of temporal-difference learning with function approximation

From MaRDI portal
Publication:4362297

DOI10.1109/9.580874zbMATH Open0914.93075OpenAlexW2139418546MaRDI QIDQ4362297FDOQ4362297


Authors: Benjamin Van Roy, John N. Tsitsiklis Edit this on Wikidata


Publication date: 6 May 1999

Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1109/9.580874




Recommendations





Cited In (97)





This page was built for publication: An analysis of temporal-difference learning with function approximation

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4362297)