On the worst-case analysis of temporal-difference learning algorithms

From MaRDI portal

Jump to:navigation, search

DOI10.1007/BF00114725MaRDI QIDQ1911342zbMATH OpenOpenAlexFDO

Authors Robert E. Schapire, Manfred K. Warmuth

Publication date 21 April 1996

Published in Machine Learning (Search for Journal in Brave)

Full work available at URL https://doi.org/10.1007/bf00114725

zbMATH Keywords

learning algorithms Sutton's method of temporal differences

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)

Recommendations

Cites work

Cited in

(10)

This page was built for publication: On the worst-case analysis of temporal-difference learning algorithms

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1911342)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=On_the_worst-case_analysis_of_temporal-difference_learning_algorithms&oldid=73954095"