Q-learning algorithms with random truncation bounds and applications to effective parallel computing

Publication:946195

DOI: 10.1007/s10957-007-9331-9; zbMATH: 1328.68178; OpenAlex: W2138267312; MaRDI QID: Q946195

L. Y. Wang, G. George Yin, Cheng-Zhong Xu

Publication date: 22 September 2008

Published in: Journal of Optimization Theory and Applications

Full work available at URL: https://doi.org/10.1007/s10957-007-9331-9

zbMATH Keywords

convergence; rate of convergence; recursive algorithms; Q-learning

Mathematics Subject Classification ID

Nonnumerical algorithms (68W05); Learning and adaptive systems in artificial intelligence (68T05); Parallel algorithms in computer science (68W10)
