Chaotic dynamics and convergence analysis of temporal difference algorithms with bang-bang control

From MaRDI portal

Publication:2800471

Jump to:navigation, search

DOI10.1002/oca.2156zbMath1353.68243OpenAlexW1520148289MaRDI QIDQ2800471

No author found.

Publication date: 15 April 2016

Published in: Optimal Control Applications and Methods (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1002/oca.2156

zbMATH Keywords

rate of convergence temporal difference learning value function approximation polynomial basis functions badly conditioned learning

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Applications of optimal control and differential games (49N90) Dynamic programming (90C39)

Uses Software

Approxrl

Cites Work

This page was built for publication: Chaotic dynamics and convergence analysis of temporal difference algorithms with bang-bang control

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2800471&oldid=15704010"