Temporal difference-based policy iteration for optimal control of stochastic systems
From MaRDI portal
Keywords: stochastic optimal control; approximate dynamic programming; learning algorithms; discrete-time systems; least squares policy evaluation algorithm
MSC classification: Dynamic programming (90C39); Existence of optimal solutions to problems involving randomness (49J55); Dynamic programming in optimal control and differential games (49L20); Discrete-time control/observation systems (93C55); Stochastic systems in control theory (general) (93E03); Optimal stochastic control (93E20)
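The keywords above reference temporal-difference learning and least-squares policy evaluation. As an illustrative sketch only (not the algorithm of the indexed publication), tabular TD(0) policy evaluation on a small finite Markov chain with an assumed discounted-reward setup looks like this:

```python
import random

def td0_value_estimate(transitions, rewards, gamma=0.9, alpha=0.1,
                       steps=2000, seed=0):
    """Tabular TD(0) policy evaluation on a finite Markov chain.

    transitions[s] is a list of (next_state, probability) pairs;
    rewards[s] is the immediate reward received on leaving state s.
    Illustrative sketch only -- not the method of the cited paper.
    """
    rng = random.Random(seed)
    V = [0.0] * len(transitions)
    s = 0
    for _ in range(steps):
        # Sample the next state from the chain's transition distribution.
        r = rng.random()
        cum = 0.0
        for s_next, p in transitions[s]:
            cum += p
            if r <= cum:
                break
        # TD(0) update: move V[s] toward the bootstrapped target.
        V[s] += alpha * (rewards[s] + gamma * V[s_next] - V[s])
        s = s_next
    return V

# Hypothetical two-state chain: each state switches with probability 0.7;
# only state 0 yields reward, so V[0] should exceed V[1].
transitions = [[(0, 0.3), (1, 0.7)], [(0, 0.7), (1, 0.3)]]
rewards = [1.0, 0.0]
V = td0_value_estimate(transitions, rewards)
```

Under these assumed dynamics the fixed point satisfies V = r + gamma * P * V, so the estimates for both states converge toward roughly 5.4 and 4.6; a policy iteration scheme would alternate such an evaluation step with greedy policy improvement.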
Recommendations
- Potential-based least-squares policy iteration for a parameterized feedback control system
- Continuous-time Markov decision processes with nonzero terminal reward
- Approximation, estimation and control of stochastic systems under a randomized discounted cost criterion
- Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming
- The complexity of dynamic programming
- An optimal one-way multigrid algorithm for discrete-time stochastic control
- Least squares policy evaluation algorithms with linear function approximation
- Approximate Q Learning for Controlled Diffusion Processes and Its Near Optimality
- Approximation of optimal feedback control: a dynamic programming approach
- Stable Optimal Control and Semicontractive Dynamic Programming
Cites work
- scientific article; zbMATH DE number 3126094
- scientific article; zbMATH DE number 1321699
- 10.1162/1532443041827907
- A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
- A unified approach to Markov decision problems and performance sensitivity analysis
- A unified approach to Markov decision problems and performance sensitivity analysis with discounted and average criteria: multichain cases
- An analysis of temporal-difference learning with function approximation
- Approximate Dynamic Programming
- Approximate policy iteration: a survey and some new methods
- Convergence Results for Some Temporal Difference Methods Based on Least Squares
- Least squares policy evaluation algorithms with linear function approximation
- Linear least-squares algorithms for temporal difference learning
- Markov chains and stochastic stability
- On the use of the deterministic Lyapunov function for the ergodicity of stochastic difference equations
- Perturbation realization, potentials, and sensitivity analysis of Markov processes
- Policy iteration based feedback control
- Potential-Based Online Policy Iteration Algorithms for Markov Decision Processes
- Projected equation methods for approximate solution of large linear systems
- Single sample path-based optimization of Markov chains
- Stochastic control via direct comparison
Cited in (10)
- Stochastic linear quadratic optimal control for continuous-time systems based on policy iteration
- Suboptimal control for nonlinear systems with disturbance via integral sliding mode control and policy iteration
- A switching control strategy for policy selection in stochastic dynamic programming problems
- Stochastic control via direct comparison
- Potential-based least-squares policy iteration for a parameterized feedback control system
- On policy iteration-based discounted optimal control
- Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming
- Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning
- Policy iteration based feedback control
- A least squares temporal difference actor–critic algorithm with applications to warehouse management
This page was built for publication: Temporal difference-based policy iteration for optimal control of stochastic systems
MaRDI item Q467477