A projected primal-dual gradient optimal control method for deep reinforcement learning

From MaRDI portal

Publication:1980960

Jump to:navigation, search

DOI10.1186/S13362-020-00075-3MaRDI QIDQ1980960zbMATH OpenOpenAlexFDO

Authors Simon Gottschalk, Michael Burger, Matthias Gerdts

Publication date 9 September 2021

Published in Journal of Mathematics in Industry (Search for Journal in Brave)

Copyright license Creative Commons Attribution 4.0 International

Full work available at URL https://doi.org/10.1186/s13362-020-00075-3

zbMATH Keywords

neural networks optimal control necessary optimality conditions reinforcement learning Markov Decision Process

Mathematics Subject Classification ID

Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20) Optimality conditions for problems involving ordinary differential equations (49K15) Markov and semi-Markov decision processes (90C40) Stochastic learning and adaptive control (93E35) Networks and circuits as models of computation; circuit complexity (68Q06)

Recommendations

Cites work

Cited in

(5)

This page was built for publication: A projected primal-dual gradient optimal control method for deep reinforcement learning

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1980960)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:1980960&oldid=14444821"