Natural actor-critic algorithms

From MaRDI portal
Publication:1049136


DOI10.1016/j.automatica.2009.07.008zbMath1183.93130MaRDI QIDQ1049136

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Ghavamzadeh, Mark Lee

Publication date: 8 January 2010

Published in: Automatica (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1016/j.automatica.2009.07.008


49L20: Dynamic programming in optimal control and differential games

60J20: Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.)

93E35: Stochastic learning and adaptive control


Related Items



Cites Work