A tutorial survey of reinforcement learning

Publication:5955768

DOI: 10.1007/BF02743935 ⋮ zbMath: 1026.93520 ⋮ MaRDI QID: Q5955768

No author found.

Publication date: 18 February 2002

Published in: Sādhanā

zbMATH Keywords

optimal control ⋮ dynamic programming ⋮ neural networks ⋮ reinforcement learning ⋮ model-free methods

Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) ⋮ Stochastic learning and adaptive control (93E35)

Related Items

Stochastic approximation with two time scales ⋮ From Reinforcement Learning to Deep Reinforcement Learning: An Overview ⋮ The actor-critic algorithm as multi-time-scale stochastic approximation. ⋮ Stochastic approximation algorithms: overview and recent trends.
