A tutorial survey of reinforcement learning
From MaRDI portal
Publication:5955768
DOI10.1007/BF02743935zbMath1026.93520MaRDI QIDQ5955768
No author found.
Publication date: 18 February 2002
Published in: Sādhanā (Search for Journal in Brave)
Learning and adaptive systems in artificial intelligence (68T05) Stochastic learning and adaptive control (93E35)
Related Items
Stochastic approximation with two time scales ⋮ From Reinforcement Learning to Deep Reinforcement Learning: An Overview ⋮ The actor-critic algorithm as multi-time-scale stochastic approximation. ⋮ Stochastic approximation algorithms: overview and recent trends.
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Associative search network: A reinforcement learning associative memory
- Landmark learning: An illustration of associative search
- Practical issues in temporal difference learning
- \({\mathcal Q}\)-learning
- Transfer of learning by composing solutions of elemental sequential tasks
- Real-time heuristic search
- A Survey of Some Results in Stochastic Adaptive Control
- Distributed dynamic programming
- A New Approach to Manipulator Control: The Cerebellar Model Articulation Controller (CMAC)
- Pattern-recognizing stochastic learning automata