Integral reinforcement learning and experience replay for adaptive optimal control of partially-unknown constrained-input continuous-time systems
From MaRDI portal
Publication:463819
DOI10.1016/j.automatica.2013.09.043zbMath1298.49042MaRDI QIDQ463819
Frank L. Lewis, Hamidreza Modares, Mohammad-Bagher Naghibi-Sistani
Publication date: 17 October 2014
Published in: Automatica (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.automatica.2013.09.043
optimal control; neural networks; input constraints; integral reinforcement learning; experience replay
68T05: Learning and adaptive systems in artificial intelligence
49L20: Dynamic programming in optimal control and differential games
92B20: Neural networks for/in biological studies, artificial life and related topics
93C40: Adaptive control/observation systems
Related Items
Online concurrent reinforcement learning algorithm to solve two‐player zero‐sum games for partially unknown nonlinear continuous‐time systems, Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning, Model-based reinforcement learning for approximate optimal regulation, Optimal control of a class of nonlinear stochastic systems
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
- Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses
- Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
- Adaptive optimal control for continuous-time linear systems based on policy iteration
- Real-time reinforcement learning by sequential actor-critics and experience replay
- A novel actor-critic-identifier architecture for approximate optimal control of uncertain nonlinear systems
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- Approximate Dynamic Programming
- Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers