On policy iteration-based discounted optimal control
Publication:6496275
DOI: 10.1002/RNC.7245; MaRDI QID: Q6496275
Wei-Dong Zhang, Botao Dong, Xiwen Ma, Longyang Huang
Publication date: 3 May 2024
Published in: International Journal of Robust and Nonlinear Control
Linear systems in control theory (93C05); Existence theories for optimal control problems involving ordinary differential equations (49J15); Control/observation systems governed by ordinary differential equations (93C15)
Cites Work
- Optimal model-free output synchronization of heterogeneous systems using off-policy reinforcement learning
- Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design
- Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics
- Stabilization with discounted optimal control
- Optimal control of Boolean control networks with average cost: a policy iteration approach
- Bias-policy iteration based adaptive dynamic programming for unknown continuous-time linear systems
- Model-free \(H_\infty\) tracking control for de-oiling hydrocyclone systems via off-policy reinforcement learning
- Homotopic policy iteration-based learning design for unknown linear continuous-time systems
- Linear Quadratic Tracking Control of Partially-Unknown Continuous-Time Systems Using Reinforcement Learning
- Approximate Dynamic Programming
- Finite-Horizon Discounted Optimal Control: Stability and Performance
- Reinforcement learning for robust stabilization of nonlinear systems with asymmetric saturating actuators
- Neural optimal tracking control of constrained nonaffine systems with a wastewater treatment application
- \(H_\infty\) optimal control of unknown linear systems by adaptive dynamic programming with applications to time-delay systems
- Event-triggered \(H_\infty\) consensus for uncertain nonlinear systems using integral sliding mode based adaptive dynamic programming
- Robust optimal tracking control for multiplayer systems by off-policy Q-learning approach