Data-driven adaptive dynamic programming for partially observable nonzero-sum games via Q-learning method
From MaRDI portal
Publication:5025895
Recommendations
- Off-policy based adaptive dynamic programming method for nonzero-sum games on discrete-time system
- Model-free Q-learning designs for linear discrete-time zero-sum games with application to \(H_\infty\) control
- Non-zero sum Nash Q-learning for unknown deterministic continuous-time linear systems
- Nonzero-sum differential games of continuous-time nonlinear systems with uniformly ultimately \(\varepsilon\)-bounded by adaptive dynamic programming
- Output feedback Q-learning for discrete-time linear zero-sum games with application to the \(H_\infty\) control
Cites work
- scientific article; zbMATH DE number 1243371
- Complete stability analysis of a heuristic approximate dynamic programming control design
- Distributed \(\mathcal H_{\infty}\) state estimation with stochastic parameters and nonlinearities through sensor networks: the finite-horizon case
- Event-triggered consensus control for discrete-time stochastic multi-agent systems: the input-to-state stability in probability
- Finite-Horizon \(\mathcal{H}_{\infty}\) Control for Discrete Time-Varying Systems With Randomly Occurring Nonlinearities and Fading Measurements
- Finite-horizon differential games for missile–target interception system using adaptive dynamic programming with input constraints
- Finite-time synchronization of uncertain coupled switched neural networks under asynchronous switching
- Iterative algorithms for computing the feedback Nash equilibrium point for positive systems
- LMI-based approach to stability analysis for fractional-order neural networks with discrete and distributed delays
- Matrix Riccati equations in control and systems theory
- Model-free control performance improvement using virtual reference feedback tuning and reinforcement Q-learning
- Multi-agent discrete-time graphical games and reinforcement learning solutions
- Multi-player non-zero-sum games: online adaptive learning solution of coupled Hamilton-Jacobi equations
- Nonzero-sum differential games
- Optimal control
- Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming
- Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers
- Reinforcement learning. An introduction
- \({\mathcal Q}\)-learning
Cited in (9)
- Learning output reference model tracking for higher-order nonlinear systems with unknown dynamics
- A model-free deep integral policy iteration structure for robust control of uncertain systems
- Non-zero sum Nash Q-learning for unknown deterministic continuous-time linear systems
- Model-free finite-horizon optimal control of discrete-time two-player zero-sum games
- Output feedback Q-learning for discrete-time linear zero-sum games with application to the \(H_\infty\) control
- Off-policy Q-learning: solving Nash equilibrium of multi-player games with network-induced delay and unmeasured state
- Reinforcement learning and non-zero-sum game output regulation for multi-player linear uncertain systems
- Dynamic Graphical Games:
- Off-policy based adaptive dynamic programming method for nonzero-sum games on discrete-time system