Off-policy Q-learning: solving Nash equilibrium of multi-player games with network-induced delay and unmeasured state
Publication:2063842
DOI: 10.1016/j.automatica.2021.110076 · zbMath: 1480.91015 · OpenAlex: W4200574011 · MaRDI QID: Q2063842
Zhenfei Xiao, Jialu Fan, Jinna Li, Frank L. Lewis, Tian-You Chai
Publication date: 3 January 2022
Published in: Automatica
Full work available at URL: https://doi.org/10.1016/j.automatica.2021.110076
Keywords: game theory; network-induced delay; adaptive dynamic programming (ADP); off-policy Q-learning; unmeasured state
Related Items (2)
- Heterogeneous multi-player imitation learning
- Two networked predictive control methods for output tracking of networked systems with plant-model mismatch
Cites Work
- Unnamed Item
- Multi-agent zero-sum differential graphical games for disturbance rejection in distributed control
- Optimal model-free output synchronization of heterogeneous systems using off-policy reinforcement learning
- \(\mathrm{H}_\infty\) control of linear discrete-time systems: off-policy reinforcement learning
- Multi-player non-zero-sum games: online adaptive learning solution of coupled Hamilton-Jacobi equations
- Model-free \(Q\)-learning designs for linear discrete-time zero-sum games with application to \(H^\infty\) control
- Non-zero sum Nash Q-learning for unknown deterministic continuous-time linear systems
- Optimal linear filtering for networked control systems with time-correlated fading channels
- Cooperative adaptive optimal output regulation of nonlinear discrete-time multi-agent systems
- Cascade structure predictive observer design for consensus control with applications to UAVs formation flying
- Periodic event-triggered control for networked control systems based on non-monotonic Lyapunov functions
- \(\mathcal{L}_p\) stability of networked control systems implemented on WirelessHART
- Reinforcement learning and non-zero-sum game output regulation for multi-player linear uncertain systems
- Linear Quadratic Tracking Control of Partially-Unknown Continuous-Time Systems Using Reinforcement Learning
- Leader-to-Formation Stability of Multiagent Systems: An Adaptive Optimal Control Approach
- Distributed data‐driven observer for linear time invariant systems
- Self-Learning Optimal Regulation for Discrete-Time Nonlinear Systems Under Event-Driven Formulation
- Hybrid online learning control in networked multiagent systems: A survey