Q-learning solution for optimal consensus control of discrete-time multiagent systems using reinforcement learning
From MaRDI portal
Publication:2318731
DOI10.1016/j.jfranklin.2019.06.007zbMath1418.93250WikidataQ127636852 ScholiaQ127636852MaRDI QIDQ2318731
Chaoxu Mu, Zhong-Ke Gao, Qian Zhao, Chang-Yin Sun
Publication date: 16 August 2019
Published in: Journal of the Franklin Institute (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.jfranklin.2019.06.007
05C90: Applications of graph theory
93C55: Discrete-time control/observation systems
93D05: Lyapunov and other classical stabilities (Lagrange, Poisson, (L^p, l^p), etc.) in control theory
93A14: Decentralized systems
93D99: Stability of control systems
Related Items
Output-feedback optimized consensus for directed graph multi-agent systems based on reinforcement learning and subsystem error derivatives, Heterogeneous optimal formation control of nonlinear multi-agent systems with unknown dynamics by safe reinforcement learning, Suboptimal consensus protocol design for a class of multiagent systems, Optimal output synchronization of heterogeneous multi-agent systems using measured input-output data, Finite‐horizon H∞ tracking control for discrete‐time linear systems, Efficient off‐policy Q‐learning for multi‐agent systems by solving dual games, Time-varying formation control for nonlinear multi-agent systems against actuator attacks, Adaptive fuzzy sliding-mode consensus control of nonlinear under-actuated agents in a near-optimal reinforcement learning framework, Dual ML-ADHDP method for heterogeneous discrete-time nonlinear multi-agent systems with unknown dynamics and time delay, Model-free finite-horizon optimal tracking control of discrete-time linear systems, Distributed constrained optimization for multi-agent systems over a directed graph with piecewise stepsize, Data-based optimal coordination control of continuous-time nonlinear multi-agent systems via adaptive dynamic programming method, Adaptive output-feedback time-varying formation tracking control for multi-agent systems with switching directed networks, On the effect of probing noise in optimal control LQR via Q-learning using adaptive filtering algorithms
Cites Work
- Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning
- Consensus disturbance rejection for Lipschitz nonlinear multi-agent systems with input delay: a DOBC approach
- Multi-player non-zero-sum games: online adaptive learning solution of coupled Hamilton-Jacobi equations
- Model-free \(Q\)-learning designs for linear discrete-time zero-sum games with application to \(H^\infty\) control
- Complete stability analysis of a heuristic approximate dynamic programming control design
- Trajectory tracking control for rotary steerable systems using interval type-2 fuzzy logic and reinforcement learning
- Online adaptive policy iteration based fault-tolerant control algorithm for continuous-time nonlinear tracking systems with actuator failures
- A boundedness result for the direct heuristic dynamic programming
- Discrete-time consensus strategy for a class of high-order linear multiagent systems under stochastic communication topologies
- Consensus of fractional-order multiagent system via sampled-data event-triggered control
- Multi-agent discrete-time graphical games and reinforcement learning solutions
- Multi-agent differential graphical games: online adaptive learning solution for synchronization with optimality
- Novel iterative neural dynamic programming for data-based approximate optimal control design
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- An Optimal Control Scheme for a Class of Discrete-time Nonlinear Systems with Time Delays Using Adaptive Dynamic Programming
- Discrete-time dynamic graphical games: model-free reinforcement learning solution
- Flocking of Multi-Agent Non-Holonomic Systems With Proximity Graphs
- Information Flow and Cooperative Control of Vehicle Formations
- Consensus Problems in Networks of Agents With Switching Topology and Time-Delays
- Online solution of nonlinear two‐player zero‐sum games using synchronous policy iteration