Q-learning solution for optimal consensus control of discrete-time multiagent systems using reinforcement learning

From MaRDI portal

Publication:2318731

Jump to:navigation, search

DOI10.1016/j.jfranklin.2019.06.007zbMath1418.93250WikidataQ127636852 ScholiaQ127636852MaRDI QIDQ2318731

Chaoxu Mu, Zhong-Ke Gao, Qian Zhao, Chang-Yin Sun

Publication date: 16 August 2019

Published in: Journal of the Franklin Institute (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1016/j.jfranklin.2019.06.007

zbMATH Keywords

reinforcement learning; Q-learning; discrete-time multiagent systems; optimal consensus control

Mathematics Subject Classification ID

05C90: Applications of graph theory

93C55: Discrete-time control/observation systems

93D05: Lyapunov and other classical stabilities (Lagrange, Poisson, (L^p, l^p), etc.) in control theory

93A14: Decentralized systems

93D99: Stability of control systems

Related Items

Output-feedback optimized consensus for directed graph multi-agent systems based on reinforcement learning and subsystem error derivatives, Heterogeneous optimal formation control of nonlinear multi-agent systems with unknown dynamics by safe reinforcement learning, Suboptimal consensus protocol design for a class of multiagent systems, Optimal output synchronization of heterogeneous multi-agent systems using measured input-output data, Finite‐horizon H∞ tracking control for discrete‐time linear systems, Efficient off‐policy Q‐learning for multi‐agent systems by solving dual games, Time-varying formation control for nonlinear multi-agent systems against actuator attacks, Adaptive fuzzy sliding-mode consensus control of nonlinear under-actuated agents in a near-optimal reinforcement learning framework, Dual ML-ADHDP method for heterogeneous discrete-time nonlinear multi-agent systems with unknown dynamics and time delay, Model-free finite-horizon optimal tracking control of discrete-time linear systems, Distributed constrained optimization for multi-agent systems over a directed graph with piecewise stepsize, Data-based optimal coordination control of continuous-time nonlinear multi-agent systems via adaptive dynamic programming method, Adaptive output-feedback time-varying formation tracking control for multi-agent systems with switching directed networks, On the effect of probing noise in optimal control LQR via Q-learning using adaptive filtering algorithms

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:2318731&oldid=14914014"