Model-free \(Q\)-learning designs for linear discrete-time zero-sum games with application to \(H^\infty\) control


Publication:875484

DOI: 10.1016/J.AUTOMATICA.2006.09.019, zbMath: 1137.93321, OpenAlex: W2005437559, MaRDI QID: Q875484

Murad Abu-Khalaf, Asma Al-Tamimi, Frank L. Lewis

Publication date: 13 April 2007

Published in: Automatica

Full work available at URL: https://doi.org/10.1016/j.automatica.2006.09.019






Related Items (87)

Online identifier–actor–critic algorithm for optimal control of nonlinear systems
A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
Output feedback Q-learning for discrete-time linear zero-sum games with application to the \(H_\infty\) control
Three bounded proofs for nonlinear multi‐input multi‐output approximate dynamic programming based on the Lyapunov stability theory
An integrated data-driven Markov parameters sequence identification and adaptive dynamic programming method to design fault-tolerant optimal tracking control for completely unknown model systems
Event-triggered adaptive dynamic programming for discrete-time multi-player games
Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics
Model-free \(H_{\infty }\) control design for unknown linear discrete-time systems via Q-learning with LMI
On the effect of probing noise in optimal control LQR via Q-learning using adaptive filtering algorithms
\(H_\infty\) optimal control of unknown linear systems by adaptive dynamic programming with applications to time‐delay systems
Modified general policy iteration based adaptive dynamic programming for unknown discrete‐time linear systems
Nonlinear discrete time optimal control based on Fuzzy Models
optimal control for semi‐Markov jump linear systems via TP‐free temporal difference () learning
Improved model-free H∞ control for batch processes via off-policy 2D game Q-learning
A new self-learning optimal control laws for a class of discrete-time nonlinear systems based on ESN architecture
Optimal event‐triggered control for C‐T system with asymmetric constraints based on dual heuristic dynamic programing structure
optimal control of unknown continuous time linear periodic systems by adaptive dynamic programming with applications to magnetic attitude control
Complete stability analysis of a heuristic approximate dynamic programming control design
Output‐feedback Q‐learning for discrete‐time linear \(H_\infty\) tracking control: A Stackelberg game approach
Optimal game theoretic solution of the pursuit‐evasion intercept problem using on‐policy reinforcement learning
Model-free finite-horizon optimal control of discrete-time two-player zero-sum games
Optimal output tracking control of linear discrete-time systems with unknown dynamics by adaptive dynamic programming and output feedback
Minimax Q-learning control for linear systems using the Wasserstein metric
Undiscounted reinforcement learning for infinite-time optimal output tracking and disturbance rejection of discrete-time LTI systems with unknown dynamics
Off‐policy model‐based end‐to‐end safe reinforcement learning
Specified convergence rate guaranteed output tracking of discrete-time systems via reinforcement learning
Unnamed Item
An adaptive dynamic programming-based algorithm for infinite-horizon linear quadratic stochastic optimal control problems
Model-based reinforcement learning for approximate optimal regulation
Robust \(H_\infty\) tracking of linear discrete‐time systems using Q‐learning
Non-zero sum Nash Q-learning for unknown deterministic continuous-time linear systems
Data-based \(\mathcal{L}_2\) gain optimal control for discrete-time system with unknown dynamics
An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
Model-free policy iteration approach to NCE-based strategy design for linear quadratic Gaussian games
Reinforcement learning algorithms with function approximation: recent advances and applications
Data‐driven control for networked systems with multiple packet dropouts
Robust optimal control of the multi‐input systems with unknown disturbance based on adaptive integral reinforcement learning Q‐function
Output feedback tracking control of a class of continuous-time nonlinear systems via adaptive dynamic programming approach
Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses
From model-based control to data-driven control: survey, classification and perspective
Designing of robust adaptive passivity-based controller based on reinforcement learning for nonlinear port-Hamiltonian model with disturbance
Reinforcement \(Q\)-learning for optimal tracking control of linear discrete-time systems with unknown dynamics
A Q-learning predictive control scheme with guaranteed stability
Data-driven policy iteration algorithm for continuous-time stochastic linear-quadratic optimal control problems
Finite-horizon optimal control of discrete-time linear systems with completely unknown dynamics using Q-learning
Q-learning for continuous-time linear systems: A model-free infinite horizon optimal control approach
Optimal distributed synchronization control for continuous-time heterogeneous multi-agent differential graphical games
Data-driven approximate value iteration with optimality error bound analysis
\(\mathrm{H}_\infty\) control of linear discrete-time systems: off-policy reinforcement learning
Integral \(Q\)-learning and explorized policy iteration for adaptive optimal control of continuous-time linear systems
Stochastic Optimal Design for Unknown Linear Discrete‐Time System Zero‐Sum Games in Input‐Output form Under Communication Constraints
\( \mathbb{Q} \)-learning algorithm in solving consensusability problem of discrete-time multi-agent systems
Data driven secure control for cyber-physical systems under hybrid attacks: a Stackelberg game approach
A reinforcement learning-based scheme for direct adaptive optimal control of linear stochastic systems
Stability and monotone convergence of generalised policy iteration for discrete-time linear quadratic regulations
Convergence of the standard RLS method and \(UDU^T\) factorisation of covariance matrix for solving the algebraic Riccati equation of the DLQR via heuristic approximate dynamic programming
Multiperson zero-sum differential games for a class of uncertain nonlinear systems
Nearly data-based optimal control for linear discrete model-free systems with delays via reinforcement learning
Value iteration for LQR control of unknown stochastic-parameter linear systems
Reinforcement learning and non-zero-sum game output regulation for multi-player linear uncertain systems
Disturbance compensation based model-free adaptive tracking control for nonlinear systems with unknown disturbance
Infinite time linear quadratic Stackelberg game problem for unknown stochastic discrete-time systems via adaptive dynamic programming approach
Finite-frequency output feedback MPC for Markov jump systems
Finite-horizon Q-learning for discrete-time zero-sum games with application to \(H_{\infty}\) control
Reinforcement learning for inverse linear-quadratic dynamic non-cooperative games
\(\mathcal{L}_2\) gain tracking control of linear completely unknown discrete-time networked control systems with dropout
Input-output data based tracking control under DoS attacks
Robust \(Q\)-learning algorithm for Markov decision processes under Wasserstein uncertainty
Successive over relaxation for model-free LQR control of discrete-time Markov jump systems
Mean field LQG social optimization: a reinforcement learning approach
Adaptive optimal control for continuous-time linear systems based on policy iteration
Model-Free \(H_\infty\) Control Design for Unknown Continuous-Time Linear System Using Adaptive Dynamic Programming
Bias-policy iteration based adaptive dynamic programming for unknown continuous-time linear systems
Off-policy Q-learning: solving Nash equilibrium of multi-player games with network-induced delay and unmeasured state
Design and Comparison Base Analysis of Adaptive Estimator for Completely Unknown Linear Systems in the Presence of OE Noise and Constant Input Time Delay
Linear-quadratic zero-sum mean-field type games: optimality conditions and policy optimization
Q-learning solution for optimal consensus control of discrete-time multiagent systems using reinforcement learning
Adaptive optimal output regulation of linear discrete-time systems based on event-triggered output-feedback
Adaptive optimal tracking controls of unknown multi-input systems based on nonzero-sum game theory
Approximate-optimal control algorithm for constrained zero-sum differential games through event-triggering mechanism
Data-driven control of hydraulic servo actuator based on adaptive dynamic programming
Mixed density methods for approximate dynamic programming
Dissipativity-based verification for autonomous systems in adversarial environments
Multi-agent reinforcement learning: a selective overview of theories and algorithms
Computational intelligence in uncertainty quantification for learning control and differential games
Finite-horizon optimal control for continuous-time uncertain nonlinear systems using reinforcement learning
Multi-objective optimal control for a class of nonlinear time-delay systems via adaptive dynamic programming







