Two-phase selective decentralization to improve reinforcement learning systems with MDP

DOI: 10.3233/AIC-180766; MaRDI QID: Q5145441

Authors

Publication date 20 January 2021

Published in AI Communications

Full work available at URL https://doi.org/10.3233/aic-180766

Markov process; decentralized control; Hamilton-Jacobi-Bellman equation; multi-agent systems

Markov and semi-Markov decision processes (90C40); Decentralized systems (93A14); Linear systems in control theory (93C05); Nonlinear systems in control theory (93C10); Multi-agent systems (93A16)

Recommendations

Partially decentralized reinforcement learning in finite, multi-agent Markov decision processes
Why the 'selfish' optimizing agents could solve the decentralized reinforcement learning problems
An approximate dynamic programming approach to decentralized control of stochastic systems
Decentralized learning in finite Markov chains

Cites work

scientific article; zbMATH DE number 1945830
scientific article; zbMATH DE number 783783
scientific article; zbMATH DE number 795580
scientific article; zbMATH DE number 802915
$\mathcal{H}_{2}$-Optimal Decentralized Control Over Posets: A State-Space Solution for State-Feedback
A numerical algorithm for fully nonlinear HJB equations: an approach by control randomization
An Approximation Theory of Optimal Control for Trainable Manipulators
An introduction to mechanics.
Decentralized Learning in Finite Markov Chains: Revisited
Decentralized Q-Learning for Stochastic Teams and Games
Decentralized adaptive control of interconnected systems
Decentralized adaptive control: structural conditions for stability
Decentralized adaptive controller design for large-scale systems with higher order interconnections
Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
Improving transient response of adaptive control systems using multiple models and switching
Kernel methods in system identification, machine learning and function estimation: a survey
Model-Free Adaptive Switching Control of Time-Varying Plants
Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
New Concepts in Adaptive Control Using Multiple Models
On the Theory of Dynamic Programming
Optimal Decentralized Control of Coupled Subsystems With Control Sharing
Pattern recognition and machine learning.
Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers
Reinforcement learning for adaptive optimal control of unknown continuous-time nonlinear systems with input constraints
Simulation-based algorithms for Markov decision processes
Solving Hamilton-Jacobi-Bellman equations by a modified method of characteristics
System identification. An introduction.
The Number of Partitions of a Set
Variable resolution discretization in optimal control
${\mathcal Q}$-learning

Cited in

(1)

Why the 'selfish' optimizing agents could solve the decentralized reinforcement learning problems

Describes a project that uses

Uses Software

PRMLT
RICPAC
