Why the `selfish' optimizing agents could solve the decentralized reinforcement learning problems
From MaRDI portal
Publication:5145451
Recommendations
- Two-phase selective decentralization to improve reinforcement learning systems with MDP
- Decentralized reinforcement learning of robot behaviors
- scientific article; zbMATH DE number 2243376
- Important scientific problems of multi-agent deep reinforcement learning
- Distributed policy evaluation via inexact ADMM in multi-agent reinforcement learning
Cites work
- scientific article; zbMATH DE number 3763739
- scientific article; zbMATH DE number 783783
- scientific article; zbMATH DE number 802915
- A fully automated recurrent neural network for unknown dynamic system identification and control
- An Approximation Theory of Optimal Control for Trainable Manipulators
- Decentralized adaptive control of interconnected systems
- Decentralized adaptive controller design for large-scale systems with higher order interconnections
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- Optimization of coupled systems - A critical overview of approaches
- Solving Hamilton-Jacobi-Bellman equations by a modified method of characteristics
- \(\mathcal{Q}\)-learning
Cited in (3)