Two-phase selective decentralization to improve reinforcement learning systems with MDP
From MaRDI portal
Publication:5145441
Recommendations
- Partially decentralized reinforcement learning in finite, multi-agent Markov decision processes
- Why the `selfish' optimizing agents could solve the decentralized reinforcement learning problems
- An approximate dynamic programming approach to decentralized control of stochastic systems
- Decentralized learning in finite Markov chains
- scientific article
Cites work
- scientific article; zbMATH DE number 1945830 (Why is no real title available?)
- scientific article; zbMATH DE number 783783 (Why is no real title available?)
- scientific article; zbMATH DE number 795580 (Why is no real title available?)
- scientific article; zbMATH DE number 802915 (Why is no real title available?)
- <formula formulatype="inline"><tex Notation="TeX">$ {\cal H}_{2}$</tex></formula>-Optimal Decentralized Control Over Posets: A State-Space Solution for State-Feedback
- A numerical algorithm for fully nonlinear HJB equations: an approach by control randomization
- An Approximation Theory of Optimal Control for Trainable Manipulators
- An introduction to mechanics.
- Decentralized Learning in Finite Markov Chains: Revisited
- Decentralized Q-Learning for Stochastic Teams and Games
- Decentralized adaptive control of interconnected systems
- Decentralized adaptive control: structural conditions for stability
- Decentralized adaptive controller design for large-scale systems with higher order interconnections
- Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
- Improving transient response of adaptive control systems using multiple models and switching
- Kernel methods in system identification, machine learning and function estimation: a survey
- Model-Free Adaptive Switching Control of Time-Varying Plants
- Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach
- New Concepts in Adaptive Control Using Multiple Models
- On the Theory of Dynamic Programming
- Optimal Decentralized Control of Coupled Subsystems With Control Sharing
- Pattern recognition and machine learning.
- Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers
- Reinforcement learning for adaptive optimal control of unknown continuous-time nonlinear systems with input constraints
- Simulation-based algorithms for Markov decision processes
- Solving Hamilton-Jacobi-Bellman equations by a modified method of characteristics
- System identification. An introduction.
- The Number of Partitions of a Set
- Variable resolution discretization in optimal control
- \({\mathcal Q}\)-learning
Cited in
(1)
This page was built for publication: Two-phase selective decentralization to improve reinforcement learning systems with MDP
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5145441)