Two-phase selective decentralization to improve reinforcement learning systems with MDP (Q5145441)

scientific article; zbMATH DE number 7298906

Language	Label	Description	Also known as
default for all languages	No label defined
English	Two-phase selective decentralization to improve reinforcement learning systems with MDP	scientific article; zbMATH DE number 7298906

Statements

instance of

scholarly article

0 references

title

Two-phase selective decentralization to improve reinforcement learning systems with MDP (English)

0 references

published in

AI Communications

0 references

publication date

20 January 2021

0 references

zbMATH Keywords

decentralized control

0 references

Hamilton-Jacobi-Bellman equation

0 references

Markov process

0 references

multi-agent systems

0 references

describes a project that uses

PRMLT

0 references

RICPAC

0 references

MaRDI profile type

MaRDI publication profile

0 references

cites work

Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach

0 references

Decentralized Q-Learning for Stochastic Teams and Games

0 references

Model-Free Adaptive Switching Control of Time-Varying Plants

0 references

Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation

0 references

On the Theory of Dynamic Programming

0 references

Pattern recognition and machine learning.

0 references

Decentralized Learning in Finite Markov Chains: Revisited

0 references

Simulation-based algorithms for Markov decision processes

0 references

Decentralized adaptive control: structural conditions for stability

0 references

New Concepts in Adaptive Control Using Multiple Models

0 references

Solving Hamilton-Jacobi-Bellman equations by a modified method of characteristics

0 references

Decentralized adaptive control of interconnected systems

0 references

System identification. An introduction.

0 references

A numerical algorithm for fully nonlinear HJB equations: an approach by control randomization

0 references

An introduction to mechanics.

0 references

Q4850020

0 references

Q4410591

0 references

Reinforcement Learning and Feedback Control: Using Natural Decision Methods to Design Optimal Adaptive Controllers

0 references

Optimal Decentralized Control of Coupled Subsystems With Control Sharing

0 references

Q4845461

0 references

Variable resolution discretization in optimal control

0 references

Improving transient response of adaptive control systems using multiple models and switching

0 references

Kernel methods in system identification, machine learning and function estimation: a survey

0 references

The Number of Partitions of a Set

0 references

Q4843187

0 references

An Approximation Theory of Optimal Control for Trainable Manipulators

0 references

$\mathcal{H}_{2}$-Optimal Decentralized Control Over Posets: A State-Space Solution for State-Feedback

0 references

Decentralized adaptive controller design for large-scale systems with higher order interconnections

0 references

$\mathcal{Q}$-learning

0 references

Reinforcement learning for adaptive optimal control of unknown continuous-time nonlinear systems with input constraints

0 references

full work available at URL

https://doi.org/10.3233/aic-180766

0 references

Identifiers

zbMATH Open document ID

1467.93017

0 references

DOI

10.3233/AIC-180766

0 references

Mathematics Subject Classification ID

0 references

Sitelinks

Mathematics (1 entry)

mardi Publication:5145441