Two-phase selective decentralization to improve reinforcement learning systems with MDP (Q5145441)

From MaRDI portal
scientific article; zbMATH DE number 7298906
Language Label Description Also known as
English
Two-phase selective decentralization to improve reinforcement learning systems with MDP
scientific article; zbMATH DE number 7298906

    Statements

    Two-phase selective decentralization to improve reinforcement learning systems with MDP (English)
    0 references
    0 references
    20 January 2021
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    decentralized control
    0 references
    Hamilton-Jacobi-Bellman equation
    0 references
    Markov process
    0 references
    multi-agent systems
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references