Off-policy Q-learning: solving Nash equilibrium of multi-player games with network-induced delay and unmeasured state (Q2063842)

From MaRDI portal
Revision as of 22:30, 16 December 2024 by Import241208061232 (Normalize DOI.)

Language: English
Label: Off-policy Q-learning: solving Nash equilibrium of multi-player games with network-induced delay and unmeasured state
Description: scientific article
Also known as: (none listed)

    Statements

    Off-policy Q-learning: solving Nash equilibrium of multi-player games with network-induced delay and unmeasured state (English)
    3 January 2022
    adaptive dynamic programming (ADP)
    game theory
    network-induced delay
    off-policy Q-learning
    unmeasured state

    Identifiers