Q-learning and policy iteration algorithms for stochastic shortest path problems (Q378731)

From MaRDI portal
Revision as of 15:46, 9 December 2024 by Import241208061232 (talk | contribs) (Normalize DOI.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
scientific article
Language Label Description Also known as
English
Q-learning and policy iteration algorithms for stochastic shortest path problems
scientific article

    Statements

    Q-learning and policy iteration algorithms for stochastic shortest path problems (English)
    0 references
    0 references
    0 references
    12 November 2013
    0 references
    Markov decision processes
    0 references
    Q-learning
    0 references
    approximate dynamic programming
    0 references
    value iteration
    0 references
    policy iteration
    0 references
    stochastic shortest paths
    0 references
    stochastic approximation
    0 references

    Identifiers