Q-learning and policy iteration algorithms for stochastic shortest path problems (Q378731)

From MaRDI portal
Revision as of 23:58, 25 March 2024 by Daniel (talk | contribs) (‎Created claim: Wikidata QID (P12): Q115147448, #quickstatements; #temporary_batch_1711407341029)
scientific article
Language Label Description Also known as
English
Q-learning and policy iteration algorithms for stochastic shortest path problems
scientific article

    Statements

    Q-learning and policy iteration algorithms for stochastic shortest path problems (English)
    0 references
    0 references
    0 references
    12 November 2013
    0 references
    Markov decision processes
    0 references
    Q-learning
    0 references
    approximate dynamic programming
    0 references
    value iteration
    0 references
    policy iteration
    0 references
    stochastic shortest paths
    0 references
    stochastic approximation
    0 references

    Identifiers