Q-learning and policy iteration algorithms for stochastic shortest path problems (Q378731)

From MaRDI portal
Wikidata QID (P12): Q115147448 (normal rank)

Revision as of 23:58, 25 March 2024

Language: English
Label: Q-learning and policy iteration algorithms for stochastic shortest path problems
Description: scientific article

    Statements

    Q-learning and policy iteration algorithms for stochastic shortest path problems (English)
    12 November 2013
    Markov decision processes
    Q-learning
    approximate dynamic programming
    value iteration
    policy iteration
    stochastic shortest paths
    stochastic approximation

    Identifiers