Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming (Q2884305)

From MaRDI portal
Revision as of 09:15, 22 August 2023 by Importer (talk | contribs) (‎Created a new Item)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
scientific article
Language Label Description Also known as
English
Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
scientific article

    Statements

    Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming (English)
    0 references
    0 references
    0 references
    24 May 2012
    0 references
    Markov decision processes
    0 references
    Q-learning
    0 references
    policy iteration
    0 references
    value iteration stochastic approximation
    0 references
    reinforcement learning
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references