Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming (Q2884305): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
Import241208061232 (talk | contribs)
Normalize DOI.
 
(3 intermediate revisions by 3 users not shown)
Property / DOI
 
Property / DOI: 10.1287/moor.1110.0532 / rank
Normal rank
 
Property / MaRDI profile type
 
Property / MaRDI profile type: Publication / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2148864095 / rank
 
Normal rank
Property / DOI
 
Property / DOI: 10.1287/MOOR.1110.0532 / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 03:12, 20 December 2024

scientific article
Language Label Description Also known as
English
Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
scientific article

    Statements

    Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming (English)
    0 references
    0 references
    0 references
    24 May 2012
    0 references
    Markov decision processes
    0 references
    Q-learning
    0 references
    policy iteration
    0 references
    value iteration stochastic approximation
    0 references
    reinforcement learning
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references