On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems (Q5169662): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
Set OpenAlex properties.
 
(One intermediate revision by one other user not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2161950876 / rank
 
Normal rank

Latest revision as of 19:14, 19 March 2024

scientific article; zbMATH DE number 6316326
Language Label Description Also known as
English
On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems
scientific article; zbMATH DE number 6316326

    Statements

    On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems (English)
    0 references
    0 references
    0 references
    11 July 2014
    0 references
    Markov decision processes
    0 references
    Q-learning
    0 references
    stochastic approximation
    0 references
    dynamic programming
    0 references
    reinforcement learning
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references