On the Convergence of Stochastic Iterative Dynamic Programming Algorithms (Q4323346)

From MaRDI portal
Property / DOI: 10.1162/neco.1994.6.6.1185
Property / author: Tommi S. Jaakkola
Property / author: Satinder Pal Singh
Property / MaRDI profile type: MaRDI publication profile
Property / OpenAlex ID: W2165131254
Property / cites work: A Stochastic Approximation Method

Latest revision as of 19:26, 29 December 2024

Language: English
Label: On the Convergence of Stochastic Iterative Dynamic Programming Algorithms
Description: scientific article; zbMATH DE number 723814

    Statements

    On the Convergence of Stochastic Iterative Dynamic Programming Algorithms (English)
    18 October 1995
    \(Q\)-learning algorithm

    Identifiers