On the Convergence of Stochastic Iterative Dynamic Programming Algorithms (Q4323346): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
ReferenceBot (talk | contribs)
Changed an Item
 
(5 intermediate revisions by 4 users not shown)
Property / author
 
Property / author: Tommi S. Jaakkola / rank
Normal rank
 
Property / author
 
Property / author: Satinder Pal Singh / rank
Normal rank
 
Property / author
 
Property / author: Tommi S. Jaakkola / rank
 
Normal rank
Property / author
 
Property / author: Satinder Pal Singh / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2165131254 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Stochastic Approximation Method / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 10:59, 23 May 2024

scientific article; zbMATH DE number 723814
Language Label Description Also known as
English
On the Convergence of Stochastic Iterative Dynamic Programming Algorithms
scientific article; zbMATH DE number 723814

    Statements

    On the Convergence of Stochastic Iterative Dynamic Programming Algorithms (English)
    0 references
    0 references
    0 references
    0 references
    18 October 1995
    0 references
    \(Q\)-learning algorithm
    0 references

    Identifiers