The Borkar-Meyn theorem for asynchronous stochastic approximations (Q553371): Difference between revisions

 
Property / cites work (each at Normal rank):
    The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning
    Q3527701
    Q4346705
    Natural actor-critic algorithms
    An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
    Q4001523
    Asynchronous Stochastic Approximations
    Q4257216
    An analysis of temporal-difference learning with function approximation
    Actor-Critic--Type Learning Algorithms for Markov Decision Processes
    Q4209222

Latest revision as of 07:54, 4 July 2024

Language: English
Label: The Borkar-Meyn theorem for asynchronous stochastic approximations
Description: scientific article

    Statements

    The Borkar-Meyn theorem for asynchronous stochastic approximations (English)
    27 July 2011
    the Borkar-Meyn theorem
    asynchronous stochastic approximation with delays
    temporal difference learning

    Identifiers