The Borkar-Meyn theorem for asynchronous stochastic approximations
From MaRDI portal
Publication:553371
DOI10.1016/j.sysconle.2011.04.002zbMath1222.93229OpenAlexW1971048324MaRDI QIDQ553371
Publication date: 27 July 2011
Published in: Systems \& Control Letters (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.sysconle.2011.04.002
temporal difference learningasynchronous stochastic approximation with delaysthe Borkar-Meyn theorem
Related Items
An online actor-critic algorithm with function approximation for constrained Markov decision processes, Q-learning for Markov decision processes with a satisfiability criterion, Event-driven stochastic approximation, Whittle index based Q-learning for restless bandits with average reward
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
- Natural actor-critic algorithms
- An analysis of temporal-difference learning with function approximation
- Asynchronous Stochastic Approximations
- Actor-Critic--Type Learning Algorithms for Markov Decision Processes
- The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning