The Borkar-Meyn theorem for asynchronous stochastic approximations
From MaRDI portal
Publication:553371
Recommendations
Cites work
- scientific article; zbMATH DE number 5348356 (Why is no real title available?)
- scientific article; zbMATH DE number 51132 (Why is no real title available?)
- scientific article; zbMATH DE number 1206370 (Why is no real title available?)
- scientific article; zbMATH DE number 1321699 (Why is no real title available?)
- scientific article; zbMATH DE number 1043533 (Why is no real title available?)
- Actor-Critic--Type Learning Algorithms for Markov Decision Processes
- An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
- An analysis of temporal-difference learning with function approximation
- Asynchronous Stochastic Approximations
- Natural actor-critic algorithms
- The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning
Cited in
(9)- Whittle index based Q-learning for restless bandits with average reward
- A generalization of the Borkar-Meyn theorem for stochastic recursive inclusions
- Event-driven stochastic approximation
- An online actor-critic algorithm with function approximation for constrained Markov decision processes
- Asynchronous stochastic approximation and Q-learning
- Asymptotic agreement and convergence of asynchronous stochastic algorithms
- Asynchronous stochastic approximation with differential inclusions
- Q-learning for Markov decision processes with a satisfiability criterion
- An asynchronous stochastic approximation theorem and some applications
This page was built for publication: The Borkar-Meyn theorem for asynchronous stochastic approximations
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q553371)