The Borkar-Meyn theorem for asynchronous stochastic approximations
From MaRDI portal
Publication:553371
DOI10.1016/J.SYSCONLE.2011.04.002zbMATH Open1222.93229OpenAlexW1971048324MaRDI QIDQ553371FDOQ553371
Authors: Shalabh Bhatnagar
Publication date: 27 July 2011
Published in: Systems \& Control Letters (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/j.sysconle.2011.04.002
Recommendations
asynchronous stochastic approximation with delaystemporal difference learningthe Borkar-Meyn theorem
Cites Work
- Title not available (Why is that?)
- Title not available (Why is that?)
- Title not available (Why is that?)
- Title not available (Why is that?)
- Title not available (Why is that?)
- Natural actor-critic algorithms
- Actor-Critic--Type Learning Algorithms for Markov Decision Processes
- An analysis of temporal-difference learning with function approximation
- Asynchronous Stochastic Approximations
- The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning
- An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes
Cited In (8)
- Whittle index based Q-learning for restless bandits with average reward
- Event-driven stochastic approximation
- An online actor-critic algorithm with function approximation for constrained Markov decision processes
- Asynchronous stochastic approximation with differential inclusions
- Asymptotic agreement and convergence of asynchronous stochastic algorithms
- Asynchronous stochastic approximation and Q-learning
- Q-learning for Markov decision processes with a satisfiability criterion
- An asynchronous stochastic approximation theorem and some applications
This page was built for publication: The Borkar-Meyn theorem for asynchronous stochastic approximations
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q553371)