The Borkar-Meyn theorem for asynchronous stochastic approximations

From MaRDI portal

Revision as of 07:58, 30 January 2024 by Import240129110113 (talk | contribs) (Created automatically from import240129110113)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:553371

Jump to:navigation, search

DOI10.1016/j.sysconle.2011.04.002zbMath1222.93229OpenAlexW1971048324MaRDI QIDQ553371

Shalabh Bhatnagar

Publication date: 27 July 2011

Published in: Systems \& Control Letters (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1016/j.sysconle.2011.04.002

zbMATH Keywords

temporal difference learning asynchronous stochastic approximation with delays the Borkar-Meyn theorem

Mathematics Subject Classification ID

Stochastic stability in control theory (93E15) Stochastic learning and adaptive control (93E35)

Related Items

An online actor-critic algorithm with function approximation for constrained Markov decision processes, Q-learning for Markov decision processes with a satisfiability criterion, Event-driven stochastic approximation, Whittle index based Q-learning for restless bandits with average reward

Cites Work

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:553371&oldid=12446264"