Asynchronous Stochastic Approximations
From MaRDI portal
Publication:4388937
DOI10.1137/S0363012995282784zbMATH Open0922.62081OpenAlexW2080631849MaRDI QIDQ4388937FDOQ4388937
Authors: Vivek Borkar
Publication date: 10 May 1998
Published in: SIAM Journal on Control and Optimization (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1137/s0363012995282784
Recommendations
Cited In (29)
- Asymptotics of Reinforcement Learning with Neural Networks
- Whittle index based Q-learning for restless bandits with average reward
- Fully asynchronous stochastic coordinate descent: a tight lower bound on the parallelism achieving linear speedup
- Reinforcement learning for long-run average cost.
- The Borkar-Meyn theorem for asynchronous stochastic approximations
- Iterative learning control for large scale nonlinear systems with observation noise
- Partially asynchronous co-state prediction algorithm
- A stochastic gradient type algorithm for closed-loop problems
- Value iteration and adaptive dynamic programming for data-driven adaptive optimal control design
- Event-driven stochastic approximation
- Asymptotic behavior of asynchronous stochastic approximation
- An online actor-critic algorithm with function approximation for constrained Markov decision processes
- Nonlinear gossip
- Charge-based control of DiffServ-like queues
- Convergence analysis of contrastive divergence algorithm based on gradient method with errors
- Stochastic fictitious play with continuous action sets
- Distributed time synchronization for networks with random delays and measurement noise
- Asynchronous stochastic approximation with differential inclusions
- Asymptotic agreement and convergence of asynchronous stochastic algorithms
- Convergence rates for stochastic approximation: biased noise with unbounded variance, and applications
- A note on the effect of asynchronous sampling on estimation accuracy
- Random asynchronous iterations in distributed coordination algorithms
- Stochastic approximation algorithms: overview and recent trends.
- Reinforcement learning based algorithms for average cost Markov decision processes
- Title not available (Why is that?)
- Current fluctuations of a self-interacting diffusion on a ring
- A sensitivity formula for risk-sensitive cost and the actor-critic algorithm
- Q-learning for Markov decision processes with a satisfiability criterion
- Approachability in Stackelberg stochastic games with vector costs
This page was built for publication: Asynchronous Stochastic Approximations
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4388937)