Stochastic approximation with two time scales

Cites work

scientific article; zbMATH DE number 3826915 (Why is no real title available?)
scientific article; zbMATH DE number 51708 (Why is no real title available?)
scientific article; zbMATH DE number 193190 (Why is no real title available?)
scientific article; zbMATH DE number 3538599 (Why is no real title available?)
scientific article; zbMATH DE number 6458575 (Why is no real title available?)
scientific article; zbMATH DE number 3232230 (Why is no real title available?)
A tutorial survey of reinforcement learning
Asynchronous stochastic approximation and Q-learning
Singular perturbations and asymptotic analysis in control systems
Stochastic approximation methods for constrained and unconstrained systems
Weak convergence methods and singularly perturbed stochastic control and filtering problems

Cited in

(68)

The actor-critic algorithm as multi-time-scale stochastic approximation.
Generative adversarial networks are special cases of artificial curiosity (1990) and also closely related to predictability minimization (1991)
Computing VaR and CVaR using stochastic approximation and adaptive unconstrained importance sampling
A Diffusion Approximation Theory of Momentum Stochastic Gradient Descent in Nonconvex Optimization
Sequential online subsampling for thinning experimental designs
A Two-Timescale Stochastic Algorithm Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic
Adaptive Monte Carlo variance reduction for Lévy processes with two-time-scale stochastic approximation
Risk-Sensitive Reinforcement Learning via Policy Gradient Search
Online calibrated forecasts: memory efficiency versus universality for learning in games
Two-time-scale nonparametric recursive regression estimator for independent functional data
Convergence rate of linear two-time-scale stochastic approximation.
Joint online parameter estimation and optimal sensor placement for the partially observed stochastic advection-diffusion equation
Adaptive importance sampling and control variates
Non asymptotic controls on a recursive superquantile approximation
Stochastic Dynamic Information Flow Tracking Game with Reinforcement Learning
An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method
A new learning algorithm for optimal stopping
Recursive regression estimation based on the two-time-scale stochastic approximation method and Bernstein polynomials
Convergent multiple-timescales reinforcement learning algorithms in normal form games
Reinforcement learning for long-run average cost.
On the sample complexity of actor-critic method for reinforcement learning with function approximation
Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences
Simulation optimization using multi-time-scale adaptive random search
A Stochastic Approximation Method for Simulation-Based Quantile Optimization
Natural actor-critic algorithms
Multiscale Q-learning with linear function approximation
A stability criterion for two timescale stochastic approximation schemes
Robustness properties in fictitious-play-type algorithms
Distributed stochastic approximation with local projections
A two Timescale Stochastic Approximation Scheme for Simulation-Based Parametric Optimization
Weak convergence of dynamical systems in two timescales
Single-leader-multiple-follower games with boundedly rational agents
Unified reinforcement Q-learning for mean field game and control problems
Approach to expressing the second moment with a class of stochas- tic completion time
Stochastic approximation algorithms for superquantiles estimation
Analyzing approximate value iteration algorithms
Nonlinear gossip
Stochastic fictitious play with continuous action sets
Towards multi‐agent reinforcement learning‐driven over‐the‐counter market simulations
Optimizing adaptive importance sampling by stochastic approximation
A biologically plausible neural network for multichannel canonical correlation analysis
Stochastic approximation on Riemannian manifolds
Monte-Carlo estimation of time-dependent statistical characteristics of random dynamical systems
Asynchronous stochastic approximation with differential inclusions
Batching Adaptive Variance Reduction
Stochastic Recursive Inclusions in Two Timescales with Nonadditive Iterate-Dependent Markov Noise
New algorithms of the Q-learning type
Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning
An actor-critic algorithm for constrained Markov decision processes
Gradient-based adaptive stochastic search for simulation optimization over continuous space
Generalization error of GAN from the discriminator's perspective
Stochastic heavy ball
scientific article; zbMATH DE number 7625165 (Why is no real title available?)
Linear stochastic approximation driven by slowly varying Markov chains
Stochastic approximation algorithms: overview and recent trends.
Geometrical Insights for Implicit Generative Modeling
Two-timescale stochastic gradient descent in continuous time with applications to joint online parameter estimation and optimal sensor placement
Two Timescale Analysis of the Alopex Algorithm for Optimization
Stochastic compositional gradient descent: algorithms for minimizing compositions of expected-value functions
Q-learning for continuous-time linear systems: A model-free infinite horizon optimal control approach
Convergence rate and averaging of nonlinear two-time-scale stochastic approximation algo\-rithms
Q-learning for Markov decision processes with a satisfiability criterion
A sensitivity formula for risk-sensitive cost and the actor-critic algorithm
Actor-critic algorithms with online feature adaptation
Adaptive Monte Carlo Variance Reduction with Two-time-scale Stochastic Approximation
REINFORCEMENT LEARNING IN MARKOVIAN EVOLUTIONARY GAMES
Reinforcement learning algorithms with function approximation: recent advances and applications
Bayesian experimental design without posterior calculations: an adversarial approach

This page was built for publication: Stochastic approximation with two time scales

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1391875)