Stochastic approximation with two time scales
From MaRDI portal
Cites work
- scientific article; zbMATH DE number 3826915 (Why is no real title available?)
- scientific article; zbMATH DE number 51708 (Why is no real title available?)
- scientific article; zbMATH DE number 193190 (Why is no real title available?)
- scientific article; zbMATH DE number 3538599 (Why is no real title available?)
- scientific article; zbMATH DE number 6458575 (Why is no real title available?)
- scientific article; zbMATH DE number 3232230 (Why is no real title available?)
- A tutorial survey of reinforcement learning
- Asynchronous stochastic approximation and Q-learning
- Singular perturbations and asymptotic analysis in control systems
- Stochastic approximation methods for constrained and unconstrained systems
- Weak convergence methods and singularly perturbed stochastic control and filtering problems
Cited in
(68)- The actor-critic algorithm as multi-time-scale stochastic approximation.
- Generative adversarial networks are special cases of artificial curiosity (1990) and also closely related to predictability minimization (1991)
- Computing VaR and CVaR using stochastic approximation and adaptive unconstrained importance sampling
- A Diffusion Approximation Theory of Momentum Stochastic Gradient Descent in Nonconvex Optimization
- Sequential online subsampling for thinning experimental designs
- A Two-Timescale Stochastic Algorithm Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic
- Adaptive Monte Carlo variance reduction for Lévy processes with two-time-scale stochastic approximation
- Risk-Sensitive Reinforcement Learning via Policy Gradient Search
- Online calibrated forecasts: memory efficiency versus universality for learning in games
- Two-time-scale nonparametric recursive regression estimator for independent functional data
- Convergence rate of linear two-time-scale stochastic approximation.
- Joint online parameter estimation and optimal sensor placement for the partially observed stochastic advection-diffusion equation
- Adaptive importance sampling and control variates
- Non asymptotic controls on a recursive superquantile approximation
- Stochastic Dynamic Information Flow Tracking Game with Reinforcement Learning
- An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method
- A new learning algorithm for optimal stopping
- Recursive regression estimation based on the two-time-scale stochastic approximation method and Bernstein polynomials
- Convergent multiple-timescales reinforcement learning algorithms in normal form games
- Reinforcement learning for long-run average cost.
- On the sample complexity of actor-critic method for reinforcement learning with function approximation
- Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences
- Simulation optimization using multi-time-scale adaptive random search
- A Stochastic Approximation Method for Simulation-Based Quantile Optimization
- Natural actor-critic algorithms
- Multiscale Q-learning with linear function approximation
- A stability criterion for two timescale stochastic approximation schemes
- Robustness properties in fictitious-play-type algorithms
- Distributed stochastic approximation with local projections
- A two Timescale Stochastic Approximation Scheme for Simulation-Based Parametric Optimization
- Weak convergence of dynamical systems in two timescales
- Single-leader-multiple-follower games with boundedly rational agents
- Unified reinforcement Q-learning for mean field game and control problems
- Approach to expressing the second moment with a class of stochas- tic completion time
- Stochastic approximation algorithms for superquantiles estimation
- Analyzing approximate value iteration algorithms
- Nonlinear gossip
- Stochastic fictitious play with continuous action sets
- Towards multi‐agent reinforcement learning‐driven over‐the‐counter market simulations
- Optimizing adaptive importance sampling by stochastic approximation
- A biologically plausible neural network for multichannel canonical correlation analysis
- Stochastic approximation on Riemannian manifolds
- Monte-Carlo estimation of time-dependent statistical characteristics of random dynamical systems
- Asynchronous stochastic approximation with differential inclusions
- Batching Adaptive Variance Reduction
- Stochastic Recursive Inclusions in Two Timescales with Nonadditive Iterate-Dependent Markov Noise
- New algorithms of the Q-learning type
- Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning
- An actor-critic algorithm for constrained Markov decision processes
- Gradient-based adaptive stochastic search for simulation optimization over continuous space
- Generalization error of GAN from the discriminator's perspective
- Stochastic heavy ball
- scientific article; zbMATH DE number 7625165 (Why is no real title available?)
- Linear stochastic approximation driven by slowly varying Markov chains
- Stochastic approximation algorithms: overview and recent trends.
- Geometrical Insights for Implicit Generative Modeling
- Two-timescale stochastic gradient descent in continuous time with applications to joint online parameter estimation and optimal sensor placement
- Two Timescale Analysis of the Alopex Algorithm for Optimization
- Stochastic compositional gradient descent: algorithms for minimizing compositions of expected-value functions
- Q-learning for continuous-time linear systems: A model-free infinite horizon optimal control approach
- Convergence rate and averaging of nonlinear two-time-scale stochastic approximation algo\-rithms
- Q-learning for Markov decision processes with a satisfiability criterion
- A sensitivity formula for risk-sensitive cost and the actor-critic algorithm
- Actor-critic algorithms with online feature adaptation
- Adaptive Monte Carlo Variance Reduction with Two-time-scale Stochastic Approximation
- REINFORCEMENT LEARNING IN MARKOVIAN EVOLUTIONARY GAMES
- Reinforcement learning algorithms with function approximation: recent advances and applications
- Bayesian experimental design without posterior calculations: an adversarial approach
This page was built for publication: Stochastic approximation with two time scales
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1391875)