Stochastic approximation with two time scales

From MaRDI portal
Publication:1391875

DOI: 10.1016/S0167-6911(97)90015-3
zbMath: 0895.62085
OpenAlex: W2094364653
MaRDI QID: Q1391875

Vivek S. Borkar

Publication date: 23 July 1998

Published in: Systems & Control Letters

Full work available at URL: https://doi.org/10.1016/s0167-6911(97)90015-3




Related Items (66)

A Biologically Plausible Neural Network for Multichannel Canonical Correlation Analysis
Recursive regression estimation based on the two-time-scale stochastic approximation method and Bernstein polynomials
A Two-Timescale Stochastic Algorithm Framework for Bilevel Optimization: Complexity Analysis and Application to Actor-Critic
Convergence rate of linear two-time-scale stochastic approximation.
Sequential online subsampling for thinning experimental designs
An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method
A new learning algorithm for optimal stopping
Multiscale Q-learning with linear function approximation
Asynchronous stochastic approximation with differential inclusions
A Stochastic Approximation Method for Simulation-Based Quantile Optimization
Online calibrated forecasts: memory efficiency versus universality for learning in games
Unified reinforcement Q-learning for mean field game and control problems
Convergence rate and averaging of nonlinear two-time-scale stochastic approximation algorithms
Distributed Stochastic Approximation with Local Projections
A Diffusion Approximation Theory of Momentum Stochastic Gradient Descent in Nonconvex Optimization
A stability criterion for two timescale stochastic approximation schemes
Optimizing Adaptive Importance Sampling by Stochastic Approximation
Robustness Properties in Fictitious-Play-Type Algorithms
Risk-Sensitive Reinforcement Learning via Policy Gradient Search
Stochastic fictitious play with continuous action sets
Batching Adaptive Variance Reduction
Bayesian experimental design without posterior calculations: an adversarial approach
On the sample complexity of actor-critic method for reinforcement learning with function approximation
Geometrical Insights for Implicit Generative Modeling
Stochastic heavy ball
Two-time-scale nonparametric recursive regression estimator for independent functional data
Reinforcement learning algorithms with function approximation: recent advances and applications
Towards multi‐agent reinforcement learning‐driven over‐the‐counter market simulations
Two-timescale stochastic gradient descent in continuous time with applications to joint online parameter estimation and optimal sensor placement
Weak convergence of dynamical systems in two timescales
Monte-Carlo estimation of time-dependent statistical characteristics of random dynamical systems
New algorithms of the Q-learning type
Gradient-Based Adaptive Stochastic Search for Simulation Optimization Over Continuous Space
Reinforcement learning for long-run average cost.
Convergent multiple-timescales reinforcement learning algorithms in normal form games
Adaptive Monte Carlo variance reduction for Lévy processes with two-time-scale stochastic approximation
Stochastic Dynamic Information Flow Tracking Game with Reinforcement Learning
Non asymptotic controls on a recursive superquantile approximation
Generative adversarial networks are special cases of artificial curiosity (1990) and also closely related to predictability minimization (1991)
Stochastic compositional gradient descent: algorithms for minimizing compositions of expected-value functions
Q-learning for continuous-time linear systems: A model-free infinite horizon optimal control approach
Unnamed Item
Q-learning for Markov decision processes with a satisfiability criterion
Adaptive Monte Carlo Variance Reduction with Two-time-scale Stochastic Approximation
Adaptive importance sampling and control variates
Single-leader-multiple-follower games with boundedly rational agents
Two Timescale Analysis of the Alopex Algorithm for Optimization
Stochastic approximation on Riemannian manifolds
Linear stochastic approximation driven by slowly varying Markov chains
An actor-critic algorithm for constrained Markov decision processes
The actor-critic algorithm as multi-time-scale stochastic approximation.
Stochastic approximation algorithms: overview and recent trends.
REINFORCEMENT LEARNING IN MARKOVIAN EVOLUTIONARY GAMES
A sensitivity formula for risk-sensitive cost and the actor-critic algorithm
A two Timescale Stochastic Approximation Scheme for Simulation-Based Parametric Optimization
Simulation Optimization Using Multi-Time-Scale Adaptive Random Search
Stochastic approximation algorithms for superquantiles estimation
Nonlinear Gossip
Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning
Stochastic Recursive Inclusions in Two Timescales with Nonadditive Iterate-Dependent Markov Noise
Generalization error of GAN from the discriminator's perspective
Computing VaR and CVaR using stochastic approximation and adaptive unconstrained importance sampling
Natural actor-critic algorithms
Joint Online Parameter Estimation and Optimal Sensor Placement for the Partially Observed Stochastic Advection-Diffusion Equation
Analyzing Approximate Value Iteration Algorithms
Actor-Critic Algorithms with Online Feature Adaptation




