Pages that link to "Item:Q1391875"
From MaRDI portal
The following pages link to Stochastic approximation with two time scales (Q1391875):
Displaying 50 items.
- Multiscale Q-learning with linear function approximation (Q312650) (← links)
- Stochastic fictitious play with continuous action sets (Q403729) (← links)
- Stochastic compositional gradient descent: algorithms for minimizing compositions of expected-value functions (Q507334) (← links)
- Q-learning for continuous-time linear systems: A model-free infinite horizon optimal control approach (Q511735) (← links)
- Monte-Carlo estimation of time-dependent statistical characteristics of random dynamical systems (Q636548) (← links)
- Sequential online subsampling for thinning experimental designs (Q830692) (← links)
- A new learning algorithm for optimal stopping (Q839001) (← links)
- Convergence rate and averaging of nonlinear two-time-scale stochastic approximation algo\-rithms (Q862224) (← links)
- Reinforcement learning algorithms with function approximation: recent advances and applications (Q903601) (← links)
- Adaptive Monte Carlo variance reduction for Lévy processes with two-time-scale stochastic approximation (Q931375) (← links)
- Natural actor-critic algorithms (Q1049136) (← links)
- Reinforcement learning for long-run average cost. (Q1427588) (← links)
- Convergent multiple-timescales reinforcement learning algorithms in normal form games (Q1429103) (← links)
- An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method (Q1631797) (← links)
- Stochastic heavy ball (Q1697485) (← links)
- Q-learning for Markov decision processes with a satisfiability criterion (Q1749413) (← links)
- Convergence rate of linear two-time-scale stochastic approximation. (Q1879892) (← links)
- Generative adversarial networks are special cases of artificial curiosity (1990) and also closely related to predictability minimization (1991) (Q1982402) (← links)
- Adaptive importance sampling and control variates (Q2009326) (← links)
- Stochastic approximation on Riemannian manifolds (Q2020323) (← links)
- Stochastic approximation algorithms for superquantiles estimation (Q2042800) (← links)
- Generalization error of GAN from the discriminator's perspective (Q2071033) (← links)
- Recursive regression estimation based on the two-time-scale stochastic approximation method and Bernstein polynomials (Q2121627) (← links)
- Unified reinforcement Q-learning for mean field game and control problems (Q2153488) (← links)
- Weak convergence of dynamical systems in two timescales (Q2203452) (← links)
- Non asymptotic controls on a recursive superquantile approximation (Q2233588) (← links)
- Single-leader-multiple-follower games with boundedly rational agents (Q2270555) (← links)
- Online calibrated forecasts: memory efficiency versus universality for learning in games (Q2384142) (← links)
- A stability criterion for two timescale stochastic approximation schemes (Q2409333) (← links)
- New algorithms of the Q-learning type (Q2440701) (← links)
- Linear stochastic approximation driven by slowly varying Markov chains (Q2503529) (← links)
- An actor-critic algorithm for constrained Markov decision processes (Q2504518) (← links)
- Two-timescale stochastic gradient descent in continuous time with applications to joint online parameter estimation and optimal sensor placement (Q2692526) (← links)
- Nonlinear Gossip (Q2813309) (← links)
- Stochastic Dynamic Information Flow Tracking Game with Reinforcement Learning (Q3297679) (← links)
- Stochastic Recursive Inclusions in Two Timescales with Nonadditive Iterate-Dependent Markov Noise (Q3387930) (← links)
- Computing VaR and CVaR using stochastic approximation and adaptive unconstrained importance sampling (Q3654434) (← links)
- Two Timescale Analysis of the Alopex Algorithm for Optimization (Q4409386) (← links)
- REINFORCEMENT LEARNING IN MARKOVIAN EVOLUTIONARY GAMES (Q4425274) (← links)
- Distributed Stochastic Approximation with Local Projections (Q4562250) (← links)
- Optimizing Adaptive Importance Sampling by Stochastic Approximation (Q4584930) (← links)
- Robustness Properties in Fictitious-Play-Type Algorithms (Q4588840) (← links)
- A two Timescale Stochastic Approximation Scheme for Simulation-Based Parametric Optimization (Q4950732) (← links)
- A Biologically Plausible Neural Network for Multichannel Canonical Correlation Analysis (Q5033519) (← links)
- (Q5054599) (← links)
- A Stochastic Approximation Method for Simulation-Based Quantile Optimization (Q5060775) (← links)
- A Diffusion Approximation Theory of Momentum Stochastic Gradient Descent in Nonconvex Optimization (Q5084492) (← links)
- Risk-Sensitive Reinforcement Learning via Policy Gradient Search (Q5102286) (← links)
- Gradient-Based Adaptive Stochastic Search for Simulation Optimization Over Continuous Space (Q5131717) (← links)
- Asynchronous stochastic approximation with differential inclusions (Q5168859) (← links)