Pages that link to "Item:Q4564833"
From MaRDI portal
The following pages link to Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences (Q4564833):
Displaying 10 items.
- Multiscale Q-learning with linear function approximation (Q312650) (← links)
- Actor-critic algorithms for hierarchical Markov decision processes (Q856510) (← links)
- Variance-constrained actor-critic algorithms for discounted and average reward MDPs (Q1689603) (← links)
- Revisiting the ODE method for recursive algorithms: fast convergence using quasi stochastic approximation (Q2070010) (← links)
- Weak convergence of dynamical systems in two timescales (Q2203452) (← links)
- Simultaneous perturbation Newton algorithms for simulation optimization (Q2260692) (← links)
- An adaptive optimization scheme with satisfactory transient performance (Q2390563) (← links)
- A stability criterion for two timescale stochastic approximation schemes (Q2409333) (← links)
- New algorithms of the Q-learning type (Q2440701) (← links)
- Parallel Simultaneous Perturbation Optimization (Q5223039) (← links)