Pages that link to "Item:Q4564833"
From MaRDI portal
The following pages link to Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences (Q4564833):
- Multiscale Q-learning with linear function approximation (Q312650)
- Actor-critic algorithms for hierarchical Markov decision processes (Q856510)
- Variance-constrained actor-critic algorithms for discounted and average reward MDPs (Q1689603)
- Simultaneous perturbation Newton algorithms for simulation optimization (Q2260692)
- An adaptive optimization scheme with satisfactory transient performance (Q2390563)
- A stability criterion for two timescale stochastic approximation schemes (Q2409333)
- New algorithms of the Q-learning type (Q2440701)