Pages that link to "Item:Q674970"
From MaRDI portal
The following pages link to A one-measurement form of simultaneous perturbation stochastic approximation (Q674970):
Displaying 29 items.
- Multiscale Q-learning with linear function approximation (Q312650)
- Stochastic approximation search algorithms with randomization at the input (Q747226)
- Actor-critic algorithms for hierarchical Markov decision processes (Q856510)
- The stochastic approximation method for the estimation of a multivariate probability density (Q1015895)
- Accelerated randomized stochastic optimization. (Q1434014)
- On stochastic extremum seeking via adaptive perturbation-demodulation loop (Q1626541)
- Online estimation of hazard rate under random censoring (Q1642739)
- Variance-constrained actor-critic algorithms for discounted and average reward MDPs (Q1689603)
- An alternating variable method with varying replications for simulation response optimization (Q1770651)
- Simulation response optimization via direct conjugate direction method (Q1870805)
- Revisiting the ODE method for recursive algorithms: fast convergence using quasi stochastic approximation (Q2070010)
- Stochastic relaxed inertial forward-backward-forward splitting for monotone inclusions in Hilbert spaces (Q2082546)
- Derivative-free optimization over multi-user MIMO networks (Q2090228)
- Sampled-data extremum-seeking framework for constrained optimization of nonlinear dynamical systems (Q2151945)
- Online estimation of integrated squared density derivatives (Q2216958)
- A compact law of the iterated logarithm for online estimator of hazard rate under random censoring (Q2244598)
- Simultaneous perturbation Newton algorithms for simulation optimization (Q2260692)
- Recursive estimators of integrated squared density derivatives (Q2288772)
- The multivariate Révész's online estimator of a regression function and its averaging (Q2396740)
- New algorithms of the Q-learning type (Q2440701)
- A companion for the Kiefer-Wolfowitz-Blum stochastic approximation algorithm (Q2456019)
- A randomized stochastic optimization algorithm: its estimation accuracy (Q2457537)
- Fuzzy age-dependent replacement policy and SPSA algorithm based-on fuzzy simulation (Q2466114)
- Reinforcement learning based algorithms for average cost Markov decision processes (Q2643632)
- Simultaneous Perturbation Stochastic Approximation with Norm-Limited Update Vector (Q2813999)
- Designing inharmonic strings (Q4620414)
- Stochastic approximation with nondecaying gain: Error bound and data‐driven gain‐tuning (Q6054830)
- Simultaneous perturbation stochastic approximation: towards one-measurement per iteration (Q6076930)
- No-regret learning for repeated non-cooperative games with lossy bandits (Q6152576)