Pages that link to "Item:Q674970"

From MaRDI portal

← A one-measurement form of simultaneous perturbation stochastic approximation (Q674970)

Jump to:navigation, search

The following pages link to A one-measurement form of simultaneous perturbation stochastic approximation (Q674970):

Displaying 29 items.

Multiscale Q-learning with linear function approximation (Q312650) (← links)
Stochastic approximation search algorithms with randomization at the input (Q747226) (← links)
Actor-critic algorithms for hierarchical Markov decision processes (Q856510) (← links)
The stochastic approximation method for the estimation of a multivariate probability density (Q1015895) (← links)
Accelerated randomized stochastic optimization. (Q1434014) (← links)
On stochastic extremum seeking via adaptive perturbation-demodulation loop (Q1626541) (← links)
Online estimation of hazard rate under random censoring (Q1642739) (← links)
Variance-constrained actor-critic algorithms for discounted and average reward MDPs (Q1689603) (← links)
An alternating variable method with varying replications for simulation response optimization (Q1770651) (← links)
Simulation response optimization via direct conjugate direction method (Q1870805) (← links)
Revisiting the ODE method for recursive algorithms: fast convergence using quasi stochastic approximation (Q2070010) (← links)
Stochastic relaxed inertial forward-backward-forward splitting for monotone inclusions in Hilbert spaces (Q2082546) (← links)
Derivative-free optimization over multi-user MIMO networks (Q2090228) (← links)
Sampled-data extremum-seeking framework for constrained optimization of nonlinear dynamical systems (Q2151945) (← links)
Online estimation of integrated squared density derivatives (Q2216958) (← links)
A compact law of the iterated logarithm for online estimator of hazard rate under random censoring (Q2244598) (← links)
Simultaneous perturbation Newton algorithms for simulation optimization (Q2260692) (← links)
Recursive estimators of integrated squared density derivatives (Q2288772) (← links)
The multivariate Révész's online estimator of a regression function and its averaging (Q2396740) (← links)
New algorithms of the Q-learning type (Q2440701) (← links)
A companion for the Kiefer-Wolfowitz-Blum stochastic approximation algorithm (Q2456019) (← links)
A randomized stochastic optimization algorithm: its estimation accuracy (Q2457537) (← links)
Fuzzy age-dependent replacement policy and SPSA algorithm based-on fuzzy simulation (Q2466114) (← links)
Reinforcement learning based algorithms for average cost Markov decision processes (Q2643632) (← links)
Simultaneous Perturbation Stochastic Approximation with Norm-Limited Update Vector (Q2813999) (← links)
Designing inharmonic strings (Q4620414) (← links)
Stochastic approximation with nondecaying gain: Error bound and data‐driven gain‐tuning (Q6054830) (← links)
Simultaneous perturbation stochastic approximation: towards one-measurement per iteration (Q6076930) (← links)
No-regret learning for repeated non-cooperative games with lossy bandits (Q6152576) (← links)

Retrieved from "https://portal.mardi4nfdi.de/wiki/Special:WhatLinksHere"