A one-measurement form of simultaneous perturbation stochastic approximation

From MaRDI portal
Publication:674970

DOI10.1016/S0005-1098(96)00149-5zbMath0867.93086OpenAlexW2012117977MaRDI QIDQ674970

James C. Spall

Publication date: 7 August 1997

Published in: Automatica (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1016/s0005-1098(96)00149-5




Related Items (30)

On stochastic extremum seeking via adaptive perturbation-demodulation loopMultiscale Q-learning with linear function approximationOnline estimation of hazard rate under random censoringSampled-data extremum-seeking framework for constrained optimization of nonlinear dynamical systemsActor-critic algorithms for hierarchical Markov decision processesReinforcement learning based algorithms for average cost Markov decision processesThe multivariate Révész's online estimator of a regression function and its averagingStochastic approximation with nondecaying gain: Error bound and data‐driven gain‐tuningSimultaneous perturbation stochastic approximation: towards one-measurement per iterationVariance-constrained actor-critic algorithms for discounted and average reward MDPsThree-string inharmonic networksNo-regret learning for repeated non-cooperative games with lossy banditsNew algorithms of the Q-learning typeDesigning inharmonic stringsOnline estimation of integrated squared density derivativesAccelerated randomized stochastic optimization.A companion for the Kiefer-Wolfowitz-Blum stochastic approximation algorithmA randomized stochastic optimization algorithm: its estimation accuracyFuzzy age-dependent replacement policy and SPSA algorithm based-on fuzzy simulationA compact law of the iterated logarithm for online estimator of hazard rate under random censoringSimultaneous perturbation Newton algorithms for simulation optimizationAn alternating variable method with varying replications for simulation response optimizationRecursive estimators of integrated squared density derivativesSimultaneous Perturbation Stochastic Approximation with Norm-Limited Update VectorStochastic approximation search algorithms with randomization at the inputThe stochastic approximation method for the estimation of a multivariate probability densityRevisiting the ODE method for recursive algorithms: fast convergence using quasi stochastic approximationStochastic relaxed inertial forward-backward-forward splitting for monotone inclusions in Hilbert spacesDerivative-free optimization over multi-user MIMO networksSimulation response optimization via direct conjugate direction method




Cites Work




This page was built for publication: A one-measurement form of simultaneous perturbation stochastic approximation