Multivariate stochastic approximation using a simultaneous perturbation gradient approximation

From MaRDI portal
Publication:4006251

DOI10.1109/9.119632zbMath0745.60110OpenAlexW2124289529WikidataQ64357626 ScholiaQ64357626MaRDI QIDQ4006251

James C. Spall

Publication date: 26 September 1992

Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1109/9.119632




Related Items (only showing first 100 items - show all)

Full-low evaluation methods for derivative-free optimizationLog-normalization constant estimation using the ensemble Kalman–Bucy filter with application to high-dimensional modelsFinite Difference Gradient Approximation: To Randomize or Not?A modified second‐order SPSA optimization algorithm for finite samplesApplication of stochastic approximation techniques in neural modelling and controlActor-Critic–Like Stochastic Adaptive Search for Continuous Simulation OptimizationParameter estimation in a highly non-linear model using simultaneous perturbation stochastic approximationEnsemble Gradient for Learning Turbulence Models from Indirect ObservationsConvergence of a Distributed Kiefer-Wolfowitz AlgorithmZeroth-order optimization with orthogonal random directionsAn introduction to variational quantum algorithms for combinatorial optimization problemsOnline data‐driven control of variable speed wind turbines using the simultaneous perturbation stochastic approximation approachStochastic approximation with nondecaying gain: Error bound and data‐driven gain‐tuningRisk-Sensitive Reinforcement Learning via Policy Gradient SearchA Zeroth-Order Proximal Stochastic Gradient Method for Weakly Convex Stochastic OptimizationTechnical note: <scp>Finite‐time</scp> regret analysis of <scp>Kiefer‐Wolfowitz</scp> stochastic approximation algorithm and nonparametric <scp>multi‐product</scp> dynamic pricing with unknown demandA quasi-Newton trust-region method for optimization under uncertainty using stochastic simplex approximate gradientsSimultaneous perturbation stochastic approximation: towards one-measurement per iterationStochastic search for a parametric cost function approximation: energy storage with rolling forecastsNear term algorithms for linear systems of equationsA unified stochastic approximation framework for learning in gamesData‐driven fault‐tolerant control for SISO nonlinear system with unknown sensor faultA binary monkey search algorithm variation for solving the set covering problemDetecting entanglement of unknown states by violating the Clauser-Horne-Shimony-Holt inequalityNonsmooth optimization by Lie bracket approximations into random directionsPractical adaptive quantum tomographyInput–Output Uncertainty Comparisons for Discrete Optimization via SimulationDesigning inharmonic stringsGradient-Based Adaptive Stochastic Search for Simulation Optimization Over Continuous SpaceSurrogate-Based Promising Area Search for Lipschitz Continuous Simulation OptimizationStochastic optimisation with inequality constraints using simultaneous perturbations and penalty functionsEfficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector SetsVariational Quantum Eigensolver and Its ApplicationsRisk-Constrained Reinforcement Learning with Percentile Risk CriteriaA Kiefer‐Wolfowitz Algorithm Based Iterative Learning Control for Hammerstein‐Wiener SystemsMaximizing Complex Likelihoods via Directed Stochastic Searching AlgorithmStochastic approximationMultidimensional stochastic approximationTwo Timescale Analysis of the Alopex Algorithm for OptimizationRandom-direction optimization algorithms with applications to threshold controlsA short note on SPSA techniques and their use in nonlinear bioprocess identificationUnnamed ItemUnnamed ItemStochastic approximation algorithms: overview and recent trends.A two Timescale Stochastic Approximation Scheme for Simulation-Based Parametric OptimizationExtremum Seeking Control with Two-Sided Stochastic PerturbationsSimulation Optimization Using Multi-Time-Scale Adaptive Random SearchProduction planning system for a combination of make-to-stock and make-to-order productsSimulation optimization: a review of algorithms and applicationsDetermination of the Mechanical Properties of a Solid Elastic Medium from a Seismic Wave Propagation Using Two Statistical EstimatorsStochastic Recursive Inclusions in Two Timescales with Nonadditive Iterate-Dependent Markov NoiseParallel Simultaneous Perturbation OptimizationDerivative-free optimization methodsMulti-agent consensus under a communication–broadcast mixed environmentSimulation Optimization: A Review and Exploration in the New Era of Cloud Computing and Big DataEfficient Bayesian Experimentation Using an Expected Information Gain Lower BoundIterative learning control using faded measurements without system information: a gradient estimation approachActor-Critic Algorithms with Online Feature AdaptationSmoothed Functional Algorithms for Stochastic Optimization Using q -Gaussian DistributionsA Stochastic Simplex Approximate Gradient (StoSAG) for optimization under uncertaintyEfficient and direct estimation of the variance-covariance matrix in EM algorithm with interpolation methodA Bayesian stochastic approximation methodAn incremental off-policy search in a model-free Markov decision process using a single sample pathStopping rules for optimization algorithms based on stochastic approximationOn stochastic extremum seeking via adaptive perturbation-demodulation loopStochastic approximation of global minimum pointsBroadcast control of multi-agent systemsStochastic derivative-free optimization using a trust region frameworkExtremum seeking-based optimal EGR set-point design for combustion engines in lean-burn modeGaussian process surrogates for failure detection: a Bayesian experimental design approachMultilevel estimation of normalization constants using ensemble Kalman-Bucy filtersFinding local optima of high-dimensional functions using direct search methodsMultiscale Q-learning with linear function approximationA combined direction stochastic approximation algorithmEstimating the position of a moving object based on test disturbance of camera positionAuxiliary controller design and performance comparative analysis in closed-loop brain-machine interface systemContinuous action set learning automata for stochastic optimizationOn the use of an SPSA-based model-free controller in quality improvementAnnealing stochastic approximation Monte Carlo algorithm for neural network trainingGlobal convergence rate analysis of unconstrained optimization methods based on probabilistic modelsStochastic optimization using a trust-region method and random modelsConstrained optimization via stochastic approximation with a simultaneous perturbation gradient approximationActor-critic algorithms for hierarchical Markov decision processesReinforcement learning based algorithms for average cost Markov decision processesSimulation optimization for revenue management of airlines with cancellations and overbookingAn adaptive optimization scheme with satisfactory transient performanceAlgorithm for stochastic approximation with trial input perturbation in the nonstationary problem of optimizationTheoretical connections between optimization algorithms based on an approximate gradientSIMULATION-BASED OPTIMIZATION BY NEW STOCHASTIC APPROXIMATION ALGORITHMMinimax efficient finite-difference stochastic gradient estimators using black-box function evaluationsPseudo-perturbation-based broadcast control of multi-agent systemsSimultaneous perturbation stochastic approximation of nonsmooth functionsFeature selection using stochastic approximation with Barzilai and Borwein non-monotone gainsA novel ADP based model-free predictive controlConvergence guarantees for generalized adaptive stochastic search methods for continuous global optimizationExtremum seeking of dynamical systems via gradient descent and stochastic approximation methodsAn actor-critic algorithm with function approximation for discounted cost constrained Markov decision processesVariance-constrained actor-critic algorithms for discounted and average reward MDPsInterval type-2 recurrent fuzzy neural system for nonlinear systems control using stable simultaneous perturbation stochastic approximation algorithmImproved sampling strategies for ensemble-based optimization




This page was built for publication: Multivariate stochastic approximation using a simultaneous perturbation gradient approximation