Multivariate stochastic approximation using a simultaneous perturbation gradient approximation
From MaRDI portal
Publication:4006251
DOI10.1109/9.119632zbMath0745.60110OpenAlexW2124289529WikidataQ64357626 ScholiaQ64357626MaRDI QIDQ4006251
Publication date: 26 September 1992
Published in: IEEE Transactions on Automatic Control (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1109/9.119632
stochastic approximationgradient approximationfinite-difference-based algorithms in large-dimensional problemsKiefer-Wolfowitz type procedures
Related Items (only showing first 100 items - show all)
Full-low evaluation methods for derivative-free optimization ⋮ Log-normalization constant estimation using the ensemble Kalman–Bucy filter with application to high-dimensional models ⋮ Finite Difference Gradient Approximation: To Randomize or Not? ⋮ A modified second‐order SPSA optimization algorithm for finite samples ⋮ Application of stochastic approximation techniques in neural modelling and control ⋮ Actor-Critic–Like Stochastic Adaptive Search for Continuous Simulation Optimization ⋮ Parameter estimation in a highly non-linear model using simultaneous perturbation stochastic approximation ⋮ Ensemble Gradient for Learning Turbulence Models from Indirect Observations ⋮ Convergence of a Distributed Kiefer-Wolfowitz Algorithm ⋮ Zeroth-order optimization with orthogonal random directions ⋮ An introduction to variational quantum algorithms for combinatorial optimization problems ⋮ Online data‐driven control of variable speed wind turbines using the simultaneous perturbation stochastic approximation approach ⋮ Stochastic approximation with nondecaying gain: Error bound and data‐driven gain‐tuning ⋮ Risk-Sensitive Reinforcement Learning via Policy Gradient Search ⋮ A Zeroth-Order Proximal Stochastic Gradient Method for Weakly Convex Stochastic Optimization ⋮ Technical note: <scp>Finite‐time</scp> regret analysis of <scp>Kiefer‐Wolfowitz</scp> stochastic approximation algorithm and nonparametric <scp>multi‐product</scp> dynamic pricing with unknown demand ⋮ A quasi-Newton trust-region method for optimization under uncertainty using stochastic simplex approximate gradients ⋮ Simultaneous perturbation stochastic approximation: towards one-measurement per iteration ⋮ Stochastic search for a parametric cost function approximation: energy storage with rolling forecasts ⋮ Near term algorithms for linear systems of equations ⋮ A unified stochastic approximation framework for learning in games ⋮ Data‐driven fault‐tolerant control for SISO nonlinear system with unknown sensor fault ⋮ A binary monkey search algorithm variation for solving the set covering problem ⋮ Detecting entanglement of unknown states by violating the Clauser-Horne-Shimony-Holt inequality ⋮ Nonsmooth optimization by Lie bracket approximations into random directions ⋮ Practical adaptive quantum tomography ⋮ Input–Output Uncertainty Comparisons for Discrete Optimization via Simulation ⋮ Designing inharmonic strings ⋮ Gradient-Based Adaptive Stochastic Search for Simulation Optimization Over Continuous Space ⋮ Surrogate-Based Promising Area Search for Lipschitz Continuous Simulation Optimization ⋮ Stochastic optimisation with inequality constraints using simultaneous perturbations and penalty functions ⋮ Efficient Multi-objective Reinforcement Learning via Multiple-gradient Descent with Iteratively Discovered Weight-Vector Sets ⋮ Variational Quantum Eigensolver and Its Applications ⋮ Risk-Constrained Reinforcement Learning with Percentile Risk Criteria ⋮ A Kiefer‐Wolfowitz Algorithm Based Iterative Learning Control for Hammerstein‐Wiener Systems ⋮ Maximizing Complex Likelihoods via Directed Stochastic Searching Algorithm ⋮ Stochastic approximation ⋮ Multidimensional stochastic approximation ⋮ Two Timescale Analysis of the Alopex Algorithm for Optimization ⋮ Random-direction optimization algorithms with applications to threshold controls ⋮ A short note on SPSA techniques and their use in nonlinear bioprocess identification ⋮ Unnamed Item ⋮ Unnamed Item ⋮ Stochastic approximation algorithms: overview and recent trends. ⋮ A two Timescale Stochastic Approximation Scheme for Simulation-Based Parametric Optimization ⋮ Extremum Seeking Control with Two-Sided Stochastic Perturbations ⋮ Simulation Optimization Using Multi-Time-Scale Adaptive Random Search ⋮ Production planning system for a combination of make-to-stock and make-to-order products ⋮ Simulation optimization: a review of algorithms and applications ⋮ Determination of the Mechanical Properties of a Solid Elastic Medium from a Seismic Wave Propagation Using Two Statistical Estimators ⋮ Stochastic Recursive Inclusions in Two Timescales with Nonadditive Iterate-Dependent Markov Noise ⋮ Parallel Simultaneous Perturbation Optimization ⋮ Derivative-free optimization methods ⋮ Multi-agent consensus under a communication–broadcast mixed environment ⋮ Simulation Optimization: A Review and Exploration in the New Era of Cloud Computing and Big Data ⋮ Efficient Bayesian Experimentation Using an Expected Information Gain Lower Bound ⋮ Iterative learning control using faded measurements without system information: a gradient estimation approach ⋮ Actor-Critic Algorithms with Online Feature Adaptation ⋮ Smoothed Functional Algorithms for Stochastic Optimization Using q -Gaussian Distributions ⋮ A Stochastic Simplex Approximate Gradient (StoSAG) for optimization under uncertainty ⋮ Efficient and direct estimation of the variance-covariance matrix in EM algorithm with interpolation method ⋮ A Bayesian stochastic approximation method ⋮ An incremental off-policy search in a model-free Markov decision process using a single sample path ⋮ Stopping rules for optimization algorithms based on stochastic approximation ⋮ On stochastic extremum seeking via adaptive perturbation-demodulation loop ⋮ Stochastic approximation of global minimum points ⋮ Broadcast control of multi-agent systems ⋮ Stochastic derivative-free optimization using a trust region framework ⋮ Extremum seeking-based optimal EGR set-point design for combustion engines in lean-burn mode ⋮ Gaussian process surrogates for failure detection: a Bayesian experimental design approach ⋮ Multilevel estimation of normalization constants using ensemble Kalman-Bucy filters ⋮ Finding local optima of high-dimensional functions using direct search methods ⋮ Multiscale Q-learning with linear function approximation ⋮ A combined direction stochastic approximation algorithm ⋮ Estimating the position of a moving object based on test disturbance of camera position ⋮ Auxiliary controller design and performance comparative analysis in closed-loop brain-machine interface system ⋮ Continuous action set learning automata for stochastic optimization ⋮ On the use of an SPSA-based model-free controller in quality improvement ⋮ Annealing stochastic approximation Monte Carlo algorithm for neural network training ⋮ Global convergence rate analysis of unconstrained optimization methods based on probabilistic models ⋮ Stochastic optimization using a trust-region method and random models ⋮ Constrained optimization via stochastic approximation with a simultaneous perturbation gradient approximation ⋮ Actor-critic algorithms for hierarchical Markov decision processes ⋮ Reinforcement learning based algorithms for average cost Markov decision processes ⋮ Simulation optimization for revenue management of airlines with cancellations and overbooking ⋮ An adaptive optimization scheme with satisfactory transient performance ⋮ Algorithm for stochastic approximation with trial input perturbation in the nonstationary problem of optimization ⋮ Theoretical connections between optimization algorithms based on an approximate gradient ⋮ SIMULATION-BASED OPTIMIZATION BY NEW STOCHASTIC APPROXIMATION ALGORITHM ⋮ Minimax efficient finite-difference stochastic gradient estimators using black-box function evaluations ⋮ Pseudo-perturbation-based broadcast control of multi-agent systems ⋮ Simultaneous perturbation stochastic approximation of nonsmooth functions ⋮ Feature selection using stochastic approximation with Barzilai and Borwein non-monotone gains ⋮ A novel ADP based model-free predictive control ⋮ Convergence guarantees for generalized adaptive stochastic search methods for continuous global optimization ⋮ Extremum seeking of dynamical systems via gradient descent and stochastic approximation methods ⋮ An actor-critic algorithm with function approximation for discounted cost constrained Markov decision processes ⋮ Variance-constrained actor-critic algorithms for discounted and average reward MDPs ⋮ Interval type-2 recurrent fuzzy neural system for nonlinear systems control using stable simultaneous perturbation stochastic approximation algorithm ⋮ Improved sampling strategies for ensemble-based optimization
This page was built for publication: Multivariate stochastic approximation using a simultaneous perturbation gradient approximation