Stochastic recursive algorithms for optimization. Simultaneous perturbation methods

From MaRDI portal
Revision as of 05:05, 30 January 2024 by Import240129110155 (talk | contribs) (Created automatically from import240129110155)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:441138

DOI10.1007/978-1-4471-4285-0zbMath1260.90002OpenAlexW2493209382MaRDI QIDQ441138

H. L. Prasad, L. A. Prashanth, Shalabh Bhatnagar

Publication date: 20 August 2012

Published in: Lecture Notes in Control and Information Sciences (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/978-1-4471-4285-0




Related Items (23)

Multiscale Q-learning with linear function approximationRisk-Sensitive Reinforcement Learning via Policy Gradient SearchVariance-constrained actor-critic algorithms for discounted and average reward MDPsVariable ansatz applied to spectral operator decomposition in a physical superconducting quantum deviceA model for data transmission and its optimizationStochastic approximation procedures for Lévy-driven SDEsModeling and control of data transmissionLearning equilibrium mean‐variance strategyNested kriging predictions for datasets with a large number of observationsGeneralization of a result of Fabian on the asymptotic normality of stochastic approximationGradient-Based Adaptive Stochastic Search for Simulation Optimization Over Continuous SpaceNewton-based stochastic optimization using \(q\)-Gaussian smoothed functional algorithmsSmoothed functional-based gradient algorithms for off-policy reinforcement learning: a non-asymptotic viewpointQuasi-Newton smoothed functional algorithms for unconstrained and constrained simulation optimizationRisk-Constrained Reinforcement Learning with Percentile Risk CriteriaSimultaneous perturbation Newton algorithms for simulation optimizationA simulation‐based algorithm for optimal pricing policy under demand uncertaintyRecurrent neural networks as optimal mesh refinement strategiesDerivative-free optimization methodsSimulation methods for robust risk assessment and the distorted mix approachQuantum simulation of the ground-state Stark effect in small molecules: a case study using IBM QIterative learning control using faded measurements without system information: a gradient estimation approachActor-Critic Algorithms with Online Feature Adaptation




This page was built for publication: Stochastic recursive algorithms for optimization. Simultaneous perturbation methods