Adaptive stepsizes for recursive estimation with applications in approximate dynamic programming
From MaRDI portal
Publication:851872
Recommendations
- scientific article; zbMATH DE number 4003938
- Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes
- Adaptive stochastic approximation algorithm
- Adaptive value function approximation for continuous-state stochastic dynamic programming
- Rates of convergence of adaptive step-size of stochastic approximation algorithms
Cites work
- scientific article; zbMATH DE number 5957196 (Why is no real title available?)
- scientific article; zbMATH DE number 3155238 (Why is no real title available?)
- scientific article; zbMATH DE number 4043678 (Why is no real title available?)
- scientific article; zbMATH DE number 4085419 (Why is no real title available?)
- scientific article; zbMATH DE number 3626409 (Why is no real title available?)
- scientific article; zbMATH DE number 1321699 (Why is no real title available?)
- scientific article; zbMATH DE number 1043533 (Why is no real title available?)
- scientific article; zbMATH DE number 1380646 (Why is no real title available?)
- scientific article; zbMATH DE number 4118164 (Why is no real title available?)
- scientific article; zbMATH DE number 870530 (Why is no real title available?)
- scientific article; zbMATH DE number 3206654 (Why is no real title available?)
- A Stochastic Approximation Method
- A method of aggregate stochastic subgradients with on-line stepsize rules for convex stochastic programming problems
- A stochastic gradient adaptive filter with gradient adaptive step size
- Accelerated Stochastic Approximation
- Adaptive filtering
- Analysis of adaptive step-size SA algorithms for parameter tracking
- Approximation Methods which Converge with Probability one
- Exponentiated gradient versus gradient descent for linear predictors
- Forecasting sales by exponentially weighted moving averages
- Introduction to Stochastic Search and Optimization
- Learning Applied to Successive Approximation Algorithms
- Multidimensional Stochastic Approximation Methods
- On the identification of non-stationary linear processes
- Recursive estimation and time-series analysis. An introduction
- Stochastic Estimation of the Maximum of a Regression Function
- The elements of statistical learning. Data mining, inference, and prediction
Cited in
(24)- Integrated condition-based maintenance and multi-item lot-sizing with stochastic demand
- Approximate dynamic programming for lateral transshipment problems in multi-location inventory systems
- Minimizing total tardiness in a stochastic single machine scheduling problem using approximate dynamic programming
- Risk-averse approximate dynamic programming with quantile-based risk measures
- A stochastic gradient method for a class of nonlinear PDE-constrained optimal control problems under uncertainty
- Scalable estimation strategies based on stochastic approximations: classical results and new insights
- An approximated dynamic programming model for the supply vessel fleet sizing problem
- A tutorial on value function approximation for stochastic and dynamic transportation
- Convergence rates and decoupling in linear stochastic approximation algorithms
- Benchmarking a scalable approximate dynamic programming algorithm for stochastic control of grid-level energy storage
- Probabilistic line searches for stochastic optimization
- An inexact restoration-nonsmooth algorithm with variable accuracy for stochastic nonsmooth convex optimization problems in machine learning and stochastic linear complementarity problems
- Adaptive step-size selection for state-space probabilistic differential equation solvers
- Stochastic model predictive control with adaptive constraint tightening for non-conservative chance constraints satisfaction
- A unified framework for stochastic optimization
- Reinforcement learning algorithms with function approximation: recent advances and applications
- Cross-docking based factory logistics unitisation process: an approximate dynamic programming approach
- Projected stochastic gradients for convex constrained problems in Hilbert spaces
- A stochastic successive minimization method for nonsmooth nonconvex optimization with applications to transceiver design in wireless communication networks
- Bayesian exploration for approximate dynamic programming
- A stochastic line search method with expected complexity analysis
- Block-cyclic stochastic coordinate descent for deep neural networks
- ASD+M: automatic parameter tuning in stochastic optimization and on-line learning
- Autonomous reinforcement learning with experience replay
This page was built for publication: Adaptive stepsizes for recursive estimation with applications in approximate dynamic programming
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q851872)