Adaptive stepsizes for recursive estimation with applications in approximate dynamic programming

DOI 10.1007/S10994-006-8365-9, MaRDI QID Q851872

Authors Abraham George, Warren Powell

Publication date 22 November 2006

Published in Machine Learning

Full work available at URL https://doi.org/10.1007/s10994-006-8365-9

zbMATH Keywords

Kalman filter; approximate dynamic programming; adaptive learning; stochastic stepsize

Mathematics Subject Classification ID

Dynamic programming (90C39)

Cited in (24)

Projected stochastic gradients for convex constrained problems in Hilbert spaces
ASD+M: automatic parameter tuning in stochastic optimization and on-line learning
Risk-averse approximate dynamic programming with quantile-based risk measures
Minimizing total tardiness in a stochastic single machine scheduling problem using approximate dynamic programming
An inexact restoration-nonsmooth algorithm with variable accuracy for stochastic nonsmooth convex optimization problems in machine learning and stochastic linear complementarity problems
A stochastic gradient method for a class of nonlinear PDE-constrained optimal control problems under uncertainty
Bayesian exploration for approximate dynamic programming
Probabilistic line searches for stochastic optimization
Benchmarking a scalable approximate dynamic programming algorithm for stochastic control of grid-level energy storage
Cross-docking based factory logistics unitisation process: an approximate dynamic programming approach
Scalable estimation strategies based on stochastic approximations: classical results and new insights
Autonomous reinforcement learning with experience replay
Block-cyclic stochastic coordinate descent for deep neural networks
Convergence rates and decoupling in linear stochastic approximation algorithms
Approximate dynamic programming for lateral transshipment problems in multi-location inventory systems
Integrated condition-based maintenance and multi-item lot-sizing with stochastic demand
Adaptive step-size selection for state-space probabilistic differential equation solvers
A tutorial on value function approximation for stochastic and dynamic transportation
A stochastic line search method with expected complexity analysis
A stochastic successive minimization method for nonsmooth nonconvex optimization with applications to transceiver design in wireless communication networks
Reinforcement learning algorithms with function approximation: recent advances and applications
An approximated dynamic programming model for the supply vessel fleet sizing problem
Stochastic model predictive control with adaptive constraint tightening for non-conservative chance constraints satisfaction
A unified framework for stochastic optimization
