Pages that link to "Item:Q851872"
From MaRDI portal
The following pages link to Adaptive stepsizes for recursive estimation with applications in approximate dynamic programming (Q851872):
Displayed 21 items.
- A stochastic successive minimization method for nonsmooth nonconvex optimization with applications to transceiver design in wireless communication networks (Q301668) (← links)
- Autonomous reinforcement learning with experience replay (Q461126) (← links)
- Minimizing total tardiness in a stochastic single machine scheduling problem using approximate dynamic programming (Q633553) (← links)
- Reinforcement learning algorithms with function approximation: recent advances and applications (Q903601) (← links)
- Approximate dynamic programming for lateral transshipment problems in multi-location inventory systems (Q1681324) (← links)
- Stochastic model predictive control with adaptive constraint tightening for non-conservative chance constraints satisfaction (Q1716435) (← links)
- A unified framework for stochastic optimization (Q1719609) (← links)
- An inexact restoration-nonsmooth algorithm with variable accuracy for stochastic nonsmooth convex optimization problems in machine learning and stochastic linear complementarity problems (Q2112678) (← links)
- ASD+M: automatic parameter tuning in stochastic optimization and on-line learning (Q2179079) (← links)
- Integrated condition-based maintenance and multi-item lot-sizing with stochastic demand (Q2698605) (← links)
- Probabilistic Line Searches for Stochastic Optimization (Q4637044) (← links)
- Bayesian Exploration for Approximate Dynamic Programming (Q4971589) (← links)
- Benchmarking a Scalable Approximate Dynamic Programming Algorithm for Stochastic Control of Grid-Level Energy Storage (Q5131714) (← links)
- A Stochastic Line Search Method with Expected Complexity Analysis (Q5215517) (← links)
- Risk-Averse Approximate Dynamic Programming with Quantile-Based Risk Measures (Q5219554) (← links)
- Projected Stochastic Gradients for Convex Constrained Problems in Hilbert Spaces (Q5231699) (← links)
- Convergence Rates and Decoupling in Linear Stochastic Approximation Algorithms (Q5254881) (← links)
- Scalable estimation strategies based on stochastic approximations: classical results and new insights (Q5963780) (← links)
- A stochastic gradient method for a class of nonlinear PDE-constrained optimal control problems under uncertainty (Q6041823) (← links)
- Block-cyclic stochastic coordinate descent for deep neural networks (Q6054553) (← links)
- Cross-docking based factory logistics unitisation process: an approximate dynamic programming approach (Q6168593) (← links)