New approximate dynamic programming algorithms for large-scale undiscounted Markov decision processes and their application to optimize a production and distribution system
From MaRDI portal
(Redirected from Publication:320866)
Recommendations
- The optimal control of just-in-time-based production and distribution systems and performance comparisons with optimized pull systems
- Approximate dynamic programming by practical examples
- Simplex algorithm for countable-state discounted Markov decision processes
- Approximate dynamic programming for stochastic linear control problems on compact state spaces
- Suboptimal policy determination for large-scale Markov decision processes. I: Description and bounds
Cites work
- scientific article; zbMATH DE number 3126094 (Why is no real title available?)
- scientific article; zbMATH DE number 1321699 (Why is no real title available?)
- scientific article; zbMATH DE number 700091 (Why is no real title available?)
- A comparison of production-line control mechanisms
- A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
- Approximate Dynamic Programming
- Approximate Dynamic Programming via a Smoothed Linear Program
- Approximate policy iteration: a survey and some new methods
- CONVERGENCE OF SIMULATION-BASED POLICY ITERATION
- Computing Optimal Policies for Controlled Tandem Queueing Systems
- Optimal numbers of two kinds of kanbans in a JIT production system
- Simulation-based algorithms for Markov decision processes.
- Simulation-based optimization: Parametric optimization techniques and reinforcement learning
- Solving semi-Markov decision problems using average reward reinforcement learning
- Stochastic learning and optimization. A sensitivity-based approach.
- The optimal control of just-in-time-based production and distribution systems and performance comparisons with optimized pull systems
Cited in
(7)- A typology and literature review on stochastic multi-echelon inventory models
- Dynamic assignment of a multi-skilled workforce in job shops: an approximate dynamic programming approach
- Integrated optimization of material supplying, manufacturing, and product distribution: models and fast algorithms
- A performance-centred approach to optimising maintenance of complex systems
- Fast heuristic approach for control of complex authentication systems
- Sensitivity and covariance in stochastic complementarity problems with an application to north American natural gas markets
- Relevant states and memory in Markov chain bootstrapping and simulation
This page was built for publication: New approximate dynamic programming algorithms for large-scale undiscounted Markov decision processes and their application to optimize a production and distribution system
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q320866)