The control of a two-level Markov decision process by time aggregation
Publication:2641752
DOI: 10.1016/j.automatica.2005.11.006
zbMath: 1127.90074
MaRDI QID: Q2641752
Publication date: 23 August 2007
Published in: Automatica
Full work available at URL: https://doi.org/10.1016/j.automatica.2005.11.006
Keywords: Markov decision processes; policy iteration; performance potentials; time aggregation; coupled decisions; two-level systems
Related Items
Cites Work
- Unnamed Item
- Singularly perturbed Markov control problem: Limiting average cost
- Single sample path-based optimization of Markov chains
- The relations among potentials, perturbation analysis, and Markov decision processes
- A time aggregation approach to Markov decision processes
- A single sample path-based performance sensitivity formula for Markov chains
- Performance gradient estimation for the very large finite Markov chains
- Algorithms for singularly perturbed limiting average Markov control problems
- Perturbation realization, potentials, and sensitivity analysis of Markov processes
- Control of singularly perturbed hybrid stochastic systems
- Multitime scale Markov decision processes
- A two-factor stochastic production model with two time scales