An Iterative Aggregation Procedure for Markov Decision Processes
From MaRDI portal
Publication:3939622
DOI10.1287/opre.30.1.62zbMath0481.90090OpenAlexW2102962763MaRDI QIDQ3939622
Publication date: 1982
Published in: Operations Research (Search for Journal in Brave)
Full work available at URL: https://semanticscholar.org/paper/6e3e8afa15d6f34537cdc5bd6e1c6e6fe93eba3a
global convergenceiterative aggregation procedurelarge scale finite state finite action Markov decision processessequence of finite subproblems
Numerical mathematical programming methods (65K05) Markov and semi-Markov decision processes (90C40)
Related Items (17)
Revenue management for operations with urgent orders ⋮ Approximate dynamic programming with state aggregation applied to UAV perimeter patrol ⋮ Block-scaling of value-iteration for discounted Markov renewal programming ⋮ Replacement process decomposition for discounted Markov renewal programming ⋮ Iterative variable aggregation and disaggregation in IP: an application ⋮ On using discrete random models within decision support systems ⋮ State partitioning based linear program for stochastic dynamic programs: an invariance property ⋮ Easy Affine Markov Decision Processes ⋮ Suboptimal policy determination for large-scale Markov decision processes. II: Implementation and numerical evaluation ⋮ A global convergence theorem for aggregation algorithms ⋮ Markov decision processes ⋮ Suboptimal policy determination for large-scale Markov decision processes. II: Implementation and numerical evaluation ⋮ Multi-phase dynamic constraint aggregation for set partitioning type problems ⋮ Modified iterative aggregation procedure for maintenance optimisation of multi-component systems with failure interaction ⋮ Suboptimal policy determination for large-scale Markov decision processes. I: Description and bounds ⋮ Estimating equilibrium probabilities for band diagonal Markov chains using aggregation and disaggregation techniques ⋮ Aggregation and disaggregation in Markov decision models for inventory control
This page was built for publication: An Iterative Aggregation Procedure for Markov Decision Processes