An Iterative Aggregation Procedure for Markov Decision Processes
From MaRDI portal
Publication:3939622
DOI10.1287/opre.30.1.62zbMath0481.90090OpenAlexW2102962763MaRDI QIDQ3939622
Publication date: 1982
Published in: Operations Research (Search for Journal in Brave)
Full work available at URL: https://semanticscholar.org/paper/6e3e8afa15d6f34537cdc5bd6e1c6e6fe93eba3a
global convergenceiterative aggregation procedurelarge scale finite state finite action Markov decision processessequence of finite subproblems
Numerical mathematical programming methods (65K05) Markov and semi-Markov decision processes (90C40)
Related Items
Revenue management for operations with urgent orders, Approximate dynamic programming with state aggregation applied to UAV perimeter patrol, Block-scaling of value-iteration for discounted Markov renewal programming, Replacement process decomposition for discounted Markov renewal programming, Iterative variable aggregation and disaggregation in IP: an application, On using discrete random models within decision support systems, State partitioning based linear program for stochastic dynamic programs: an invariance property, Easy Affine Markov Decision Processes, Suboptimal policy determination for large-scale Markov decision processes. II: Implementation and numerical evaluation, A global convergence theorem for aggregation algorithms, Markov decision processes, Suboptimal policy determination for large-scale Markov decision processes. II: Implementation and numerical evaluation, Multi-phase dynamic constraint aggregation for set partitioning type problems, Modified iterative aggregation procedure for maintenance optimisation of multi-component systems with failure interaction, Suboptimal policy determination for large-scale Markov decision processes. I: Description and bounds, Estimating equilibrium probabilities for band diagonal Markov chains using aggregation and disaggregation techniques, Aggregation and disaggregation in Markov decision models for inventory control