Aggregation of the policy iteration method for nearly completely decomposable Markov chains
Publication:3984583
DOI: 10.1109/9.67293; zbMath: 0762.93079; OpenAlex: W2138519053; MaRDI QID: Q3984583
Rabah W. Aldhaheri, Hassan K. Khalil
Publication date: 27 June 1992
Published in: IEEE Transactions on Automatic Control
Full work available at URL: https://doi.org/10.1109/9.67293
Keywords: reduced-order system; ill-conditioned equations; Howard's algorithm; steady-state optimal control problem; large scale finite-state Markov chains
Related Items
A time aggregation approach to Markov decision processes ⋮ Event-based optimization approach for solving stochastic decision problems with probabilistic constraint ⋮ Revenue management for operations with urgent orders ⋮ A multi-cluster time aggregation approach for Markov chains ⋮ Approximate optimal adaptive control for weakly coupled nonlinear systems: A neuro-inspired approach