Multilayer control of large Markov chains
Publication:4167206
DOI: 10.1109/TAC.1978.1101707
zbMath: 0386.49009
MaRDI QID: Q4167206
Jean-Pierre Forestier, Pravin P. Varaiya
Publication date: 1978
Published in: IEEE Transactions on Automatic Control
Markov and semi-Markov decision processes (90C40) ⋮ Existence of optimal solutions to problems involving randomness (49J55)
Related Items
A time aggregation approach to Markov decision processes ⋮ Control of a finite-dimensional system using a supervisor ⋮ Actor-critic algorithms for hierarchical Markov decision processes ⋮ Time scale decomposition in production planning for unreliable flexible manufacturing systems ⋮ A multi-cluster time aggregation approach for Markov chains ⋮ Suboptimal policy determination for large-scale Markov decision processes. I: Description and bounds