Incremental Value Iteration for Time-Aggregated Markov-Decision Processes
Publication:5282301
DOI: 10.1109/TAC.2007.908359
zbMath: 1366.90218
MaRDI QID: Q5282301
Peter B. Luh, Tao Sun, Qian-Chuan Zhao
Publication date: 27 July 2017
Published in: IEEE Transactions on Automatic Control
Optimal stochastic control (93E20); Stochastic learning and adaptive control (93E35); Markov and semi-Markov decision processes (90C40)
Related Items (6)
Event-based optimization approach for solving stochastic decision problems with probabilistic constraint ⋮ Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm ⋮ Revenue management for operations with urgent orders ⋮ A numerical study of Markov decision process algorithms for multi-component replacement problems ⋮ Time aggregated Markov decision processes via standard dynamic programming ⋮ A unified approach to time-aggregated Markov decision processes