Computational comparison of policy iteration algorithms for discounted Markov decision processes
From MaRDI portal
Publication:1088914
DOI10.1016/0305-0548(86)90028-6zbMath0617.90086OpenAlexW2087482191WikidataQ115104694 ScholiaQ115104694MaRDI QIDQ1088914
A. C. Lavercombe, Lyn C. Thomas, Roger T. Hartley
Publication date: 1986
Published in: Computers \& Operations Research (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1016/0305-0548(86)90028-6
Numerical mathematical programming methods (65K05) Markov and semi-Markov decision processes (90C40)
Cites Work
- Computational comparison of value iteration algorithms for discounted Markov decision processes
- Computing the discounted return in markov and semi-markov chains
- Improved iterative computation of the expected discounted return in Markov and semi-Markov chains
- Bounds and Transformations for Discounted Finite Markov Decision Chains
- Technical Note—Accelerated Computation of the Expected Discounted Return in a Markov Chain
- Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems
- Unnamed Item
- Unnamed Item