A polynomial time bound for Howard's policy improvement algorithm
From MaRDI portal
Publication:1079511
DOI10.1007/BF01720771zbMath0597.90091WikidataQ115149439 ScholiaQ115149439MaRDI QIDQ1079511
U. Meister, Ulrich D. Holzbaur
Publication date: 1986
Published in: OR Spektrum (Search for Journal in Brave)
optimal policyfinite state and action spacediscounted Markovian Decision ProcessHoward's policy improvement
Analysis of algorithms and problem complexity (68Q25) Markov and semi-Markov decision processes (90C40)
Related Items (1)
Cites Work
This page was built for publication: A polynomial time bound for Howard's policy improvement algorithm