A polynomial time bound for Howard's policy improvement algorithm

Publication:1079511

DOI: 10.1007/BF01720771; zbMath: 0597.90091; Wikidata: Q115149439; Scholia: Q115149439; MaRDI QID: Q1079511

U. Meister, Ulrich D. Holzbaur

Publication date: 1986

Published in: OR Spektrum

zbMATH Keywords

optimal policy; finite state and action space; discounted Markovian Decision Process; Howard's policy improvement

Mathematics Subject Classification ID

Analysis of algorithms and problem complexity (68Q25); Markov and semi-Markov decision processes (90C40)

Related Items (1)

Bounds for the quality and the number of steps in Bellman's value iteration algorithm
