The convergence of value iteration in average cost Markov decision chains
DOI: 10.1016/0167-6377(96)00018-1
zbMath: 0865.90134
MaRDI QID: Q2564235
Publication date: 7 January 1997
Published in: Operations Research Letters
Full work available at URL: https://doi.org/10.1016/0167-6377(96)00018-1
Keywords: stochastic dynamic programming; value iteration; countable state space; Markov decision chain; minimum long-run expected average cost
90C40: Markov and semi-Markov decision processes
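For orientation, the method at issue is value iteration under the long-run average cost criterion. The sketch below shows relative value iteration on a small finite MDP; it is purely illustrative and not the paper's algorithm, which concerns countable state spaces under additional conditions. All model data (state and action counts, costs, transition matrices) are hypothetical.

```python
import numpy as np

# Relative value iteration for the long-run average cost criterion on a
# small *finite* MDP. Illustrative only: the paper treats countable state
# spaces, and all model data below are made up.

# cost[s, a]: one-stage cost of taking action a in state s.
cost = np.array([[1.0, 2.0],
                 [0.5, 1.5],
                 [2.0, 0.2]])

# P[a, s, t]: probability of moving from state s to state t under action a.
P = np.array([[[0.6, 0.3, 0.1],
               [0.2, 0.5, 0.3],
               [0.1, 0.2, 0.7]],
              [[0.1, 0.8, 0.1],
               [0.3, 0.3, 0.4],
               [0.5, 0.4, 0.1]]])

v = np.zeros(cost.shape[0])  # relative value function, pinned to 0 at state 0
for _ in range(10_000):
    # Bellman update: Q[s, a] = c(s, a) + sum_t P(t | s, a) v(t)
    Q = cost + np.einsum("ast,t->sa", P, v)
    Tv = Q.min(axis=1)
    diff = Tv - v
    if diff.max() - diff.min() < 1e-10:  # span-seminorm stopping rule
        break
    v = Tv - Tv[0]  # renormalize at the reference state to keep iterates bounded

print("estimated optimal average cost:", Tv[0])
print("greedy stationary policy:", Q.argmin(axis=1))
```

Plain value iteration need not converge for periodic chains; a standard remedy is an aperiodicity transformation (damping each transition matrix toward the identity). Extending convergence to countable state spaces with possibly unbounded costs requires conditions of the kind discussed in the works cited below.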
Cites Work
- Another set of conditions for average optimality in Markov control processes
- On strong average optimality of Markov decision processes with unbounded costs
- Comparing recent assumptions for the existence of average optimal stationary policies
- Optimal control of diffusion processes with reflection
- On Minimum Cost Per Unit Time Control of Markov Chains
- Control of Markov Chains with Long-Run Average Cost Criterion: The Dynamic Programming Equations
- Average Cost Optimal Stationary Policies in Infinite State Markov Decision Processes with Unbounded Costs
- Linear Programming and Average Optimality of Markov Control Processes on Borel Spaces—Unbounded Costs
- Discrete-Time Controlled Markov Processes with Average Cost Criterion: A Survey