Publication:3724110

From MaRDI portal

Jump to:navigation, search

zbMath0593.90082MaRDI QIDQ3724110

Karel Sladký

Publication date: 1986

zbMATH Keywords

nonnegative matrices; discrete time; policy iteration; Markov decision chains; decomposition of the state space; maximum expected rewards

Mathematics Subject Classification ID

90C39: Dynamic programming

15B48: Positive matrices and their generalizations; cones of matrices

90C40: Markov and semi-Markov decision processes

Related Items

Dynamics of piecewise linear maps and sets of nonnegative matrices

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:3724110&oldid=17233261"