Publication:3724110
zbMath: 0593.90082; MaRDI QID: Q3724110
Publication date: 1986
Keywords: nonnegative matrices; discrete time; policy iteration; Markov decision chains; decomposition of the state space; maximum expected rewards
90C39: Dynamic programming
15B48: Positive matrices and their generalizations; cones of matrices
90C40: Markov and semi-Markov decision processes