Second order bounds for Markov decision processes
From MaRDI portal
Publication:1150312
DOI10.1016/0022-247X(81)90106-2zbMath0455.90089MaRDI QIDQ1150312
Publication date: 1981
Published in: Journal of Mathematical Analysis and Applications (Search for Journal in Brave)
Markov decision process; second order bounds; lower bound on the optimal value; value iteration algorithm
90C40: Markov and semi-Markov decision processes
Related Items
Cites Work