A value iteration method for undiscounted multichain Markov decision processes

From MaRDI portal
Publication:3789375