Action-dependent stopping times and Markov decision process with unbounded rewards

Publication:1158111

DOI: 10.1007/BF01783952; zbMath: 0471.90094; OpenAlex: W2058997544; MaRDI QID: Q1158111

J. A. E. E. van Nunen, Shaler Stidham Jr.

Publication date: 1981

Published in: OR Spektrum

Full work available at URL: https://doi.org/10.1007/bf01783952


