The method of value oriented successive approximations for the average reward Markov decision process

From MaRDI portal
Publication:1144501

DOI: 10.1007/BF01719500

zbMath: 0443.90109

OpenAlex: W2123114919

MaRDI QID: Q1144501

S. H. Smith

Publication date: 1980

Published in: OR Spektrum

Full work available at URL: https://doi.org/10.1007/bf01719500