A successive approximation algorithm for an undiscounted Markov decision process

From MaRDI portal

Publication:1231344

DOI: 10.1007/BF02276760 ⋮ zbMath: 0339.60069 ⋮ OpenAlex: W2163408601 ⋮ MaRDI QID: Q1231344

Publication date: 1976

Published in: Computing

Full work available at URL: https://doi.org/10.1007/bf02276760

Mathematics Subject Classification ID

Optimal stochastic control (93E20) ⋮ Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20)

Related Items (3)

Some basic concepts of numerical treatment of Markov decision models ⋮ The method of value oriented successive approximations for the average reward Markov decision process ⋮ Improved iterative computation of the expected discounted return in Markov and semi-Markov chains
