The Optimal Reward Operator in Negative Dynamic Programming
From MaRDI portal
Publication:4027786
DOI10.1287/moor.17.4.921zbMath0773.90087OpenAlexW2025643969MaRDI QIDQ4027786
A. P. Maitra, William D. Sudderth
Publication date: 1 March 1993
Published in: Mathematics of Operations Research (Search for Journal in Brave)
Full work available at URL: http://hdl.handle.net/11299/199576
Related Items (5)
A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies ⋮ Ashok Prasad Maitra (1938-2008) ⋮ Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies ⋮ Finitely Additive Dynamic Programming ⋮ On Convergence of Value Iteration for a Class of Total Cost Markov Decision Processes
This page was built for publication: The Optimal Reward Operator in Negative Dynamic Programming