Note—A Note on Dynamic Programming with Unbounded Rewards
From MaRDI portal
Publication:4151296
DOI: 10.1287/mnsc.24.5.576
zbMath: 0374.49015
OpenAlex: W2000859117
MaRDI QID: Q4151296
Jaap Wessels, J. A. E. E. Van Nunen
Publication date: 1978
Published in: Management Science
Full work available at URL: https://doi.org/10.1287/mnsc.24.5.576
Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20) ⋮ Operations research and management science (90B99) ⋮ Hamilton-Jacobi theories (49L99)
Related Items
A Contraction Theorem in Inventory Problems ⋮ Bayesian estimation of the mean holding time in average semi-Markov control processes ⋮ Nonuniqueness versus uniqueness of optimal policies in convex discounted Markov decision processes ⋮ Zero-sum continuous-time Markov games with unbounded transition and discounted payoff rates ⋮ Average optimal strategies for zero-sum Markov games with poorly known payoff function on one side ⋮ Robustness inequality for Markov control processes with unbounded costs ⋮ Arbitrary state semi-Markov decision processes ⋮ On theory and algorithms for Markov decision problems with the total reward criterion ⋮ Action-dependent stopping times and Markov decision process with unbounded rewards ⋮ Adaptive control of stochastic systems with unknown disturbance distribution: discounted criteria ⋮ Discounted cost optimality problem: Stability with respect to weak metrics ⋮ Finite state approximations for denumerable state infinite horizon discounted Markov decision processes with unbounded rewards ⋮ Unnamed Item