Note—A Note on Dynamic Programming with Unbounded Rewards

From MaRDI portal

Publication:4151296

Jump to:navigation, search

DOI10.1287/mnsc.24.5.576zbMath0374.49015OpenAlexW2000859117MaRDI QIDQ4151296

Jaap Wessels, J. A. E. E. Van Nunen

Publication date: 1978

Published in: Management Science (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1287/mnsc.24.5.576

Mathematics Subject Classification ID

Applications of Markov chains and discrete-time Markov processes on general state spaces (social mobility, learning theory, industrial processes, etc.) (60J20) Operations research and management science (90B99) Hamilton-Jacobi theories (49L99)

Related Items

A Contraction Theorem in Inventory Problems ⋮ Bayesian estimation of the mean holding time in average semi-Markov control processes ⋮ Nonuniqueness versus uniqueness of optimal policies in convex discounted Markov decision processes ⋮ Zero-sum continuous-time Markov games with unbounded transition and discounted payoff rates ⋮ Average optimal strategies for zero-sum Markov games with poorly known payoff function on one side ⋮ Robustness inequality for Markov control processes with unbounded costs ⋮ Arbitrary state semi-Markov decision processes ⋮ On theory and algorithms for Markov decision problems with the total reward criterion ⋮ Action-dependent stopping times and Markov decision process with unbounded rewards ⋮ Adaptive control of stochastic systems with unknown disturbance distribution: discounted criteria ⋮ Discounted cost optimality problem: Stability with respect to weak metrics ⋮ Finite state approximations for denumerable state infinite horizon discounted Markov decision processes with unbounded rewards ⋮ Unnamed Item

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4151296&oldid=17958364"