Finite-horizon dynamic optimisation when the terminal reward is a concave functional of the distribution of the final state
From MaRDI portal
Publication:4391409
DOI10.1239/aap/1035227995zbMath0904.90171OpenAlexW2003418532MaRDI QIDQ4391409
E. J. Collins, John M. McNamara
Publication date: 2 June 1998
Published in: Advances in Applied Probability (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1239/aap/1035227995
Dynamic programming in optimal control and differential games (49L20) Dynamic programming (90C39) Markov and semi-Markov decision processes (90C40) Ecology (92D40)
Related Items
Finite-horizon variance penalised Markov decision processes ⋮ Markov decision processes with average-value-at-risk criteria ⋮ Optimal policies for constrained average-cost Markov decision processes ⋮ The double chain markov model ⋮ High-order extensions of the Double Chain Markov Model