Uniform value in dynamic programming
From MaRDI portal
Publication: Q621850
DOI: 10.4171/JEMS/254
zbMATH Open: 1229.90253
arXiv: 0803.2758
MaRDI QID: Q621850
Authors: Jérôme Renault
Publication date: 28 January 2011
Published in: Journal of the European Mathematical Society (JEMS)
Abstract: We consider dynamic programming problems with a large time horizon and give sufficient conditions for the existence of the uniform value. As a consequence, we obtain an existence result when the state space is precompact, payoffs are uniformly continuous, and the transition correspondence is nonexpansive. In the same spirit, we give an existence result for the limit value. We also apply our results to Markov decision processes and obtain a few generalizations of existing results.
Full work available at URL: https://arxiv.org/abs/0803.2758
Recommendations
- General limit value in dynamic programming
- Asymptotic properties in dynamic programming
- Low discounting and the upper long-run average value in dynamic programming
- Strong uniform value in gambling houses and partially observable Markov decision processes
- A Uniform Tauberian Theorem in Dynamic Programming
Cited In (30)
- Ergodicity conditions for zero-sum games
- Definable Zero-Sum Stochastic Games
- A zero-sum stochastic game with compact action sets and no asymptotic value
- On values of repeated games with signals
- Finite-Memory Strategies in POMDPs with Long-Run Average Objectives
- Existence of the uniform value in zero-sum repeated games with a more informed controller
- Asymptotic properties of optimal trajectories in dynamic programming
- Asymptotic Control for a Class of Piecewise Deterministic Markov Processes Associated to Temperate Viruses
- Existence of asymptotic values for nonexpansive stochastic control systems
- Bounded variation of \(\{V_ n\}\) and its limit
- LP based upper and lower bounds for Cesàro and Abel limits of the optimal values in problems of control of stochastic discrete time systems
- Commutative Stochastic Games
- Acyclic Gambling Games
- Tauberian theorem for value functions
- Strong uniform value in gambling houses and partially observable Markov decision processes
- History-dependent Evaluations in Partially Observable Markov Decision Process
- Finitely additive dynamic programming
- Linear programming formulations of deterministic infinite horizon optimal control problems in discrete time
- Generic uniqueness of the bias vector of finite zero-sum stochastic games with perfect information
- A survey of average cost problems in deterministic discrete-time control systems
- Limit value for optimal control with general means
- An accretive operator approach to ergodic zero-sum stochastic games
- Stochastic Games
- A Tauberian theorem for nonexpansive operators and applications to zero-sum stochastic games
- General limit value in dynamic programming
- Vanishing Discount Limit and Nonexpansive Optimal Control and Differential Games
- On representation formulas for long run averaging optimal control problem
- A game theory approach to the existence and uniqueness of nonlinear Perron-Frobenius eigenvectors
- Representation Formulas for Limit Values of Long Run Stochastic Optimal Controls
- Communicating zero-sum product stochastic games