On the expected total reward with unbounded returns for Markov decision processes
From MaRDI portal
Publication:2198156
DOI10.1007/S00245-018-9533-6zbMath1441.90176arXiv1712.07874OpenAlexW2963608016WikidataQ129043658 ScholiaQ129043658MaRDI QIDQ2198156
Alexandre Genadot, François Dufour
Publication date: 9 September 2020
Published in: Applied Mathematics and Optimization (Search for Journal in Brave)
Full work available at URL: https://arxiv.org/abs/1712.07874
Discrete-time Markov processes on general state spaces (60J05) Markov and semi-Markov decision processes (90C40)
Related Items (3)
Constrained discounted stochastic games ⋮ Constrained discounted Markov decision processes with Borel state spaces ⋮ Constrained Markov Decision Processes with Expected Total Reward Criteria
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Discounted dynamic programming with unbounded returns: application to economic models
- Stochastic games with unbounded payoffs: applications to robust control in economics
- Persistently optimal plans for nonstationary dynamic programming: The topology of weak convergence case
- On dynamic programming and statistical decision theory
- On compactness of the space of policies in stochastic dynamic programming
- Stochastic optimal control. The discrete time case
- On dynamic programming: Compactness of the space of policies
- Markov programming by successive approximations with respect to weighted supremum norms
- Generalised discounting in dynamic programming with unbounded returns
- On discounted dynamic programming with unbounded returns
- Unbounded mappings and weak convergence of measures
- Existence Without Explicit Compactness in Stochastic Dynamic Programming
- Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
This page was built for publication: On the expected total reward with unbounded returns for Markov decision processes