On the expected total reward with unbounded returns for Markov decision processes

Publication:2198156

DOI: 10.1007/S00245-018-9533-6; zbMath: 1441.90176; arXiv: 1712.07874; OpenAlex: W2963608016; Wikidata: Q129043658; Scholia: Q129043658; MaRDI QID: Q2198156

Alexandre Genadot, François Dufour

Publication date: 9 September 2020

Published in: Applied Mathematics and Optimization

Full work available at URL: https://arxiv.org/abs/1712.07874

zbMATH Keywords

Markov decision processes; expected total reward; unbounded return; weak convergence of measure

Mathematics Subject Classification ID

Discrete-time Markov processes on general state spaces (60J05); Markov and semi-Markov decision processes (90C40)

Related Items (3)

Constrained discounted stochastic games ⋮ Constrained discounted Markov decision processes with Borel state spaces ⋮ Constrained Markov Decision Processes with Expected Total Reward Criteria
