On the expected total reward with unbounded returns for Markov decision processes (Q2198156)

From MaRDI portal

Revision as of 10:10, 29 August 2024

Language	Label	Description	Also known as
English	On the expected total reward with unbounded returns for Markov decision processes	scientific article

Statements

scholarly article

On the expected total reward with unbounded returns for Markov decision processes (English)

Applied Mathematics and Optimization

publication date

9 September 2020

full work available at URL

https://arxiv.org/abs/1712.07874

zbMATH Keywords

Markov decision processes

expected total reward

unbounded return

weak convergence of measure

François Dufour

Alexandre Genadot

MaRDI profile type

On compactness of the space of policies in stochastic dynamic programming

Existence Without Explicit Compactness in Stochastic Dynamic Programming

Stochastic optimal control. The discrete time case

Generalised discounting in dynamic programming with unbounded returns

Discounted dynamic programming with unbounded returns: application to economic models

Stochastic games with unbounded payoffs: applications to robust control in economics

Persistently optimal plans for nonstationary dynamic programming: The topology of weak convergence case

On discounted dynamic programming with unbounded returns

Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal

On dynamic programming: Compactness of the space of policies

On dynamic programming and statistical decision theory

Markov programming by successive approximations with respect to weighted supremum norms

Unbounded mappings and weak convergence of measures

Identifiers

zbMATH Open document ID

10.1007/s00245-018-9533-6

Mathematics Subject Classification ID

zbMATH DE Number

Sitelinks

Mathematics(1 entry)

mardi Publication:2198156
