On the expected total reward with unbounded returns for Markov decision processes (Q2198156): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
Import241208061232 (talk | contribs)
Normalize DOI.
 
(8 intermediate revisions by 7 users not shown)
Property / DOI
 
Property / DOI: 10.1007/s00245-018-9533-6 / rank
Normal rank
 
Property / author
 
Property / author: François Dufour / rank
Normal rank
 
Property / author
 
Property / author: Alexandre Genadot / rank
Normal rank
 
Property / author
 
Property / author: François Dufour / rank
 
Normal rank
Property / author
 
Property / author: Alexandre Genadot / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2963608016 / rank
 
Normal rank
Property / arXiv ID
 
Property / arXiv ID: 1712.07874 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On compactness of the space of policies in stochastic dynamic programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Existence Without Explicit Compactness in Stochastic Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic optimal control. The discrete time case / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3415147 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5599448 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Generalised discounting in dynamic programming with unbounded returns / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discounted dynamic programming with unbounded returns: application to economic models / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic games with unbounded payoffs: applications to robust control in economics / rank
 
Normal rank
Property / cites work
 
Property / cites work: Persistently optimal plans for nonstationary dynamic programming: The topology of weak convergence case / rank
 
Normal rank
Property / cites work
 
Property / cites work: On discounted dynamic programming with unbounded returns / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3832355 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal / rank
 
Normal rank
Property / cites work
 
Property / cites work: On dynamic programming: Compactness of the space of policies / rank
 
Normal rank
Property / cites work
 
Property / cites work: On dynamic programming and statistical decision theory / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov programming by successive approximations with respect to weighted supremum norms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Unbounded mappings and weak convergence of measures / rank
 
Normal rank
Property / Wikidata QID
 
Property / Wikidata QID: Q129043658 / rank
 
Normal rank
Property / DOI
 
Property / DOI: 10.1007/S00245-018-9533-6 / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 10:54, 17 December 2024

scientific article
Language Label Description Also known as
English
On the expected total reward with unbounded returns for Markov decision processes
scientific article

    Statements

    On the expected total reward with unbounded returns for Markov decision processes (English)
    0 references
    9 September 2020
    0 references
    Markov decision processes
    0 references
    expected total reward
    0 references
    unbounded return
    0 references
    weak convergence of measure
    0 references
    0 references
    0 references

    Identifiers