On the expected total reward with unbounded returns for Markov decision processes (Q2198156): Difference between revisions
From MaRDI portal
ReferenceBot (talk | contribs) Changed an Item |
Created claim: Wikidata QID (P12): Q129043658, #quickstatements; #temporary_batch_1724922362815 |
||
Property / Wikidata QID | |||
Property / Wikidata QID: Q129043658 / rank | |||
Normal rank |
Revision as of 10:10, 29 August 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | On the expected total reward with unbounded returns for Markov decision processes |
scientific article |
Statements
On the expected total reward with unbounded returns for Markov decision processes (English)
0 references
9 September 2020
0 references
Markov decision processes
0 references
expected total reward
0 references
unbounded return
0 references
weak convergence of measure
0 references
0 references
0 references
0 references