A value iteration method for undiscounted multichain Markov decision processes (Q3789375)
scientific article; zbMATH DE number 4053390
Statements
Title: A value iteration method for undiscounted multichain Markov decision processes (English)
Publication year: 1988
Keywords: successive approximations; value iteration; infinite horizon average expected reward Markov decision processes; decomposition; \(\epsilon\)-optimal policy