On a set of optimal policies in continuous time Markovian decision problem (Q2544324)
From MaRDI portal
| This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: On a set of optimal policies in continuous time Markovian decision problem |
scientific article; zbMATH DE number 3335609
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined |
||
| English | On a set of optimal policies in continuous time Markovian decision problem |
scientific article; zbMATH DE number 3335609 |
Statements
On a set of optimal policies in continuous time Markovian decision problem (English)
0 references
1971
0 references
This paper is concerned with a continuous time parameter Markovian decision problem with finitely many states and actions. An \(\alpha\)-optimal policy, which maximizes the total expected return with discount factor \(\alpha\), is considered. It is shown that the set of \(\alpha\)-optimal policies consists of the policies of which action belong to the specified set. An upper bound of the number of a-optimal regions and a simple procedure to find them are proposed from the view point of a sensitivity analysis. Moreover, the limiting properties of \(\alpha\)-optimal policies and the set of optimal policies in some sense are discussed.
0 references
0 references
0.8385366797447205
0 references
0.8343241214752197
0 references
0.8332033157348633
0 references
0.8330738544464111
0 references
0.8317187428474426
0 references