The method of value oriented successive approximations for the average reward Markov decision process (Q1144501)
From MaRDI portal
![]() | This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: The method of value oriented successive approximations for the average reward Markov decision process |
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | The method of value oriented successive approximations for the average reward Markov decision process |
scientific article |
Statements
The method of value oriented successive approximations for the average reward Markov decision process (English)
0 references
1980
0 references
value oriented successive approximations
0 references
average reward
0 references
finite state space
0 references
finite action space
0 references
almost optimal solutions
0 references
convergence
0 references