The method of value oriented successive approximations for the average reward Markov decision process (Q1144501): Difference between revisions
From MaRDI portal
ReferenceBot (talk | contribs) Changed an Item |
Set OpenAlex properties. |
||
Property / full work available at URL | |||
Property / full work available at URL: https://doi.org/10.1007/bf01719500 / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W2123114919 / rank | |||
Normal rank |
Latest revision as of 08:43, 30 July 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | The method of value oriented successive approximations for the average reward Markov decision process |
scientific article |
Statements
The method of value oriented successive approximations for the average reward Markov decision process (English)
0 references
1980
0 references
value oriented successive approximations
0 references
average reward
0 references
finite state space
0 references
finite action space
0 references
almost optimal solutions
0 references
convergence
0 references
0 references
0 references
0 references