The method of value oriented successive approximations for the average reward Markov decision process (Q1144501): Difference between revisions

From MaRDI portal
ReferenceBot (talk | contribs)
Changed an Item
Set OpenAlex properties.
 
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1007/bf01719500 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2123114919 / rank
 
Normal rank

Latest revision as of 09:43, 30 July 2024

scientific article
Language Label Description Also known as
English
The method of value oriented successive approximations for the average reward Markov decision process
scientific article

    Statements

    The method of value oriented successive approximations for the average reward Markov decision process (English)
    0 references
    0 references
    0 references
    0 references
    1980
    0 references
    0 references
    value oriented successive approximations
    0 references
    average reward
    0 references
    finite state space
    0 references
    finite action space
    0 references
    almost optimal solutions
    0 references
    convergence
    0 references
    0 references