The method of value oriented successive approximations for the average reward Markov decision process (Q1144501): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
Import240304020342 (talk | contribs)
Set profile property.
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank

Revision as of 02:20, 5 March 2024

scientific article
Language Label Description Also known as
English
The method of value oriented successive approximations for the average reward Markov decision process
scientific article

    Statements

    The method of value oriented successive approximations for the average reward Markov decision process (English)
    0 references
    0 references
    0 references
    1980
    0 references
    value oriented successive approximations
    0 references
    average reward
    0 references
    finite state space
    0 references
    finite action space
    0 references
    almost optimal solutions
    0 references
    convergence
    0 references

    Identifiers