The method of value oriented successive approximations for the average reward Markov decision process (Q1144501)

From MaRDI portal
Revision as of 02:20, 5 March 2024 by Import240304020342 (talk | contribs) (Set profile property.)
scientific article
Language Label Description Also known as
English
The method of value oriented successive approximations for the average reward Markov decision process
scientific article

    Statements

    The method of value oriented successive approximations for the average reward Markov decision process (English)
    0 references
    0 references
    0 references
    1980
    0 references
    value oriented successive approximations
    0 references
    average reward
    0 references
    finite state space
    0 references
    finite action space
    0 references
    almost optimal solutions
    0 references
    convergence
    0 references

    Identifiers