The method of value oriented successive approximations for the average reward Markov decision process (Q1144501)

From MaRDI portal
Revision as of 08:43, 30 July 2024 by Openalex240730090724 (talk | contribs) (Set OpenAlex properties.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
scientific article
Language Label Description Also known as
English
The method of value oriented successive approximations for the average reward Markov decision process
scientific article

    Statements

    The method of value oriented successive approximations for the average reward Markov decision process (English)
    0 references
    0 references
    0 references
    1980
    0 references
    value oriented successive approximations
    0 references
    average reward
    0 references
    finite state space
    0 references
    finite action space
    0 references
    almost optimal solutions
    0 references
    convergence
    0 references

    Identifiers