A polynomial time bound for Howard's policy improvement algorithm (Q1079511)

From MaRDI portal
Revision as of 00:29, 31 January 2024 by Import240129110113 (talk | contribs) (Added link to MaRDI item.)
scientific article
Language Label Description Also known as
English
A polynomial time bound for Howard's policy improvement algorithm
scientific article

    Statements

    A polynomial time bound for Howard's policy improvement algorithm (English)
    0 references
    0 references
    0 references
    0 references
    1986
    0 references
    discounted Markovian Decision Process
    0 references
    finite state and action space
    0 references
    Howard's policy improvement
    0 references
    optimal policy
    0 references

    Identifiers