The policy iteration algorithm for average reward Markov decision processes with general state space (Q4395828)

scientific article; zbMATH DE number 1164352

    Statements

    The policy iteration algorithm for average reward Markov decision processes with general state space (English)
    12 August 1998
    Howard's policy iteration algorithm
    controlled Markov chains
    optimal control
    queueing networks
    deterministic routing
