The policy iteration algorithm for average reward Markov decision processes with general state space (Q4395828)

From MaRDI portal
Revision as of 23:58, 12 February 2024 by RedirectionBot (talk | contribs) (‎Removed claim: author (P16): Item:Q313149)
scientific article; zbMATH DE number 1164352
Language Label Description Also known as
English
The policy iteration algorithm for average reward Markov decision processes with general state space
scientific article; zbMATH DE number 1164352

    Statements

    The policy iteration algorithm for average reward Markov decision processes with general state space (English)
    0 references
    12 August 1998
    0 references
    Howard's policy iteration algorithm
    0 references
    controlled Markov chains
    0 references
    optimal control
    0 references
    queueing networks
    0 references
    deterministic routing
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references