A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion (Q481787)

From MaRDI portal
scientific article

    Statements

    A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion (English)
    15 December 2014
    strong sample-path optimality
    Lyapunov function condition
    stationary policy
    expected average reward criterion

    Identifiers