A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion (Q481787)

From MaRDI portal
Revision as of 18:49, 9 December 2024 by Import241208061232 (talk | contribs) (Normalize DOI.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
scientific article
Language Label Description Also known as
English
A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion
scientific article

    Statements

    A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion (English)
    0 references
    0 references
    0 references
    0 references
    15 December 2014
    0 references
    strong sample-path optimality
    0 references
    Lyapunov function condition
    0 references
    stationary policy
    0 references
    expected average reward criterion
    0 references

    Identifiers