A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion (Q481787)

From MaRDI portal
scientific article

    Statements

    A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion (English)
    15 December 2014
    strong sample-path optimality
    Lyapunov function condition
    stationary policy
    expected average reward criterion