A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion (Q481787)

From MaRDI portal

scientific article; zbMATH DE number 6380451

      Statements

      A counterexample on sample-path optimality in stable Markov decision chains with the average reward criterion (English)
      15 December 2014
      strong sample-path optimality
      Lyapunov function condition
      stationary policy
      expected average reward criterion

      Identifiers