A value iteration method for undiscounted multichain Markov decision processes (Q3789375)

From MaRDI portal





scientific article; zbMATH DE number 4053390
Language Label Description Also known as
default for all languages
No label defined
    English
    A value iteration method for undiscounted multichain Markov decision processes
    scientific article; zbMATH DE number 4053390

      Statements

      A value iteration method for undiscounted multichain Markov decision processes (English)
      0 references
      1988
      0 references
      successive approximations
      0 references
      value iteration
      0 references
      infinite horizon average expected reward Markov decision processes
      0 references
      decomposition
      0 references
      \(\epsilon \)- optimal policy
      0 references
      0 references
      0 references
      0 references
      0 references
      0 references

      Identifiers