Adaptive control of discounted Markov decision chains (Q796461)

From MaRDI portal





scientific article; zbMATH DE number 3865009
Language Label Description Also known as
default for all languages
No label defined
    English
    Adaptive control of discounted Markov decision chains
    scientific article; zbMATH DE number 3865009

      Statements

      Adaptive control of discounted Markov decision chains (English)
      0 references
      1985
      0 references
      We consider discounted-reward finite-state Markov decision processes which depend on unknown parameters. An adaptive policy inspired by the nonstationary value iteration scheme of \textit{A. Federgruen} and \textit{P. J. Schweitzer} [ibid. 34, 207-241 (1981; Zbl 0426.90091)] is proposed. This policy is briefly compared with the principle of estimation and control recently obtained by \textit{M. Schäl} [Lect. Notes Pure Appl. Math. 86, 239-253 (1983; Zbl 0525.93071)].
      0 references
      discounted-reward finite-state Markov decision processes
      0 references
      adaptive policy
      0 references
      nonstationary value iteration
      0 references

      Identifiers