Adaptive control of discounted Markov decision chains (Q796461)

From MaRDI portal
scientific article

    Statements

    Adaptive control of discounted Markov decision chains (English)
    1985
    We consider discounted-reward finite-state Markov decision processes that depend on unknown parameters. An adaptive policy inspired by the nonstationary value iteration scheme of A. Federgruen and P. J. Schweitzer [ibid. 34, 207-241 (1981; Zbl 0426.90091)] is proposed. This policy is briefly compared with the principle of estimation and control recently obtained by M. Schäl [Lect. Notes Pure Appl. Math. 86, 239-253 (1983; Zbl 0525.93071)].
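The general idea of combining parameter estimation with nonstationary value iteration can be illustrated with a minimal sketch. It is not the construction from the article: the two-state model, the rewards, the discount factor `BETA`, the frequency estimator, and the names `nvi_step` and `run_adaptive` are all assumptions chosen for illustration. At each step the controller re-estimates the unknown parameter, performs a single value-iteration sweep with that estimate, and applies the greedy action.

```python
import random

BETA = 0.9          # discount factor (assumed value)
STATES = (0, 1)
ACTIONS = (0, 1)


def reward(x, a):
    # Hypothetical rewards: state 1 pays 1, action 1 costs 0.1.
    return float(x) - 0.1 * a


def transition(x, a, theta):
    """Return [P(next=0), P(next=1)] in a toy model with unknown theta.

    Action 1 reaches state 1 with probability theta, action 0 with
    probability 1 - theta, so every transition is informative.
    """
    p1 = theta if a == 1 else 1.0 - theta
    return [1.0 - p1, p1]


def nvi_step(v, theta):
    """One value-iteration sweep using the current estimate of theta.

    Returns the updated value function and the greedy action per state.
    """
    new_v, policy = [], []
    for x in STATES:
        q = []
        for a in ACTIONS:
            probs = transition(x, a, theta)
            q.append(reward(x, a) + BETA * sum(p * v[y] for y, p in enumerate(probs)))
        best = max(ACTIONS, key=lambda a: q[a])
        new_v.append(q[best])
        policy.append(best)
    return new_v, policy


def run_adaptive(true_theta, steps, seed=0):
    """Simulate the estimate-then-sweep loop; return (theta_hat, v)."""
    rng = random.Random(seed)
    x, v = 0, [0.0, 0.0]
    hits, trials = 1, 2  # smoothed counts, so the first estimate is 0.5
    for _ in range(steps):
        theta_hat = hits / trials
        v, policy = nvi_step(v, theta_hat)   # one sweep, not a full solve
        a = policy[x]
        nxt = 1 if rng.random() < transition(x, a, true_theta)[1] else 0
        # A "success" is the outcome whose probability equals theta.
        hits += nxt if a == 1 else 1 - nxt
        trials += 1
        x = nxt
    return hits / trials, v
```

Only one sweep is performed per time step, so the value function and the parameter estimate are refined together rather than solving the estimated model to convergence at every step.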
    discounted-reward finite-state Markov decision processes
    adaptive policy
    nonstationary value iteration

    Identifiers