Adaptive control of discounted Markov decision chains (Q796461)

From MaRDI portal
Revision as of 12:28, 14 June 2024 by ReferenceBot (talk | contribs) (‎Changed an Item)
scientific article
Language Label Description Also known as
English
Adaptive control of discounted Markov decision chains
scientific article

    Statements

    Adaptive control of discounted Markov decision chains (English)
    0 references
    1985
    0 references
    We consider discounted-reward finite-state Markov decision processes which depend on unknown parameters. An adaptive policy inspired by the nonstationary value iteration scheme of \textit{A. Federgruen} and \textit{P. J. Schweitzer} [ibid. 34, 207-241 (1981; Zbl 0426.90091)] is proposed. This policy is briefly compared with the principle of estimation and control recently obtained by \textit{M. Schäl} [Lect. Notes Pure Appl. Math. 86, 239-253 (1983; Zbl 0525.93071)].
    0 references
    discounted-reward finite-state Markov decision processes
    0 references
    adaptive policy
    0 references
    nonstationary value iteration
    0 references

    Identifiers