Adaptive control of discounted Markov decision chains (Q796461): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
RedirectionBot (talk | contribs)
Removed claim: author (P16): Item:Q217261
Property / author
 
Property / author: Onésimo Hernández-Lerma / rank
Normal rank
 

Revision as of 04:32, 11 February 2024

scientific article
Language Label Description Also known as
English
Adaptive control of discounted Markov decision chains
scientific article

    Statements

    Adaptive control of discounted Markov decision chains (English)
    0 references
    0 references
    1985
    0 references
    We consider discounted-reward finite-state Markov decision processes which depend on unknown parameters. An adaptive policy inspired by the nonstationary value iteration scheme of \textit{A. Federgruen} and \textit{P. J. Schweitzer} [ibid. 34, 207-241 (1981; Zbl 0426.90091)] is proposed. This policy is briefly compared with the principle of estimation and control recently obtained by \textit{M. Schäl} [Lect. Notes Pure Appl. Math. 86, 239-253 (1983; Zbl 0525.93071)].
    0 references
    discounted-reward finite-state Markov decision processes
    0 references
    adaptive policy
    0 references
    nonstationary value iteration
    0 references

    Identifiers