Adaptive control of discounted Markov decision chains (Q796461): Difference between revisions
From MaRDI portal
Removed claim: author (P16): Item:Q217261 |
Changed an Item |
||
Property / author | |||
Property / author: Onésimo Hernández-Lerma / rank | |||
Normal rank |
Revision as of 04:32, 11 February 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Adaptive control of discounted Markov decision chains |
scientific article |
Statements
Adaptive control of discounted Markov decision chains (English)
0 references
1985
0 references
We consider discounted-reward finite-state Markov decision processes which depend on unknown parameters. An adaptive policy inspired by the nonstationary value iteration scheme of \textit{A. Federgruen} and \textit{P. J. Schweitzer} [ibid. 34, 207-241 (1981; Zbl 0426.90091)] is proposed. This policy is briefly compared with the principle of estimation and control recently obtained by \textit{M. Schäl} [Lect. Notes Pure Appl. Math. 86, 239-253 (1983; Zbl 0525.93071)].
0 references
discounted-reward finite-state Markov decision processes
0 references
adaptive policy
0 references
nonstationary value iteration
0 references