Truncated policy iteration methods (Q1060136): Difference between revisions
From MaRDI portal
Set profile property. |
Set OpenAlex properties. |
||
Property / full work available at URL | |||
Property / full work available at URL: https://doi.org/10.1016/0167-6377(84)90054-3 / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W1999380740 / rank | |||
Normal rank |
Revision as of 22:54, 19 March 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Truncated policy iteration methods |
scientific article |
Statements
Truncated policy iteration methods (English)
0 references
1984
0 references
Policy iteration methods are important but often computationally expensive approaches for solving certain stochastic optimization problems. Modified policy iteration methods have been proposed to reduce the storage and computational burden. The asymptotic speed-of-convergence of such methods is, however, not well understood. In this paper we show how modified policy iteration methods may be constructed to achieve a preassigned rate-of-convergence. Our analysis provides a framework for analyzing the local behavior of such methods and provides impetus for perhaps more computationally efficient procedures than currently exist.
0 references
Markov chains
0 references
modified policy iteration methods
0 references
preassigned rate-of- convergence
0 references