Temporal difference methods for the maximal solution of discrete-time coupled algebraic Riccati equations (Q5949886)
From MaRDI portal
scientific article; zbMATH DE number 1678807
Language | Label | Description | Also known as |
---|---|---|---|
English | Temporal difference methods for the maximal solution of discrete-time coupled algebraic Riccati equations |
scientific article; zbMATH DE number 1678807 |
Statements
Temporal difference methods for the maximal solution of discrete-time coupled algebraic Riccati equations (English)
0 references
5 December 2001
0 references
The authors present an iterative technique for deriving the maximal solution of a set of discrete-time coupled algebraic Riccati equations, based on temporal difference methods. They trace a parallel with the theory of temporal difference algorithms for Markovian decision processes to develop a \(\lambda\)-policy iteration like algorithm for the maximal solution of these equations. The advantage of the proposed method is that an appropriate choice of \(\lambda\) between 0 and 1 can speed up the convergence of the policy evaluation step of the policy iteration method by using value iteration.
0 references
maximal solution
0 references
discrete-time coupled algebraic Riccati equations
0 references
temporal difference methods
0 references
Markovian decision processes
0 references
\(\lambda\)-policy iteration
0 references
0 references
0 references
0 references
0 references