Temporal difference methods for the maximal solution of discrete-time coupled algebraic Riccati equations (Q5949886)

From MaRDI portal
scientific article; zbMATH DE number 1678807
Language Label Description Also known as
English
Temporal difference methods for the maximal solution of discrete-time coupled algebraic Riccati equations
scientific article; zbMATH DE number 1678807

    Statements

    Temporal difference methods for the maximal solution of discrete-time coupled algebraic Riccati equations (English)
    0 references
    0 references
    0 references
    5 December 2001
    0 references
    The authors present an iterative technique for deriving the maximal solution of a set of discrete-time coupled algebraic Riccati equations, based on temporal difference methods. They trace a parallel with the theory of temporal difference algorithms for Markovian decision processes to develop a \(\lambda\)-policy iteration like algorithm for the maximal solution of these equations. The advantage of the proposed method is that an appropriate choice of \(\lambda\) between 0 and 1 can speed up the convergence of the policy evaluation step of the policy iteration method by using value iteration.
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    maximal solution
    0 references
    discrete-time coupled algebraic Riccati equations
    0 references
    temporal difference methods
    0 references
    Markovian decision processes
    0 references
    \(\lambda\)-policy iteration
    0 references
    0 references