Learning output reference model tracking for higher-order nonlinear systems with unknown dynamics (Q2004902)

From MaRDI portal





scientific article; zbMATH DE number 7257187
Language Label Description Also known as
default for all languages
No label defined
    English
    Learning output reference model tracking for higher-order nonlinear systems with unknown dynamics
    scientific article; zbMATH DE number 7257187

      Statements

      Learning output reference model tracking for higher-order nonlinear systems with unknown dynamics (English)
      0 references
      0 references
      0 references
      0 references
      7 October 2020
      0 references
      Summary: Linearly and nonlinearly parameterized approximate dynamic programming approaches used for output reference model (ORM) tracking control are proposed. The ORM tracking problem is of significant interest in practice since, with a linear ORM, the closed-loop control system is indirectly feedback linearized and value iteration (VI) offers the means to achieve ORM tracking without using process dynamics. Ranging from linear to nonlinear parameterizations, a successful approximate VI implementation for continuous state-action spaces depends on several key parameters such as: problem dimension, exploration of the state-action space, the state-transitions dataset size, and suitable selection of the function approximators. We show that using the same transitions dataset and under a general linear parameterization of the Q-function, high performance ORM tracking can be achieved with an approximate VI scheme, on the same performance level as that of a neural-network (NN)-based implementation that is more complex and takes significantly more time to learn. However, the latter proves to be more robust to hyperparameters selection, dataset size, and to exploration strategies, recommending it as the \textit{de facto} practical implementation. The case study is aimed at ORM tracking of a real-world nonlinear two inputs -- two outputs aerodynamic process with ten internal states, as a representative high order system.
      0 references
      approximate dynamic programming
      0 references
      reinforcement learning
      0 references
      data-driven control
      0 references
      model-free control
      0 references
      reference trajectory tracking
      0 references
      output reference model
      0 references
      multivariable control
      0 references
      aerodynamic rotor system
      0 references
      neural networks
      0 references
      learning systems
      0 references

      Identifiers

      0 references
      0 references
      0 references
      0 references
      0 references
      0 references
      0 references