A stochastic trust-region framework for policy optimization (Q5096136)

From MaRDI portal





scientific article; zbMATH DE number 7571710
Language Label Description Also known as
default for all languages
No label defined
    English
    A stochastic trust-region framework for policy optimization
    scientific article; zbMATH DE number 7571710

      Statements

      A Stochastic Trust-Region Framework for Policy Optimization (English)
      0 references
      0 references
      0 references
      0 references
      15 August 2022
      0 references
      deep reinforcement learning
      0 references
      stochastic trust region method
      0 references
      policy optimization
      0 references
      global convergence
      0 references
      entropy control
      0 references

      Identifiers

      0 references
      0 references
      0 references
      0 references
      0 references
      0 references