Policy gradient in Lipschitz Markov decision processes (Q747252)

From MaRDI portal





scientific article; zbMATH DE number 6497623
Language Label Description Also known as
default for all languages
No label defined
    English
    Policy gradient in Lipschitz Markov decision processes
    scientific article; zbMATH DE number 6497623

      Statements

      Policy gradient in Lipschitz Markov decision processes (English)
      0 references
      0 references
      0 references
      0 references
      23 October 2015
      0 references
      reinforcement learning
      0 references
      Markov decision process
      0 references
      Lipschitz continuity
      0 references
      policy gradient algorithm
      0 references

      Identifiers