Expected policy gradients for reinforcement learning (Q4969098)

From MaRDI portal





scientific article; zbMATH DE number 7255083
Language Label Description Also known as
default for all languages
No label defined
    English
    Expected policy gradients for reinforcement learning
    scientific article; zbMATH DE number 7255083

      Statements

      0 references
      0 references
      5 October 2020
      0 references
      policy gradients
      0 references
      exploration
      0 references
      bounded actions
      0 references
      reinforcement learning
      0 references
      Markov decision process (MDP)
      0 references
      0 references
      0 references
      0 references
      0 references
      0 references

      Identifiers

      0 references
      0 references
      0 references
      0 references
      0 references