Expected policy gradients for reinforcement learning (Q4969098)

From MaRDI portal

scientific article; zbMATH DE number 7255083

      Statements

      5 October 2020
      policy gradients
      exploration
      bounded actions
      reinforcement learning
      Markov decision process (MDP)
