An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions (Q5380403)

From MaRDI portal





scientific article; zbMATH DE number 7062532
Language Label Description Also known as
default for all languages
No label defined
    English
    An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions
    scientific article; zbMATH DE number 7062532

      Statements

      An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions (English)
      0 references
      0 references
      0 references
      0 references
      0 references
      4 June 2019
      0 references

      Identifiers