Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation (Q889297)

scientific article
Label (default for all languages): no label defined
Label (English): Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation
Description (English): scientific article

      Statements

      Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation (English)
      6 November 2015
      reinforcement learning
      transition model estimation
      conditional density estimation

      Identifiers