Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation (Q889297): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
Created claim: Wikidata QID (P12): Q39164602, #quickstatements; #temporary_batch_1706974296281
Property / Wikidata QID
 
Property / Wikidata QID: Q39164602 / rank
 
Normal rank

Revision as of 17:51, 3 February 2024

scientific article
Language Label Description Also known as
default for all languages
No label defined
    English
    Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation
    scientific article

      Statements

      Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation (English)
      6 November 2015
      reinforcement learning
      transition model estimation
      conditional density estimation

      Identifiers