Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation (Q889297): Difference between revisions

From MaRDI portal
Created claim: Wikidata QID (P12): Q39164602, #quickstatements; #temporary_batch_1706974296281
Changed an Item
Property / describes a project that uses: PILCO
Property / describes a project that uses: PILCO / rank: Normal rank

Revision as of 19:48, 29 February 2024

scientific article
Language: English
Label: Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation
Description: scientific article

    Statements

    Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation (English)
    6 November 2015
    reinforcement learning
    transition model estimation
    conditional density estimation