Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation (Q889297): Difference between revisions
From MaRDI portal
Created claim: Wikidata QID (P12): Q39164602, #quickstatements; #temporary_batch_1706974296281
Changed an Item
Property / describes a project that uses: PILCO / rank: Normal rank
Revision as of 18:48, 29 February 2024
scientific article
| Language | Label | Description | Also known as |
|---|---|---|---|
| default for all languages | No label defined | | |
| English | Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation | scientific article | |
Statements
Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation (English)
6 November 2015
reinforcement learning
transition model estimation
conditional density estimation