Efficient Sample Reuse in Policy Gradients with Parameter-Based Exploration (Q5378202): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
Set OpenAlex properties.
 
(5 intermediate revisions by 5 users not shown)
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / Wikidata QID
 
Property / Wikidata QID: Q47904761 / rank
 
Normal rank
Property / arXiv ID
 
Property / arXiv ID: 1301.3966 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate dynamic programming with a fuzzy parameterization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4869639 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3093234 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Reward-Weighted Regression with Sample Reuse for Direct Policy Search in Reinforcement Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3683893 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Improving predictive inference under covariate shift by weighting the log-likelihood function / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4626283 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Real-time reinforcement learning by sequential actor-critics and experience replay / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simple statistical gradient-following algorithms for connectionist reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Analysis and improvement of policy gradient estimation / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2133224499 / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 11:10, 30 July 2024

scientific article; zbMATH DE number 7065034
Language Label Description Also known as
English
Efficient Sample Reuse in Policy Gradients with Parameter-Based Exploration
scientific article; zbMATH DE number 7065034

    Statements