Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation (Q889297): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Changed an Item
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: Efficient exploration through active learning for value function approximation in reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Using Expectation-Maximization for Reinforcement Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4704221 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive importance sampling for value function approximation in off-policy reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2880931 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Statistical analysis of kernel-based least-squares density-ratio estimation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Computational complexity of kernel-based density-ratio estimation: a condition number analysis / rank
 
Normal rank
Property / cites work
 
Property / cites work: Policy search for motor primitives in robotics / rank
 
Normal rank
Property / cites work
 
Property / cites work: Model-based contextual policy search for data-efficient generalization of robot skills / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/1532443041827907 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3394879 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Density-ratio matching under the Bregman divergence: a unified framework of density-ratio estimation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sufficient Dimension Reduction via Squared-Loss Mutual Information Estimation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simple statistical gradient-following algorithms for connectionist reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Analysis and improvement of policy gradient estimation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Efficient Sample Reuse in Policy Gradients with Parameter-Based Exploration / rank
 
Normal rank

Latest revision as of 01:23, 11 July 2024

scientific article
Language Label Description Also known as
English
Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation
scientific article

    Statements

    Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    6 November 2015
    0 references
    0 references
    reinforcement learning
    0 references
    transition model estimation
    0 references
    conditional density estimation
    0 references
    0 references
    0 references
    0 references