Accelerating actor-critic-based algorithms via pseudo-labels derived from prior knowledge (Q6126872): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
Property / cites work
 
Property / cites work: Q4626283 / rank
 
Normal rank
Property / cites work
 
Property / cites work: MM Optimization Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simple statistical gradient-following algorithms for connectionist reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Overcoming catastrophic forgetting in neural networks / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5148970 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5053301 / rank
 
Normal rank

Revision as of 21:09, 29 August 2024

scientific article; zbMATH DE number 7829860
Language Label Description Also known as
English
Accelerating actor-critic-based algorithms via pseudo-labels derived from prior knowledge
scientific article; zbMATH DE number 7829860

    Statements

    Accelerating actor-critic-based algorithms via pseudo-labels derived from prior knowledge (English)
    0 references
    0 references
    0 references
    10 April 2024
    0 references
    reinforcement learning
    0 references
    deep RL
    0 references
    actor-critic methods
    0 references
    policy optimization
    0 references
    sample efficiency
    0 references
    exploration
    0 references

    Identifiers