Accelerating actor-critic-based algorithms via pseudo-labels derived from prior knowledge (Q6126872): Difference between revisions
From MaRDI portal
Set OpenAlex properties. |
ReferenceBot (talk | contribs) Changed an Item |
||
Property / cites work | |||
Property / cites work: Q4626283 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: MM Optimization Algorithms / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Simple statistical gradient-following algorithms for connectionist reinforcement learning / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Overcoming catastrophic forgetting in neural networks / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q5148970 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q5053301 / rank | |||
Normal rank |
Revision as of 21:09, 29 August 2024
scientific article; zbMATH DE number 7829860
Language | Label | Description | Also known as |
---|---|---|---|
English | Accelerating actor-critic-based algorithms via pseudo-labels derived from prior knowledge |
scientific article; zbMATH DE number 7829860 |
Statements
Accelerating actor-critic-based algorithms via pseudo-labels derived from prior knowledge (English)
0 references
10 April 2024
0 references
reinforcement learning
0 references
deep RL
0 references
actor-critic methods
0 references
policy optimization
0 references
sample efficiency
0 references
exploration
0 references
0 references