Variational actor-critic algorithms, (Q6102338): Difference between revisions
From MaRDI portal
Property | Value | Rank |
---|---|---|
cites work | Q4533362 | Normal rank |
cites work | Q3096132 | Normal rank |
cites work | Q4626283 | Normal rank |
cites work | Simple statistical gradient-following algorithms for connectionist reinforcement learning | Normal rank |
Revision as of 02:02, 1 August 2024
scientific article; zbMATH DE number 7683230
Language | Label | Description | Also known as |
---|---|---|---|
English | Variational actor-critic algorithms, | scientific article; zbMATH DE number 7683230 | |
Statements
Variational actor-critic algorithms, (English)
8 May 2023
Markov decision process
reinforcement learning
policy gradient
optimal control