Natural actor-critic algorithms (Q1049136)
From MaRDI portal
Property / DOI: 10.1016/J.AUTOMATICA.2009.07.008 (normal rank)
Latest revision as of 15:00, 10 December 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Natural actor-critic algorithms | scientific article | |
Statements
Natural actor-critic algorithms (English)
8 January 2010
actor-critic reinforcement learning algorithms
policy-gradient methods
approximate dynamic programming
function approximation
two-timescale stochastic approximation
temporal difference learning
natural gradient