Natural actor-critic algorithms (Q1049136): Difference between revisions

From MaRDI portal
ReferenceBot (talk | contribs)
Changed an Item
Import241208061232 (talk | contribs)
Normalize DOI.
 
Property / DOI
 
Property / DOI: 10.1016/j.automatica.2009.07.008 / rank
Normal rank
 
Property / DOI
 
Property / DOI: 10.1016/J.AUTOMATICA.2009.07.008 / rank
 
Normal rank

Latest revision as of 15:00, 10 December 2024

scientific article
Language Label Description Also known as
English
Natural actor-critic algorithms
scientific article

    Statements

    Natural actor-critic algorithms (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    8 January 2010
    0 references
    actor-critic reinforcement learning algorithms
    0 references
    policy-gradient methods
    0 references
    approximate dynamic programming
    0 references
    function approximation
    0 references
    two-timescale stochastic approximation
    0 references
    temporal difference learning
    0 references
    natural gradient
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references