Reinforcement learning based algorithms for average cost Markov decision processes (Q2643632): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
Import240304020342 (talk | contribs)
Set profile property.
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank

Revision as of 07:56, 5 March 2024

scientific article
Language Label Description Also known as
English
Reinforcement learning based algorithms for average cost Markov decision processes
scientific article

    Statements

    Reinforcement learning based algorithms for average cost Markov decision processes (English)
    0 references
    27 August 2007
    0 references
    actor-critic algorithms
    0 references
    two timescale stochastic approximation
    0 references
    Markov decision processes
    0 references
    policy iteration
    0 references
    simultaneous perturbation stochastic approximation
    0 references
    normalized Hadamard matrices
    0 references
    reinforcement learning
    0 references
    TD-learning
    0 references

    Identifiers