A tutorial survey of reinforcement learning (Q5955768): Difference between revisions

From MaRDI portal
Added link to MaRDI item.
Import240304020342 (talk | contribs)
Set profile property.
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank

Revision as of 23:48, 4 March 2024

scientific article; zbMATH DE number 1706651
Language Label Description Also known as
English
A tutorial survey of reinforcement learning
scientific article; zbMATH DE number 1706651

    Statements

    A tutorial survey of reinforcement learning (English)
    0 references
    0 references
    18 February 2002
    0 references
    Reinforcement learning (RL) refers to the process whereby a learning system learns an associative mapping by maximizing a scalar evaluation (a reinforcement) of its performance from the environment. Delayed RL is a process in which the environment yields only a single scalar reinforcement collectively. Such tasks arise in the optimal control of dynamic systems and planning problems of artificial intelligence. Here, the authors `provide a comprehensive tutorial survey of various ideas and methods of delayed RL'. The connexion with stochastic optimal control is explored, and differences between delayed RL and dynamic programming methods are discussed. Model-based and model-free methods are examined, and general issues relating to the practical implementation of RL algorithms are noted.
    0 references
    reinforcement learning
    0 references
    dynamic programming
    0 references
    optimal control
    0 references
    neural networks
    0 references
    model-free methods
    0 references

    Identifiers