Transfer of learning by composing solutions of elemental sequential tasks (Q1812933): Difference between revisions

From MaRDI portal
Import240304020342 (talk | contribs)
Set profile property.
Set OpenAlex properties.
 
(One intermediate revision by one other user not shown)
Property / cites work
 
Property / cites work: Q3795523 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4403756 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Macro-operators: A weak method for learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3683893 / rank
 
Normal rank
Property / cites work
 
Property / cites work: \({\mathcal Q}\)-learning / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1007/bf00992700 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2012036715 / rank
 
Normal rank

Latest revision as of 10:00, 30 July 2024

scientific article
Language Label Description Also known as
English
Transfer of learning by composing solutions of elemental sequential tasks
scientific article

    Statements

    Transfer of learning by composing solutions of elemental sequential tasks (English)
    0 references
    0 references
    11 August 1992
    0 references
    A class of sequential decision tasks (SDTs) is considered, called composite sequential decision tasks, formed by temporally concatenating a number of elemental sequential decision tasks. Elemental SDTs cannot be decomposed into simpler SDTs. A learning agent has to learn to solve a set of elemental and composite SDTs is examined. It is assumed that the structure of the composite tasks is unknown to the learning agent. The straightforward application of reinforcement learning to multiple tasks requires learning the tasks separately, which can waste computational resources, both memory and time. A new learning algorithm and a modular architecture is described that learns the decomposition of composite SDTs, and achieves transfer of learning by sharing the solutions of elemental SDTs across multiple composite SDTs. The solution of a composite SDT is constructed by computationally inexpensive modifications of the solutions of its constituent elemental SDTs. A proof of one aspect of the learning algorithm is provided.
    0 references
    reinforcement
    0 references
    compositional learning
    0 references
    transfer of learning
    0 references
    modular architecture
    0 references
    0 references
    0 references
    0 references

    Identifiers