Transfer of learning by composing solutions of elemental sequential tasks (Q1812933): Difference between revisions
From MaRDI portal
Created a new Item |
Set OpenAlex properties. |
||
(3 intermediate revisions by 3 users not shown) | |||
Property / MaRDI profile type | |||
Property / MaRDI profile type: MaRDI publication profile / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q3795523 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q4403756 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Macro-operators: A weak method for learning / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: Q3683893 / rank | |||
Normal rank | |||
Property / cites work | |||
Property / cites work: \({\mathcal Q}\)-learning / rank | |||
Normal rank | |||
Property / full work available at URL | |||
Property / full work available at URL: https://doi.org/10.1007/bf00992700 / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W2012036715 / rank | |||
Normal rank | |||
links / mardi / name | links / mardi / name | ||
Latest revision as of 10:00, 30 July 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Transfer of learning by composing solutions of elemental sequential tasks |
scientific article |
Statements
Transfer of learning by composing solutions of elemental sequential tasks (English)
0 references
11 August 1992
0 references
A class of sequential decision tasks (SDTs) is considered, called composite sequential decision tasks, formed by temporally concatenating a number of elemental sequential decision tasks. Elemental SDTs cannot be decomposed into simpler SDTs. A learning agent has to learn to solve a set of elemental and composite SDTs is examined. It is assumed that the structure of the composite tasks is unknown to the learning agent. The straightforward application of reinforcement learning to multiple tasks requires learning the tasks separately, which can waste computational resources, both memory and time. A new learning algorithm and a modular architecture is described that learns the decomposition of composite SDTs, and achieves transfer of learning by sharing the solutions of elemental SDTs across multiple composite SDTs. The solution of a composite SDT is constructed by computationally inexpensive modifications of the solutions of its constituent elemental SDTs. A proof of one aspect of the learning algorithm is provided.
0 references
reinforcement
0 references
compositional learning
0 references
transfer of learning
0 references
modular architecture
0 references