The Concept of Opposition and Its Use in Q-Learning and Q(λ) Techniques (Q5302483)
From MaRDI portal
Property / MaRDI profile type: MaRDI publication profile (Normal rank)
Property / full work available at URL: https://doi.org/10.1007/978-3-540-70829-2_11 (Normal rank)
Property / OpenAlex ID: W1561485809 (Normal rank)
Property / cites work: Planning and acting in partially observable stochastic domains (Normal rank)
Property / cites work: Perception-based hidden Markov models: A theoretical framework for data mining and knowledge discovery (Normal rank)
Property / cites work: Reinforcement learning agents (Normal rank)
Property / cites work: \({\mathcal Q}\)-learning (Normal rank)
Latest revision as of 22:17, 28 June 2024
scientific article; zbMATH DE number 5486133
Language | Label | Description | Also known as |
---|---|---|---|
English | The Concept of Opposition and Its Use in Q-Learning and Q(λ) Techniques | scientific article; zbMATH DE number 5486133 | |
Statements
The Concept of Opposition and Its Use in Q-Learning and Q(λ) Techniques (English)
7 January 2009
Reinforcement Learning
Q-value updating