On the existence of fixed points for approximate value iteration and temporal-difference learning (Q1586803)
From MaRDI portal
Property / author: Benjamin van Roy / rank: Normal rank
Property / MaRDI profile type: MaRDI publication profile / rank: Normal rank
Property / cites work: Functional Approximations and Dynamic Programming / rank: Normal rank
Property / cites work: An analysis of temporal-difference learning with function approximation / rank: Normal rank
Property / cites work: The convergence of \(TD(\lambda)\) for general \(\lambda\) / rank: Normal rank
Property / cites work: Q4886156 / rank: Normal rank
Property / cites work: Q3997575 / rank: Normal rank
Latest revision as of 08:59, 3 June 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | On the existence of fixed points for approximate value iteration and temporal-difference learning | scientific article | |
Statements
On the existence of fixed points for approximate value iteration and temporal-difference learning (English)
19 February 2001
dynamic programming
neurodynamic programming
reinforcement learning
temporal-difference learning
value iteration