Convergence results for single-step on-policy reinforcement-learning algorithms (Q1568533): Difference between revisions
From MaRDI portal
Removed claims |
Set OpenAlex properties. |
||
(2 intermediate revisions by 2 users not shown) | |||
Property / author | |||
Property / author: Satinder Pal Singh / rank | |||
Normal rank | |||
Property / author | |||
Property / author: Tommi S. Jaakkola / rank | |||
Normal rank | |||
Property / MaRDI profile type | |||
Property / MaRDI profile type: MaRDI publication profile / rank | |||
Normal rank | |||
Property / full work available at URL | |||
Property / full work available at URL: https://doi.org/10.1023/a:1007678930559 / rank | |||
Normal rank | |||
Property / OpenAlex ID | |||
Property / OpenAlex ID: W2150339816 / rank | |||
Normal rank |
Latest revision as of 10:39, 30 July 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Convergence results for single-step on-policy reinforcement-learning algorithms |
scientific article |
Statements
Convergence results for single-step on-policy reinforcement-learning algorithms (English)
0 references
21 June 2000
0 references
reinforcement learning
0 references