Convergence results for single-step on-policy reinforcement-learning algorithms (Q1568533): Difference between revisions
From MaRDI portal
Created a new Item
Set OpenAlex properties.
(4 intermediate revisions by 3 users not shown)
Property / author: Satinder Pal Singh (Normal rank)
Property / author: Tommi S. Jaakkola (Normal rank)
Property / MaRDI profile type: MaRDI publication profile (Normal rank)
Property / full work available at URL: https://doi.org/10.1023/a:1007678930559 (Normal rank)
Property / OpenAlex ID: W2150339816 (Normal rank)
Latest revision as of 10:39, 30 July 2024
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | Convergence results for single-step on-policy reinforcement-learning algorithms | scientific article | |
Statements
Convergence results for single-step on-policy reinforcement-learning algorithms (English)
0 references
21 June 2000
0 references
reinforcement learning
0 references