Convergence results for single-step on-policy reinforcement-learning algorithms (Q1568533): Difference between revisions

From MaRDI portal
RedirectionBot (talk | contribs)
Removed claims
Set OpenAlex properties.
 
(2 intermediate revisions by 2 users not shown)
Property / author
 
Property / author: Satinder Pal Singh / rank
 
Normal rank
Property / author
 
Property / author: Tommi S. Jaakkola / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1023/a:1007678930559 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2150339816 / rank
 
Normal rank

Latest revision as of 10:39, 30 July 2024

scientific article
Language Label Description Also known as
English
Convergence results for single-step on-policy reinforcement-learning algorithms
scientific article

    Statements

    Convergence results for single-step on-policy reinforcement-learning algorithms (English)
    0 references
    0 references
    0 references
    0 references
    0 references
    21 June 2000
    0 references
    reinforcement learning
    0 references

    Identifiers