An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions (Q5380403): Difference between revisions

From MaRDI portal
ReferenceBot (talk | contribs)
Changed an Item
Created claim: DBLP publication ID (P1635): journals/neco/MaZHS16, #quickstatements; #temporary_batch_1731530891435
 
(One intermediate revision by one other user not shown)
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1162/neco_a_00808 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2225522132 / rank
 
Normal rank
Property / DBLP publication ID
 
Property / DBLP publication ID: journals/neco/MaZHS16 / rank
 
Normal rank

Latest revision as of 22:36, 13 November 2024

scientific article; zbMATH DE number 7062532
Language Label Description Also known as
English
An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions
scientific article; zbMATH DE number 7062532

    Statements

    Identifiers