An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions (Q5380403)

From MaRDI portal
Revision as of 22:36, 13 November 2024 by Daniel (talk | contribs) (‎Created claim: DBLP publication ID (P1635): journals/neco/MaZHS16, #quickstatements; #temporary_batch_1731530891435)





scientific article; zbMATH DE number 7062532
Language Label Description Also known as
English
An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions
scientific article; zbMATH DE number 7062532

    Statements

    Identifiers