An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions (Q5380403)
From MaRDI portal
![]() | This is the item page for this Wikibase entity, intended for internal use and editing purposes. Please use this page instead for the normal view: An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions |
scientific article; zbMATH DE number 7062532
Language | Label | Description | Also known as |
---|---|---|---|
English | An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions |
scientific article; zbMATH DE number 7062532 |
Statements
An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions (English)
0 references
4 June 2019
0 references