An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions (Q5380403)

scientific article; zbMATH DE number 7062532
