An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions (Q5380403): Difference between revisions

From MaRDI portal
Import240304020342 (talk | contribs)
Set profile property.
Normalize DOI.
 
(3 intermediate revisions by 3 users not shown)
Property / DOI
 
Property / DOI: 10.1162/NECO_a_00808 / rank
Normal rank
 
Property / cites work
 
Property / cites work: Online Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2921693 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Logarithmic Regret Algorithms for Online Convex Optimization / rank
 
Normal rank
Property / cites work
 
Property / cites work: Efficient algorithms for online decision problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions / rank
 
Normal rank
Property / cites work
 
Property / cites work: Online Markov Decision Processes Under Bandit Feedback / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4626283 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simple statistical gradient-following algorithms for connectionist reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov Decision Processes with Arbitrary Reward Processes / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1162/neco_a_00808 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2225522132 / rank
 
Normal rank
Property / DBLP publication ID
 
Property / DBLP publication ID: journals/neco/MaZHS16 / rank
 
Normal rank
Property / DOI
 
Property / DOI: 10.1162/NECO_A_00808 / rank
 
Normal rank

Latest revision as of 16:50, 30 December 2024

scientific article; zbMATH DE number 7062532
Language Label Description Also known as
English
An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions
scientific article; zbMATH DE number 7062532

    Statements

    Identifiers