Convergence of entropy-regularized natural policy gradient with linear function approximation
From MaRDI portal
Publication:6587339
Cites work
- scientific article; zbMATH DE number 1321699 (Why is no real title available?)
- scientific article; zbMATH DE number 7370615 (Why is no real title available?)
- scientific article; zbMATH DE number 7306852 (Why is no real title available?)
- scientific article; zbMATH DE number 3301983 (Why is no real title available?)
- Algorithms for reinforcement learning.
- An analysis of temporal-difference learning with function approximation
- Asymptotic evaluation of certain markov process expectations for large time. IV
- Natural actor-critic algorithms
- Random design analysis of ridge regression
- Reinforcement learning. An introduction
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Understanding machine learning. From theory to algorithms
This page was built for publication: Convergence of entropy-regularized natural policy gradient with linear function approximation
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6587339)