Provably efficient learning with typed parametric models
From MaRDI portal
Publication:2880957
Recommendations
- The effect of representation and knowledge on goal-directed exploration with reinforcement-learning algorithms
- Learning parametric policies and transition probability models of Markov decision processes from data
- Reinforcement learning in finite MDPs: PAC analysis
- scientific article; zbMATH DE number 5547890
- A generalized path integral control approach to reinforcement learning