Dynamic policy programming
From MaRDI portal
Recommendations
- Empirical dynamic programming
- Approximate policy iteration: a survey and some new methods
- scientific article; zbMATH DE number 4003938
- Approximate policy iteration for Markov decision processes via quantitative adaptive aggregations
- Continuous state dynamic programming via nonexpansive approximation
Cited in
(12)- Applications of variable discounting dynamic programming to iterated function systems and related problems
- scientific article; zbMATH DE number 7370615 (Why is no real title available?)
- Value Iteration is Optic Composition
- On linear and super-linear convergence of natural policy gradient algorithm
- Empirical dynamic programming
- Kernel dynamic policy programming: applicable reinforcement learning to robot systems with high dimensional states
- Rollout sampling approximate policy iteration
- Occupancy information ratio: infinite-horizon, information-directed, parameterized policy search
- Time-varying policy rule under learning
- Approximate policy iteration: a survey and some new methods
- scientific article; zbMATH DE number 7370614 (Why is no real title available?)
- Reduced complexity dynamic programming based on policy iteration
This page was built for publication: Dynamic policy programming
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5405224)