Nonparametric approximation generalized policy iteration reinforcement learning algorithm based on states clustering
From MaRDI portal
Publication:4574671
Recommendations
- Non-parametric policy search with limited information loss
- Rollout sampling approximate policy iteration
- Regularized policy iteration with nonparametric function spaces
- Reinforcement learning method of continuous state adaptively discretized based on \(K\)-means clustering
- Parametric Approximation Policy Iteration Algorithm Based on Gaussian Process
Cited in (1)