Nonparametric approximation generalized policy iteration reinforcement learning algorithm based on states clustering
From MaRDI portal
Publication:4574671
DOI10.13195/J.KZYJC.2016.1148zbMATH Open1399.68107MaRDI QIDQ4574671FDOQ4574671
Publication date: 18 July 2018
Recommendations
- Non-parametric policy search with limited information loss
- Rollout sampling approximate policy iteration
- Regularized policy iteration with nonparametric function spaces
- Reinforcement learning method of continuous state adaptively discretized based on \(K\)-means clustering
- Parametric Approximation Policy Iteration Algorithm Based on Gaussian Process
Cited In (1)
This page was built for publication: Nonparametric approximation generalized policy iteration reinforcement learning algorithm based on states clustering
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q4574671)