
Nonparametric approximation generalized policy iteration reinforcement learning algorithm based on states clustering

From MaRDI portal
Publication:4574671

DOI: 10.13195/J.KZYJC.2016.1148
zbMATH Open: 1399.68107
MaRDI QID: Q4574671
FDO: Q4574671


Authors: Ting Ji, Hua Zhang


Publication date: 18 July 2018





Recommendations

  • Non-parametric policy search with limited information loss
  • Rollout sampling approximate policy iteration
  • Regularized policy iteration with nonparametric function spaces
  • Reinforcement learning method of continuous state adaptively discretized based on \(K\)-means clustering
  • Parametric Approximation Policy Iteration Algorithm Based on Gaussian Process


zbMATH Keywords

reinforcement learning, policy iteration, nonparametric approximation, states clustering


Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)



Cited In (1)

  • Reinforcement learning method of continuous state adaptively discretized based on \(K\)-means clustering







Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:4574671&oldid=18717018"
This page was last edited on 7 February 2024, at 12:11. Warning: Page may not contain recent updates.