Mathematical Research Data Initiative
Main page
Recent changes
Random page
SPARQL
MaRDI@GitHub
New item
Special pages
In other projects
MaRDI portal item
Discussion
View source
View history
English
Log in

Integrating a partial model into model free reinforcement learning

From MaRDI portal
Publication:5405178
Jump to:navigation, search

zbMATH Open1432.68401MaRDI QIDQ5405178FDOQ5405178


Authors: Aviv Tamar, Dotan Di Castro, Ron Meir Edit this on Wikidata


Publication date: 1 April 2014


Full work available at URL: http://www.jmlr.org/papers/v13/tamar12a.html




Recommendations

  • Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation
  • Model-free average reward multi-step reinforcement learning
  • A generalized path integral control approach to reinforcement learning
  • Model-based average reward reinforcement learning
  • Model-Based Reinforcement Learning for Partially Observable Games with Sampling-Based State Estimation


zbMATH Keywords

stochastic approximationMarkov decision processesreinforcement learningtemporal differencehybrid model-based model-free algorithms


Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)



Cited In (2)

  • Model-based policy gradients with parameter-based exploration by least-squares conditional density estimation
  • Title not available (Why is that?)





This page was built for publication: Integrating a partial model into model free reinforcement learning

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5405178)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5405178&oldid=20145073"
Tools
What links here
Related changes
Printable version
Permanent link
Page information
This page was last edited on 9 February 2024, at 01:58. Warning: Page may not contain recent updates.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki