Mathematical Research Data Initiative
Main page
Recent changes
Random page
SPARQL
MaRDI@GitHub
New item
Special pages
In other projects
MaRDI portal item
Discussion
View source
View history
English
Log in

Iteratively extending time horizon reinforcement learning.

From MaRDI portal
Publication:5897339
Jump to:navigation, search

DOI10.1007/B13633zbMATH Open1257.68123OpenAlexW2479545322MaRDI QIDQ5897339FDOQ5897339


Authors: D. Ernst, Pierre Geurts, Louis Wehenkel Edit this on Wikidata


Publication date: 23 February 2010

Published in: Lecture Notes in Computer Science (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/b13633




Recommendations

  • scientific article; zbMATH DE number 5957269
  • 10.1162/1532443041827907
  • Reinforcement learning: a tutorial survey and recent advances
  • \({\mathcal Q}\)-learning
  • Learning Near-Optimal Policies with Bellman-Residual Minimization Based Fitted Policy Iteration and a Single Sample Path


Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05) Stochastic learning and adaptive control (93E35)



Cited In (4)

  • Approximate Value Iteration with Temporally Extended Actions
  • Epoch-incremental reinforcement learning algorithms
  • Title not available (Why is that?)
  • Batch mode reinforcement learning based on the synthesis of artificial trajectories





This page was built for publication: Iteratively extending time horizon reinforcement learning.

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5897339)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5897339&oldid=16696667"
Tools
What links here
Related changes
Printable version
Permanent link
Page information
This page was last edited on 4 February 2024, at 17:43. Warning: Page may not contain recent updates.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki