Mathematical Research Data Initiative
Main page
Recent changes
Random page
SPARQL
MaRDI@GitHub
New item
In other projects
MaRDI portal item
Discussion
View source
View history
English
Log in

Universal Reinforcement Learning

From MaRDI portal
Publication:5281503
Jump to:navigation, search

DOI10.1109/TIT.2010.2043762zbMATH Open1368.68280OpenAlexW2123742287MaRDI QIDQ5281503FDOQ5281503

Ciamac C. Moallemi, Benjamin Van Roy, Vivek Farias, Tsachy Weissman

Publication date: 27 July 2017

Published in: IEEE Transactions on Information Theory (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1109/tit.2010.2043762




Mathematics Subject Classification ID

Learning and adaptive systems in artificial intelligence (68T05)



Cited In (2)

  • Approximation of average cost Markov decision processes using empirical distributions and concentration inequalities
  • Off-policy evaluation in partially observed Markov decision processes under sequential ignorability






This page was built for publication: Universal Reinforcement Learning

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5281503)

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:5281503&oldid=19928923"
Tools
What links here
Related changes
Printable version
Permanent link
Page information
This page was last edited on 8 February 2024, at 20:55. Warning: Page may not contain recent updates.
Privacy policy
About MaRDI portal
Disclaimers
Imprint
Powered by MediaWiki