Near-optimal PAC bounds for discounted MDPs
From MaRDI portal
Publication:465258
DOI: 10.1016/j.tcs.2014.09.029
zbMATH Open: 1360.68528
OpenAlex: W1969276875
Wikidata: Q58012230
Scholia: Q58012230
MaRDI QID: Q465258
FDO: Q465258
Authors: Tor Lattimore, Marcus Hutter
Publication date: 31 October 2014
Published in: Theoretical Computer Science
Full work available at URL: https://doi.org/10.1016/j.tcs.2014.09.029
Cites Work
- Asymptotically efficient adaptive allocation rules
- Concentration Inequalities and Martingale Inequalities: A Survey
- The sample complexity of exploration in the multi-armed bandit problem
- The variance of discounted Markov decision processes
- Reinforcement learning in finite MDPs: PAC analysis
- Near-optimal regret bounds for reinforcement learning
- An analysis of model-based interval estimation for Markov decision processes
- PAC Bounds for Discounted MDPs
- Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model
- Bayesian Reinforcement Learning with Exploration
Cited In (10)
- Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model
- Reinforcement learning in finite MDPs: PAC analysis
- Near-optimal regret bounds for reinforcement learning
- Complexity bounds for approximately solving discounted MDPs by value iterations
- Title not available
- Near-optimal reinforcement learning in polynomial time
- Extreme state aggregation beyond Markov decision processes
- Is Temporal Difference Learning Optimal? An Instance-Dependent Analysis
- Optimistic Posterior Sampling for Reinforcement Learning: Worst-Case Regret Bounds
- PAC Bounds for Discounted MDPs