Model selection in reinforcement learning

From MaRDI portal

Revision as of 03:42, 30 January 2024 by Import240129110155 (talk | contribs) (Created automatically from import240129110155)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Publication:415618

Jump to:navigation, search

DOI10.1007/s10994-011-5254-7zbMath1237.68143OpenAlexW2006330826MaRDI QIDQ415618

Csaba Szepesvári, Amir-massoud Farahmand

Publication date: 8 May 2012

Published in: Machine Learning (Search for Journal in Brave)

Full work available at URL: https://doi.org/10.1007/s10994-011-5254-7

zbMATH Keywords

model selection adaptivity reinforcement learning complexity regularization finite-sample bounds off-policy learning offline learning

Mathematics Subject Classification ID

Computational learning theory (68Q32) Learning and adaptive systems in artificial intelligence (68T05) Markov and semi-Markov decision processes (90C40)

Related Items (3)

Reinforcement learning algorithms with function approximation: recent advances and applications ⋮ Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health ⋮ Batch policy learning in average reward Markov decision processes

Uses Software

Cites Work

This page was built for publication: Model selection in reinforcement learning

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Publication:415618&oldid=12290216"