Model selection in reinforcement learning
From MaRDI portal
Publication:415618
DOI10.1007/s10994-011-5254-7zbMath1237.68143OpenAlexW2006330826MaRDI QIDQ415618
Csaba Szepesvári, Amir-massoud Farahmand
Publication date: 8 May 2012
Published in: Machine Learning (Search for Journal in Brave)
Full work available at URL: https://doi.org/10.1007/s10994-011-5254-7
model selectionadaptivityreinforcement learningcomplexity regularizationfinite-sample boundsoff-policy learningoffline learning
Computational learning theory (68Q32) Learning and adaptive systems in artificial intelligence (68T05) Markov and semi-Markov decision processes (90C40)
Related Items (3)
Reinforcement learning algorithms with function approximation: recent advances and applications ⋮ Off-Policy Estimation of Long-Term Average Outcomes With Applications to Mobile Health ⋮ Batch policy learning in average reward Markov decision processes
Uses Software
Cites Work
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- Unnamed Item
- A survey of cross-validation procedures for model selection
- Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
- Stochastic optimal control. The discrete time case
- Model selection in nonparametric regression
- Nonparametric time series prediction through adaptive model selection
- A distribution-free theory of nonparametric regression
- Concentration of measure inequalities for Markov chains and \(\Phi\)-mixing processes.
- Complexity regularization via localized random penalties
- Basis function adaptation in temporal difference reinforcement learning
- Local Rademacher complexities
- Oracle inequalities for multi-fold cross validation
- Algorithms for Reinforcement Learning
- Markov Chains and Stochastic Stability
- Memory-universal prediction of stationary random processes
- 10.1162/1532443041827907
- The elements of statistical learning. Data mining, inference, and prediction
- Model selection and error estimation
This page was built for publication: Model selection in reinforcement learning