Model selection in reinforcement learning
From MaRDI portal
Recommendations
- Learning Near-Optimal Policies with Bellman-Residual Minimization Based Fitted Policy Iteration and a Single Sample Path
- Regularized feature selection in reinforcement learning
- Regularized policy iteration with nonparametric function spaces
- Bayesian Reinforcement Learning with Exploration
- Selecting near-optimal approximate state representations in reinforcement learning
Cites work
- scientific article; zbMATH DE number 5957269 (Why is no real title available?)
- scientific article; zbMATH DE number 5654889 (Why is no real title available?)
- scientific article; zbMATH DE number 17222 (Why is no real title available?)
- scientific article; zbMATH DE number 1321699 (Why is no real title available?)
- 10.1162/1532443041827907
- A distribution-free theory of nonparametric regression
- A survey of cross-validation procedures for model selection
- Algorithms for reinforcement learning.
- Basis function adaptation in temporal difference reinforcement learning
- Complexity regularization via localized random penalties
- Concentration of measure inequalities for Markov chains and \(\Phi\)-mixing processes.
- Gaussian processes for machine learning.
- Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
- Local Rademacher complexities
- Markov Chains and Stochastic Stability
- Memory-universal prediction of stationary random processes
- Model selection and error estimation
- Model selection in nonparametric regression
- Nonparametric time series prediction through adaptive model selection
- Oracle inequalities for multi-fold cross validation
- Pattern recognition and machine learning.
- Stochastic optimal control. The discrete time case
- The elements of statistical learning. Data mining, inference, and prediction
Cited in
(5)- Off-policy estimation of long-term average outcomes with applications to mobile health
- Regularized feature selection in reinforcement learning
- Batch policy learning in average reward Markov decision processes
- Reinforcement learning algorithms with function approximation: recent advances and applications
- scientific article; zbMATH DE number 7306889 (Why is no real title available?)
This page was built for publication: Model selection in reinforcement learning
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q415618)