Model selection in reinforcement learning (Q415618): Difference between revisions

@@ Property / describes a project that uses @@
+ElemStatLearn
@@ Property / describes a project that uses: ElemStatLearn / rank @@
+Normal rank
@@ Property / describes a project that uses @@
+PRMLT
@@ Property / describes a project that uses: PRMLT / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1007/s10994-011-5254-7
+Normal rank
@@ Property / OpenAlex ID @@
+W2006330826
@@ Property / OpenAlex ID: W2006330826 / rank @@
+Normal rank
@@ Property / cites work @@
+Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
+Normal rank
@@ Property / cites work @@
+A survey of cross-validation procedures for model selection
+Normal rank
@@ Property / cites work @@
+Q3973919
@@ Property / cites work: Q3973919 / rank @@
+Normal rank
@@ Property / cites work @@
+Model selection and error estimation
@@ Property / cites work: Model selection and error estimation / rank @@
+Normal rank
@@ Property / cites work @@
+Local Rademacher complexities
@@ Property / cites work: Local Rademacher complexities / rank @@
+Normal rank
@@ Property / cites work @@
+Stochastic optimal control. The discrete time case
+Normal rank
@@ Property / cites work @@
+Q4257216
@@ Property / cites work: Q4257216 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5483032
@@ Property / cites work: Q5483032 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3093261
@@ Property / cites work: Q3093261 / rank @@
+Normal rank
@@ Property / cites work @@
+Memory-universal prediction of stationary random processes
+Normal rank
@@ Property / cites work @@
+A distribution-free theory of nonparametric regression
+Normal rank
@@ Property / cites work @@
+The elements of statistical learning. Data mining, inference, and prediction
+Normal rank
@@ Property / cites work @@
+.1162/1532443041827907
@@ Property / cites work: 10.1162/1532443041827907 / rank @@
+Normal rank
@@ Property / cites work @@
+Complexity regularization via localized random penalties
+Normal rank
@@ Property / cites work @@
+Nonparametric time series prediction through adaptive model selection
+Normal rank
@@ Property / cites work @@
+Basis function adaptation in temporal difference reinforcement learning
+Normal rank
@@ Property / cites work @@
+Markov Chains and Stochastic Stability
@@ Property / cites work: Markov Chains and Stochastic Stability / rank @@
+Normal rank
@@ Property / cites work @@
+Q3394879
@@ Property / cites work: Q3394879 / rank @@
+Normal rank
@@ Property / cites work @@
+Concentration of measure inequalities for Markov chains and \(\Phi\)-mixing processes.
+Normal rank
@@ Property / cites work @@
+Algorithms for Reinforcement Learning
@@ Property / cites work: Algorithms for Reinforcement Learning / rank @@
+Normal rank
@@ Property / cites work @@
+Q3655724
@@ Property / cites work: Q3655724 / rank @@
+Normal rank
@@ Property / cites work @@
+Oracle inequalities for multi-fold cross validation
+Normal rank
@@ Property / cites work @@
+Model selection in nonparametric regression
@@ Property / cites work: Model selection in nonparametric regression / rank @@
+Normal rank