Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path (Q1009248): Difference between revisions

@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1007/s10994-007-5038-2
+Normal rank
@@ Property / OpenAlex ID @@
+W2104753538
@@ Property / OpenAlex ID: W2104753538 / rank @@
+Normal rank
@@ Property / cites work @@
+Neural Network Learning
@@ Property / cites work: Neural Network Learning / rank @@
+Normal rank
@@ Property / cites work @@
+Learning Near-Optimal Policies with Bellman-Residual Minimization Based Fitted Policy Iteration and a Single Sample Path
+Normal rank
@@ Property / cites work @@
+Adaptive estimation in autoregression or \(\beta\)-mixing regression via model selection
+Normal rank
@@ Property / cites work @@
+Functional Approximations and Dynamic Programming
@@ Property / cites work: Functional Approximations and Dynamic Programming / rank @@
+Normal rank
@@ Property / cites work @@
+Stochastic optimal control. The discrete time case
+Normal rank
@@ Property / cites work @@
+Q4257216
@@ Property / cites work: Q4257216 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5477859
@@ Property / cites work: Q5477859 / rank @@
+Normal rank
@@ Property / cites work @@
+MIXING AND MOMENT PROPERTIES OF VARIOUS GARCH AND STOCHASTIC  VOLATILITY MODELS
+Normal rank
@@ Property / cites work @@
+Q5543516
@@ Property / cites work: Q5543516 / rank @@
+Normal rank
@@ Property / cites work @@
+Mixing Conditions for Markov Chains
@@ Property / cites work: Mixing Conditions for Markov Chains / rank @@
+Normal rank
@@ Property / cites work @@
+Q4881152
@@ Property / cites work: Q4881152 / rank @@
+Normal rank
@@ Property / cites work @@
+Mixing: Properties and examples
@@ Property / cites work: Mixing: Properties and examples / rank @@
+Normal rank
@@ Property / cites work @@
+Q3093261
@@ Property / cites work: Q3093261 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4434179
@@ Property / cites work: Q4434179 / rank @@
+Normal rank
@@ Property / cites work @@
+A distribution-free theory of nonparametric regression
+Normal rank
@@ Property / cites work @@
+Sphere packing numbers for subsets of the Boolean \(n\)-cube with bounded Vapnik-Chervonenkis dimension
+Normal rank
@@ Property / cites work @@
+Q3266141
@@ Property / cites work: Q3266141 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3218572
@@ Property / cites work: Q3218572 / rank @@
+Normal rank
@@ Property / cites work @@
+.1162/1532443041827907
@@ Property / cites work: 10.1162/1532443041827907 / rank @@
+Normal rank
@@ Property / cites work @@
+Nonparametric time series prediction through adaptive model selection
+Normal rank
@@ Property / cites work @@
+Markov chains and stochastic stability
@@ Property / cites work: Markov chains and stochastic stability / rank @@
+Normal rank
@@ Property / cites work @@
+Q3093292
@@ Property / cites work: Q3093292 / rank @@
+Normal rank
@@ Property / cites work @@
+Histogram regression estimation using data-dependent partitions
+Normal rank
@@ Property / cites work @@
+Kernel-based reinforcement learning
@@ Property / cites work: Kernel-based reinforcement learning / rank @@
+Normal rank
@@ Property / cites work @@
+Convergence of stochastic processes
@@ Property / cites work: Convergence of stochastic processes / rank @@
+Normal rank
@@ Property / cites work @@
+Q4001821
@@ Property / cites work: Q4001821 / rank @@
+Normal rank
@@ Property / cites work @@
+Generalized polynomial approximations in Markovian decision processes
+Normal rank
@@ Property / cites work @@
+Q5477860
@@ Property / cites work: Q5477860 / rank @@
+Normal rank
@@ Property / cites work @@
+Rates of convergence for empirical processes of stationary mixing sequences
+Normal rank