A generalized Kalman filter for fixed point approximation and efficient temporal-difference learning (Q859737): Difference between revisions

@@ Property / full work available at URL @@
+https://doi.org/10.1007/s10626-006-8134-8
+Normal rank
@@ Property / OpenAlex ID @@
+W2062541405
@@ Property / OpenAlex ID: W2062541405 / rank @@
+Normal rank
@@ Property / cites work @@
+Functional Approximations and Dynamic Programming
@@ Property / cites work: Functional Approximations and Dynamic Programming / rank @@
+Normal rank
@@ Property / cites work @@
+Q3997575
@@ Property / cites work: Q3997575 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4209222
@@ Property / cites work: Q4209222 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4858374
@@ Property / cites work: Q4858374 / rank @@
+Normal rank
@@ Property / cites work @@
+Technical update: Least-squares temporal difference learning
+Normal rank
@@ Property / cites work @@
+Q5477859
@@ Property / cites work: Q5477859 / rank @@
+Normal rank
@@ Property / cites work @@
+The convergence of \(TD(\lambda)\) for general \(\lambda\)
+Normal rank
@@ Property / cites work @@
+On the existence of fixed points for approximate value iteration and temporal-difference learning
+Normal rank
@@ Property / cites work @@
+Q4368791
@@ Property / cites work: Q4368791 / rank @@
+Normal rank
@@ Property / cites work @@
+.1162/1532443041827907
@@ Property / cites work: 10.1162/1532443041827907 / rank @@
+Normal rank
@@ Property / cites work @@
+On the convergence of temporal-difference learning with linear function approximation
+Normal rank
@@ Property / cites work @@
+An analysis of temporal-difference learning with function approximation
+Normal rank
@@ Property / cites work @@
+Optimal stopping of Markov processes: Hilbert space theory, approximation algorithms, and an application to pricing high-dimensional financial derivatives
+Normal rank
@@ Property / cites work @@
+Extensions of the multiarmed bandit problem: The discounted case
+Normal rank
@@ Property / cites work @@
+Q5477861
@@ Property / cites work: Q5477861 / rank @@
+Normal rank