Practical issues in temporal difference learning (Q1812929): Difference between revisions

@@ Property / cites work @@
+Learnability and the Vapnik-Chervonenkis dimension
+Normal rank
@@ Property / cites work @@
+The convergence of \(TD(\lambda)\) for general \(\lambda\)
+Normal rank
@@ Property / cites work @@
+A comparison and evaluation of three machine learning procedures as applied to the game of checkers
+Normal rank
@@ Property / cites work @@
+Multilayer feedforward networks are universal approximators
+Normal rank
@@ Property / cites work @@
+A pattern classification approach to evaluation function learning
+Normal rank
@@ Property / cites work @@
+A Stochastic Approximation Method
@@ Property / cites work: A Stochastic Approximation Method / rank @@
+Normal rank
@@ Property / cites work @@
+Learning representations by back-propagating errors
+Normal rank
@@ Property / cites work @@
+A parallel network that learns to play backgammon
@@ Property / cites work: A parallel network that learns to play backgammon / rank @@
+Normal rank
@@ Property / cites work @@
+On the Uniform Convergence of Relative Frequencies of Events to Their Probabilities
+Normal rank
@@ Property / cites work @@
+On Optimal Doubling in Backgammon
@@ Property / cites work: On Optimal Doubling in Backgammon / rank @@
+Normal rank
@@ Property / DBLP publication ID @@
+journals/ml/Tesauro92
@@ Property / DBLP publication ID: journals/ml/Tesauro92 / rank @@
+Normal rank