Approximate dynamic programming via direct search in the space of value function approximations (Q713118): Difference between revisions

@@ Property / cites work @@
+Q3134873
@@ Property / cites work: Q3134873 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3241581
@@ Property / cites work: Q3241581 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4209222
@@ Property / cites work: Q4209222 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4257216
@@ Property / cites work: Q4257216 / rank @@
+Normal rank
@@ Property / cites work @@
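+Projected equation methods for approximate solution of large linear systems
@@ Property / cites work: Projected equation methods for approximate solution of large linear systems / rank @@
+Normal rank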
@@ Property / cites work @@
+A new learning algorithm for optimal stopping
@@ Property / cites work: A new learning algorithm for optimal stopping / rank @@
+Normal rank
@@ Property / cites work @@
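+Performance Loss Bounds for Approximate Value Iteration with State Aggregation
@@ Property / cites work: Performance Loss Bounds for Approximate Value Iteration with State Aggregation / rank @@
+Normal rank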
@@ Property / cites work @@
+Discrete Dynamic Programming with Unbounded Rewards
@@ Property / cites work: Discrete Dynamic Programming with Unbounded Rewards / rank @@
+Normal rank
@@ Property / cites work @@
+10.1162/1532443041827907
@@ Property / cites work: 10.1162/1532443041827907 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5630824
@@ Property / cites work: Q5630824 / rank @@
+Normal rank
@@ Property / cites work @@
+Direct search methods: Then and now
@@ Property / cites work: Direct search methods: Then and now / rank @@
+Normal rank
@@ Property / cites work @@
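+Basis function adaptation in temporal difference reinforcement learning
@@ Property / cites work: Basis function adaptation in temporal difference reinforcement learning / rank @@
+Normal rank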
@@ Property / cites work @@
+Control Techniques for Complex Networks
@@ Property / cites work: Control Techniques for Complex Networks / rank @@
+Normal rank
@@ Property / cites work @@
+Approximate Dynamic Programming
@@ Property / cites work: Approximate Dynamic Programming / rank @@
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+Practical issues in temporal difference learning
@@ Property / cites work: Practical issues in temporal difference learning / rank @@
+Normal rank
@@ Property / cites work @@
+On the Convergence of Pattern Search Algorithms
@@ Property / cites work: On the Convergence of Pattern Search Algorithms / rank @@
+Normal rank
@@ Property / cites work @@
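+An empirical study of policy convergence in Markov decision process value iteration
@@ Property / cites work: An empirical study of policy convergence in Markov decision process value iteration / rank @@
+Normal rank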
@@ Property / cites work @@
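+Convergence Results for Some Temporal Difference Methods Based on Least Squares
@@ Property / cites work: Convergence Results for Some Temporal Difference Methods Based on Least Squares / rank @@
+Normal rank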