Chaotic dynamics and convergence analysis of temporal difference algorithms with bang-bang control (Q2800471): Difference between revisions

@@ Property / full work available at URL @@
+https://doi.org/10.1002/oca.2156
@@ Property / full work available at URL: https://doi.org/10.1002/oca.2156 / rank @@
+Normal rank
@@ Property / OpenAlex ID @@
+W1520148289
@@ Property / OpenAlex ID: W1520148289 / rank @@
+Normal rank
@@ Property / cites work @@
+Optimal chaos control through reinforcement learning
+Normal rank
@@ Property / cites work @@
+Temporal Difference Methods for General Projected Equations
+Normal rank
@@ Property / cites work @@
+.1162/153244303768966102
@@ Property / cites work: 10.1162/153244303768966102 / rank @@
+Normal rank
@@ Property / cites work @@
+An analysis of temporal-difference learning with function approximation
+Normal rank
@@ Property / cites work @@
+The convergence of \(TD(\lambda)\) for general \(\lambda\)
+Normal rank
@@ Property / cites work @@
+On the worst-case analysis of temporal-difference learning algorithms
+Normal rank
@@ Property / cites work @@
+First-Order System Least Squares for Second-Order Partial Differential Equations: Part II
+Normal rank
@@ Property / cites work @@
+Micro-chaotic dynamics due to digital sampling in hybrid systems of Filippov type
+Normal rank