A projected primal-dual gradient optimal control method for deep reinforcement learning (Q1980960): Difference between revisions

@@ Property / cites work @@
+Q3285814
@@ Property / cites work: Q3285814 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3149379
@@ Property / cites work: Q3149379 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4626283
@@ Property / cites work: Q4626283 / rank @@
+Normal rank
@@ Property / cites work @@
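+Deep learning as optimal control problems: models and numerical methods
@@ Property / cites work: Deep learning as optimal control problems: models and numerical methods / rank @@
+Normal rank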
@@ Property / cites work @@
+Q3182207
@@ Property / cites work: Q3182207 / rank @@
+Normal rank
@@ Property / cites work @@
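+Handbook of Markov decision processes. Methods and applications
@@ Property / cites work: Handbook of Markov decision processes. Methods and applications / rank @@
+Normal rank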
@@ Property / cites work @@
+Q5509984
@@ Property / cites work: Q5509984 / rank @@
+Normal rank
@@ Property / cites work @@
+\({\mathcal Q}\)-learning
@@ Property / cites work: \({\mathcal Q}\)-learning / rank @@
+Normal rank
@@ Property / cites work @@
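+Simple statistical gradient-following algorithms for connectionist reinforcement learning
@@ Property / cites work: Simple statistical gradient-following algorithms for connectionist reinforcement learning / rank @@
+Normal rank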
@@ Property / cites work @@
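+Ordinary differential equations. An introduction from the dynamical systems perspective
@@ Property / cites work: Ordinary differential equations. An introduction from the dynamical systems perspective / rank @@
+Normal rank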
@@ Property / cites work @@
+Optimal control of ODEs and DAEs.
@@ Property / cites work: Optimal control of ODEs and DAEs. / rank @@
+Normal rank
@@ Property / cites work @@
+Q3811611
@@ Property / cites work: Q3811611 / rank @@
+Normal rank
@@ Property / cites work @@
+Reinforcement Learning Applied to a Human Arm Model
@@ Property / cites work: Reinforcement Learning Applied to a Human Arm Model / rank @@
+Normal rank