Q4999029 (Q4999029): Difference between revisions

@@ Property / cites work @@
+Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
+Normal rank
@@ Property / cites work @@
+Proximal Alternating Minimization and Projection Methods for Nonconvex Problems: An Approach Based on the Kurdyka-Łojasiewicz Inequality
+Normal rank
@@ Property / cites work @@
+Q5405224
@@ Property / cites work: Q5405224 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4387224
@@ Property / cites work: Q4387224 / rank @@
+Normal rank
@@ Property / cites work @@
+First-Order Methods in Optimization
@@ Property / cites work: First-Order Methods in Optimization / rank @@
+Normal rank
@@ Property / cites work @@
+Functional Approximations and Dynamic Programming
@@ Property / cites work: Functional Approximations and Dynamic Programming / rank @@
+Normal rank
@@ Property / cites work @@
+Q4257216
@@ Property / cites work: Q4257216 / rank @@
+Normal rank
@@ Property / cites work @@
+Natural actor-critic algorithms
@@ Property / cites work: Natural actor-critic algorithms / rank @@
+Normal rank
@@ Property / cites work @@
+The Łojasiewicz Inequality for Nonsmooth Subanalytic Functions with Applications to Subgradient Dynamical Systems
+Normal rank
@@ Property / cites work @@
+.1162/153244303765208377
@@ Property / cites work: 10.1162/153244303765208377 / rank @@
+Normal rank
@@ Property / cites work @@
+Prediction, Learning, and Games
@@ Property / cites work: Prediction, Learning, and Games / rank @@
+Normal rank
@@ Property / cites work @@
+Online Markov Decision Processes
@@ Property / cites work: Online Markov Decision Processes / rank @@
+Normal rank
@@ Property / cites work @@
+A decision-theoretic generalization of on-line learning and an application to boosting
+Normal rank
@@ Property / cites work @@
+Accelerated gradient methods for nonconvex nonlinear and stochastic programming
+Normal rank
@@ Property / cites work @@
+Random design analysis of ridge regression
@@ Property / cites work: Random design analysis of ridge regression / rank @@
+Normal rank
@@ Property / cites work @@
+Q5791470
@@ Property / cites work: Q5791470 / rank @@
+Normal rank
@@ Property / cites work @@
+Near-optimal reinforcement learning in polynomial time
+Normal rank
@@ Property / cites work @@
+Q2810787
@@ Property / cites work: Q2810787 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3967358
@@ Property / cites work: Q3967358 / rank @@
+Normal rank
@@ Property / cites work @@
+Cubic regularization of Newton method and its global performance
+Normal rank
@@ Property / cites work @@
+Q5744816
@@ Property / cites work: Q5744816 / rank @@
+Normal rank
@@ Property / cites work @@
+Understanding Machine Learning
@@ Property / cites work: Understanding Machine Learning / rank @@
+Normal rank
@@ Property / cites work @@
+Online Learning and Online Convex Optimization
@@ Property / cites work: Online Learning and Online Convex Optimization / rank @@
+Normal rank
@@ Property / cites work @@
+Simple statistical gradient-following algorithms for connectionist reinforcement learning
+Normal rank