Empirical Dynamic Programming (Q2806811): Difference between revisions

@@ Property / DOI @@
-.1287/moor.2015.0733
@@ Property / DOI: 10.1287/moor.2015.0733 / rank @@
-Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / OpenAlex ID @@
+W2593952959
@@ Property / OpenAlex ID: W2593952959 / rank @@
+Normal rank
@@ Property / arXiv ID @@
+.5918
@@ Property / arXiv ID: 1311.5918 / rank @@
+Normal rank
@@ Property / cites work @@
+Learning Algorithms for Markov Decision Processes with Average Cost
+Normal rank
@@ Property / cites work @@
+Approximate Fixed Point Iteration with an Application to Infinite Horizon Markov Decision Processes
+Normal rank
@@ Property / cites work @@
+Neural Network Learning
@@ Property / cites work: Neural Network Learning / rank @@
+Normal rank
@@ Property / cites work @@
+Associative search network: A reinforcement learning associative memory
+Normal rank
@@ Property / cites work @@
+Functional Approximations and Dynamic Programming
@@ Property / cites work: Functional Approximations and Dynamic Programming / rank @@
+Normal rank
@@ Property / cites work @@
+Approximate policy iteration: a survey and some new methods
+Normal rank
@@ Property / cites work @@
+Q-Learning for Risk-Sensitive Control
@@ Property / cites work: Q-Learning for Risk-Sensitive Control / rank @@
+Normal rank
@@ Property / cites work @@
+The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning
+Normal rank
@@ Property / cites work @@
+A survey of some simulation-based algorithms for Markov decision processes
+Normal rank
@@ Property / cites work @@
+Performance Guarantees for Empirical Markov Decision Processes with Applications to Multiperiod Inventory Models
+Normal rank
@@ Property / cites work @@
+CONVERGENCE OF SIMULATION-BASED POLICY ITERATION
@@ Property / cites work: CONVERGENCE OF SIMULATION-BASED POLICY ITERATION / rank @@
+Normal rank
@@ Property / cites work @@
+Q3093180
@@ Property / cites work: Q3093180 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5635252
@@ Property / cites work: Q5635252 / rank @@
+Normal rank
@@ Property / cites work @@
+Simulation‐based Uniform Value Function Estimates of Markov Decision Processes
+Normal rank
@@ Property / cites work @@
+Simulation-based optimization of Markov decision processes: an empirical process theory approach
+Normal rank
@@ Property / cites work @@
+Stochastic Estimation of the Maximum of a Regression Function
+Normal rank
@@ Property / cites work @@
+Actor-Critic--Type Learning Algorithms for Markov Decision Processes
+Normal rank
@@ Property / cites work @@
+Convergence rate of linear two-time-scale stochastic approximation.
+Normal rank
@@ Property / cites work @@
+Analysis of recursive stochastic algorithms
@@ Property / cites work: Analysis of recursive stochastic algorithms / rank @@
+Normal rank
@@ Property / cites work @@
+Q2778807
@@ Property / cites work: Q2778807 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3096132
@@ Property / cites work: Q3096132 / rank @@
+Normal rank
@@ Property / cites work @@
+The Complexity of Markov Decision Processes
@@ Property / cites work: The Complexity of Markov Decision Processes / rank @@
+Normal rank
@@ Property / cites work @@
+A Stochastic Approximation Method
@@ Property / cites work: A Stochastic Approximation Method / rank @@
+Normal rank
@@ Property / cites work @@
+Using Randomization to Break the Curse of Dimensionality
+Normal rank
@@ Property / cites work @@
+Stochastic Games
@@ Property / cites work: Stochastic Games / rank @@
+Normal rank
@@ Property / cites work @@
+.1162/153244303768966102
@@ Property / cites work: 10.1162/153244303768966102 / rank @@
+Normal rank
@@ Property / cites work @@
+\({\mathcal Q}\)-learning
@@ Property / cites work: \({\mathcal Q}\)-learning / rank @@
+Normal rank
@@ Property / cites work @@
+Approximations of Dynamic Programs, I
@@ Property / cites work: Approximations of Dynamic Programs, I / rank @@
+Normal rank
@@ Property / cites work @@
+Approximations of Dynamic Programs, II
@@ Property / cites work: Approximations of Dynamic Programs, II / rank @@
+Normal rank
@@ Property / DOI @@
+.1287/MOOR.2015.0733
@@ Property / DOI: 10.1287/MOOR.2015.0733 / rank @@
+Normal rank
@@ links / mardi / name / links / mardi / name @@
+Publication:2806811