(Approximate) iterated successive approximations algorithm for sequential decision processes (Q378751): Difference between revisions

@@ Property / cites work @@
+Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
@@ Property / cites work: Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming / rank @@
+Normal rank
@@ Property / cites work @@
+Truncated policy iteration methods
@@ Property / cites work: Truncated policy iteration methods / rank @@
+Normal rank
@@ Property / cites work @@
+Contraction Mappings in the Theory Underlying Dynamic Programming
@@ Property / cites work: Contraction Mappings in the Theory Underlying Dynamic Programming / rank @@
+Normal rank
@@ Property / cites work @@
+Affine Structure and Invariant Policies for Dynamic Programs
@@ Property / cites work: Affine Structure and Invariant Policies for Dynamic Programs / rank @@
+Normal rank
@@ Property / cites work @@
+Block-successive approximation for a discounted Markov decision model
@@ Property / cites work: Block-successive approximation for a discounted Markov decision model / rank @@
+Normal rank
@@ Property / cites work @@
+Q3313617
@@ Property / cites work: Q3313617 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3266141
@@ Property / cites work: Q3266141 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4547434
@@ Property / cites work: Q4547434 / rank @@
+Normal rank
@@ Property / cites work @@
+Some Bounds for Discounted Sequential Decision Processes
@@ Property / cites work: Some Bounds for Discounted Sequential Decision Processes / rank @@
+Normal rank
@@ Property / cites work @@
+Improved iterative computation of the expected discounted return in Markov and semi-Markov chains
@@ Property / cites work: Improved iterative computation of the expected discounted return in Markov and semi-Markov chains / rank @@
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+On the Convergence of Policy Iteration in Stationary Dynamic Programming
@@ Property / cites work: On the Convergence of Policy Iteration in Stationary Dynamic Programming / rank @@
+Normal rank
@@ Property / cites work @@
+Modified Policy Iteration Algorithms for Discounted Markov Decision Problems
@@ Property / cites work: Modified Policy Iteration Algorithms for Discounted Markov Decision Problems / rank @@
+Normal rank
@@ Property / cites work @@
+Action Elimination Procedures for Modified Policy Iteration Algorithms
@@ Property / cites work: Action Elimination Procedures for Modified Policy Iteration Algorithms / rank @@
+Normal rank
@@ Property / cites work @@
+Discounted Markov games: Generalized policy iteration method
@@ Property / cites work: Discounted Markov games: Generalized policy iteration method / rank @@
+Normal rank
@@ Property / cites work @@
+A set of successive approximation methods for discounted Markovian decision problems
@@ Property / cites work: A set of successive approximation methods for discounted Markovian decision problems / rank @@
+Normal rank
@@ Property / cites work @@
+Q4190426
@@ Property / cites work: Q4190426 / rank @@
+Normal rank
@@ Property / cites work @@
+Approximations of Dynamic Programs, I
@@ Property / cites work: Approximations of Dynamic Programs, I / rank @@
+Normal rank