A \(K\)-step look-ahead analysis of value iteration algorithms for Markov decision processes (Q1266643): Difference between revisions

@@ Property / cites work @@
+Truncated policy iteration methods
@@ Property / cites work: Truncated policy iteration methods / rank @@
+Normal rank
@@ Property / cites work @@
+Q4193284
@@ Property / cites work: Q4193284 / rank @@
+Normal rank
@@ Property / cites work @@
+Criteria for selecting the relaxation factor of the value iteration algorithm for undiscounted Markov and semi-Markov decision processes
+Normal rank
@@ Property / cites work @@
+Accelerating Procedures of the Value Iteration Algorithm for Discounted Markov Decision Processes, Based on a One-Step Lookahead Analysis
+Normal rank
@@ Property / cites work @@
+Q4284156
@@ Property / cites work: Q4284156 / rank @@
+Normal rank
@@ Property / cites work @@
+Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems
+Normal rank
@@ Property / cites work @@
+Branch-and-Bound Strategies for Dynamic Programming
+Normal rank
@@ Property / cites work @@
+Technical Note—Undiscounted Markov Renewal Programming Via Modified Successive Approximations
+Normal rank
@@ Property / cites work @@
+Q4190426
@@ Property / cites work: Q4190426 / rank @@
+Normal rank
@@ Property / cites work @@
+A set of successive approximation methods for discounted Markovian decision problems
+Normal rank
@@ Property / cites work @@
+Computing Optimal Policies for Controlled Tandem Queueing Systems
+Normal rank
@@ Property / cites work @@
+Discrete versions of an algorithm due to Varaiya
@@ Property / cites work: Discrete versions of an algorithm due to Varaiya / rank @@
+Normal rank
@@ Property / cites work @@
+Some Bounds for Discounted Sequential Decision Processes
+Normal rank
@@ Property / cites work @@
+Technical Note—Accelerated Computation of the Expected Discounted Return in a Markov Chain
+Normal rank
@@ Property / cites work @@
+Bounds and Transformations for Discounted Finite Markov Decision Chains
+Normal rank
@@ Property / cites work @@
+Modified Policy Iteration Algorithms for Discounted Markov Decision Problems
+Normal rank
@@ Property / cites work @@
+Action Elimination Procedures for Modified Policy Iteration Algorithms
+Normal rank
@@ Property / cites work @@
+The convergence of value iteration in discounted Markov decision processes
+Normal rank
@@ Property / cites work @@
+Iterative solution of the functional equations of undiscounted Markov renewal programming
+Normal rank
@@ Property / cites work @@
+A simple technique in Markovian control with applications to resource allocation to resource allocation in communication networks
+Normal rank
@@ Property / cites work @@
+Computational comparison of value iteration algorithms for discounted Markov decision processes
+Normal rank