The method of value oriented successive approximations for the average reward Markov decision process (Q1144501): Difference between revisions

@@ Property / cites work @@
+Optimal decision procedures for finite Markov chains. Part II: Communicating systems
+Normal rank
@@ Property / cites work @@
+Q3245701
@@ Property / cites work: Q3245701 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3251743
@@ Property / cites work: Q3251743 / rank @@
+Normal rank
@@ Property / cites work @@
+Technical Note—Bounds on the Gain of a Markov Decision Process
+Normal rank
@@ Property / cites work @@
+Technical Note—The Method of Successive Approximations and Markovian Decision Problems
+Normal rank
@@ Property / cites work @@
+Q3266141
@@ Property / cites work: Q3266141 / rank @@
+Normal rank
@@ Property / cites work @@
+Technical Note—Undiscounted Markov Renewal Programming Via Modified Successive Approximations
+Normal rank
@@ Property / cites work @@
+Discounting, Ergodicity and Convergence for Markov Decision Processes
+Normal rank
@@ Property / cites work @@
+A set of successive approximation methods for discounted Markovian decision problems
+Normal rank
@@ Property / cites work @@
+Q4190426
@@ Property / cites work: Q4190426 / rank @@
+Normal rank
@@ Property / cites work @@
+On Finding the Maximal Gain for Markov Decision Processes
+Normal rank
@@ Property / cites work @@
+Technical Note—Improved Conditions for Convergence in Undiscounted Markov Renewal Programming
+Normal rank
@@ Property / cites work @@
+Some Bounds for Discounted Sequential Decision Processes
+Normal rank
@@ Property / cites work @@
+Iterative solution of the functional equations of undiscounted Markov renewal programming
+Normal rank
@@ Property / cites work @@
+The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems
+Normal rank
@@ Property / cites work @@
+Geometric convergence of value-iteration in multichain Markov decision problems
+Normal rank
@@ Property / cites work @@
+A successive approximation algorithm for an undiscounted Markov decision process
+Normal rank
@@ Property / cites work @@
+Dynamic programming, Markov chains, and the method of successive approximations
+Normal rank