On Convergence of Value Iteration for a Class of Total Cost Markov Decision Processes (Q5502179): Difference between revisions

@@ Property / arXiv ID @@
+.1459
@@ Property / arXiv ID: 1411.1459 / rank @@
+Normal rank
@@ Property / cites work @@
+Monotone Mappings with Application in Dynamic Programming
+Normal rank
@@ Property / cites work @@
+Stochastic optimal control. The discrete time case
+Normal rank
@@ Property / cites work @@
+An Analysis of Stochastic Shortest Path Problems
@@ Property / cites work: An Analysis of Stochastic Shortest Path Problems / rank @@
+Normal rank
@@ Property / cites work @@
+Q4257216
@@ Property / cites work: Q4257216 / rank @@
+Normal rank
@@ Property / cites work @@
+Q5583572
@@ Property / cites work: Q5583572 / rank @@
+Normal rank
@@ Property / cites work @@
+A Borel Set Not Containing a Graph
@@ Property / cites work: A Borel Set Not Containing a Graph / rank @@
+Normal rank
@@ Property / cites work @@
+The optimal reward operator in dynamic programming
+Normal rank
@@ Property / cites work @@
+Q3527701
@@ Property / cites work: Q3527701 / rank @@
+Normal rank
@@ Property / cites work @@
+Value iteration and optimization of multiclass queueing networks
+Normal rank
@@ Property / cites work @@
+Real Analysis and Probability
@@ Property / cites work: Real Analysis and Probability / rank @@
+Normal rank
@@ Property / cites work @@
+The Expected Total Cost Criterion for Markov Decision Processes under Constraints: A Convex Analytic Approach
+Normal rank
@@ Property / cites work @@
+The Expected Total Cost Criterion for Markov Decision Processes under Constraints
+Normal rank
@@ Property / cites work @@
+Q3237805
@@ Property / cites work: Q3237805 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3807013
@@ Property / cites work: Q3807013 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4547438
@@ Property / cites work: Q4547438 / rank @@
+Normal rank
@@ Property / cites work @@
+Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities
+Normal rank
@@ Property / cites work @@
+Q3329244
@@ Property / cites work: Q3329244 / rank @@
+Normal rank
@@ Property / cites work @@
+A simple proof of Whittle's bridging condition in dynamic programming
+Normal rank
@@ Property / cites work @@
+Q4255598
@@ Property / cites work: Q4255598 / rank @@
+Normal rank
@@ Property / cites work @@
+On the Optimality of Structured Policies in Countable Stage Decision Processes. II: Positive and Negative Problems
+Normal rank
@@ Property / cites work @@
+Q5541832
@@ Property / cites work: Q5541832 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4421713
@@ Property / cites work: Q4421713 / rank @@
+Normal rank
@@ Property / cites work @@
+The Optimal Reward Operator in Negative Dynamic Programming
+Normal rank
@@ Property / cites work @@
+Q4881151
@@ Property / cites work: Q4881151 / rank @@
+Normal rank
@@ Property / cites work @@
+Control Techniques for Complex Networks
@@ Property / cites work: Control Techniques for Complex Networks / rank @@
+Normal rank
@@ Property / cites work @@
+On the Existence of Stationary Optimal Strategies
@@ Property / cites work: On the Existence of Stationary Optimal Strategies / rank @@
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
+Normal rank
@@ Property / cites work @@
+Stationary Policies in Dynamic Programming Models Under Compactness Assumptions
+Normal rank
@@ Property / cites work @@
+Stationary policies and Markov policies in Borel dynamic programming
+Normal rank
@@ Property / cites work @@
+Q4194027
@@ Property / cites work: Q4194027 / rank @@
+Normal rank
@@ Property / cites work @@
+Universally Measurable Policies in Dynamic Programming
+Normal rank
@@ Property / cites work @@
+Q4626283
@@ Property / cites work: Q4626283 / rank @@
+Normal rank
@@ Property / cites work @@
+Algorithms for Reinforcement Learning
@@ Property / cites work: Algorithms for Reinforcement Learning / rank @@
+Normal rank
@@ Property / cites work @@
+Asynchronous stochastic approximation and Q-learning
+Normal rank
@@ Property / cites work @@
+Q3912356
@@ Property / cites work: Q3912356 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4192588
@@ Property / cites work: Q4192588 / rank @@
+Normal rank
@@ Property / cites work @@
+A simple condition for regularity in negative programming
+Normal rank
@@ Property / cites work @@
+A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies
+Normal rank
@@ Property / cites work @@
+Q3975565
@@ Property / cites work: Q3975565 / rank @@
+Normal rank