Bounds for the quality and the number of steps in Bellman's value iteration algorithm (Q1317533): Difference between revisions

@@ Property / cites work @@
+Q5599448
@@ Property / cites work: Q5599448 / rank @@
+Normal rank
@@ Property / cites work @@
+A polynomial time bound for Howard's policy improvement algorithm
+Normal rank
@@ Property / cites work @@
+Q3730373
@@ Property / cites work: Q3730373 / rank @@
+Normal rank
@@ Property / cites work @@
+On the Fixed Points of the Optimal Reward Operator in Stochastic Dynamic Programming with Discount Factor Greater than One
+Normal rank
@@ Property / cites work @@
+Bounds and good policies in stationary finite–stage Markovian decision problems
+Normal rank
@@ Property / cites work @@
+Abschätzungen für Spektralwerte
@@ Property / cites work: Abschätzungen für Spektralwerte / rank @@
+Normal rank