(Approximate) iterated successive approximations algorithm for sequential decision processes (Q378751): Difference between revisions

@@ Property / Mathematics Subject Classification ID @@
+C40
@@ Property / Mathematics Subject Classification ID: 90C40 / rank @@
+Normal rank
@@ Property / Mathematics Subject Classification ID @@
+C59
@@ Property / Mathematics Subject Classification ID: 90C59 / rank @@
+Normal rank
@@ Property / zbMATH DE Number @@
+6225980
@@ Property / zbMATH DE Number: 6225980 / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+sequential decision processes
@@ Property / zbMATH Keywords: sequential decision processes / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+Markov decision chains
@@ Property / zbMATH Keywords: Markov decision chains / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+successive approximations
@@ Property / zbMATH Keywords: successive approximations / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+modified policy iteration
@@ Property / zbMATH Keywords: modified policy iteration / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.1007/s10479-012-1073-x
+Normal rank
@@ Property / OpenAlex ID @@
+W2014981566
@@ Property / OpenAlex ID: W2014981566 / rank @@
+Normal rank
@@ Property / cites work @@
+Q-Learning and Enhanced Policy Iteration in Discounted Dynamic Programming
+Normal rank
@@ Property / cites work @@
+Truncated policy iteration methods
@@ Property / cites work: Truncated policy iteration methods / rank @@
+Normal rank
@@ Property / cites work @@
+Contraction Mappings in the Theory Underlying Dynamic Programming
+Normal rank
@@ Property / cites work @@
+Affine Structure and Invariant Policies for Dynamic Programs
+Normal rank
@@ Property / cites work @@
+Block-successive approximation for a discounted Markov decision model
+Normal rank
@@ Property / cites work @@
+Q3313617
@@ Property / cites work: Q3313617 / rank @@
+Normal rank
@@ Property / cites work @@
+Q3266141
@@ Property / cites work: Q3266141 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4547434
@@ Property / cites work: Q4547434 / rank @@
+Normal rank
@@ Property / cites work @@
+Some Bounds for Discounted Sequential Decision Processes
+Normal rank
@@ Property / cites work @@
+Improved iterative computation of the expected discounted return in Markov and semi-Markov chains
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+On the Convergence of Policy Iteration in Stationary Dynamic Programming
+Normal rank
@@ Property / cites work @@
+Modified Policy Iteration Algorithms for Discounted Markov Decision Problems
+Normal rank
@@ Property / cites work @@
+Action Elimination Procedures for Modified Policy Iteration Algorithms
+Normal rank
@@ Property / cites work @@
+Discounted Markov games: Generalized policy iteration method
+Normal rank
@@ Property / cites work @@
+A set of successive approximation methods for discounted Markovian decision problems
+Normal rank
@@ Property / cites work @@
+Q4190426
@@ Property / cites work: Q4190426 / rank @@
+Normal rank
@@ Property / cites work @@
+Approximations of Dynamic Programs, I
@@ Property / cites work: Approximations of Dynamic Programs, I / rank @@
+Normal rank
@@ links / mardi / name / links / mardi / name @@
+Publication:378751