A perturbation approach to a class of discounted approximate value iteration algorithms with Borel spaces (Q330284): Difference between revisions

@@ Property / Mathematics Subject Classification ID @@
+E20
@@ Property / Mathematics Subject Classification ID: 93E20 / rank @@
+Normal rank
@@ Property / Mathematics Subject Classification ID @@
+C59
@@ Property / Mathematics Subject Classification ID: 90C59 / rank @@
+Normal rank
@@ Property / Mathematics Subject Classification ID @@
+C40
@@ Property / Mathematics Subject Classification ID: 90C40 / rank @@
+Normal rank
@@ Property / zbMATH DE Number @@
+6643067
@@ Property / zbMATH DE Number: 6643067 / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+Markov decision processes
@@ Property / zbMATH Keywords: Markov decision processes / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+discounted criterion
@@ Property / zbMATH Keywords: discounted criterion / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+approximate value iteration algorithm
@@ Property / zbMATH Keywords: approximate value iteration algorithm / rank @@
+Normal rank
@@ Property / zbMATH Keywords @@
+perturbed models
@@ Property / zbMATH Keywords: perturbed models / rank @@
+Normal rank
@@ Property / MaRDI profile type @@
+MaRDI publication profile
@@ Property / MaRDI profile type: MaRDI publication profile / rank @@
+Normal rank
@@ Property / full work available at URL @@
+https://doi.org/10.3934/jdg.2016014
+Normal rank
@@ Property / OpenAlex ID @@
+W2507723477
@@ Property / OpenAlex ID: W2507723477 / rank @@
+Normal rank
@@ Property / cites work @@
+Approximate Fixed Point Iteration with an Application to Infinite Horizon Markov Decision Processes
+Normal rank
@@ Property / cites work @@
+Approximate dynamic programming via direct search in the space of value function approximations
+Normal rank
@@ Property / cites work @@
+Q3795523
@@ Property / cites work: Q3795523 / rank @@
+Normal rank
@@ Property / cites work @@
+Approximate policy iteration: a survey and some new methods
+Normal rank
@@ Property / cites work @@
+Q4257216
@@ Property / cites work: Q4257216 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4433637
@@ Property / cites work: Q4433637 / rank @@
+Normal rank
@@ Property / cites work @@
+The approximation of continuous functions by positive linear operators
+Normal rank
@@ Property / cites work @@
+Approximation of Infinite Horizon Discounted Cost Markov Decision Processes
+Normal rank
@@ Property / cites work @@
+On the existence of fixed points for approximate value iteration and temporal-difference learning
+Normal rank
@@ Property / cites work @@
+Adaptive Markov control processes
@@ Property / cites work: Adaptive Markov control processes / rank @@
+Normal rank
@@ Property / cites work @@
+Q4255598
@@ Property / cites work: Q4255598 / rank @@
+Normal rank
@@ Property / cites work @@
+An Approximate Dynamic Programming Algorithm for Monotone Value Functions
+Normal rank
@@ Property / cites work @@
+Performance Bounds in $L_p$‐norm for Approximate Value Iteration
+Normal rank
@@ Property / cites work @@
+Approximate Dynamic Programming
@@ Property / cites work: Approximate Dynamic Programming / rank @@
+Normal rank
@@ Property / cites work @@
+A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications
+Normal rank
@@ Property / cites work @@
+Perspectives of approximate dynamic programming
@@ Property / cites work: Perspectives of approximate dynamic programming / rank @@
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4369442
@@ Property / cites work: Q4369442 / rank @@
+Normal rank
@@ Property / cites work @@
+Continuous state dynamic programming via nonexpansive approximation
+Normal rank
@@ Property / cites work @@
+Q3630866
@@ Property / cites work: Q3630866 / rank @@
+Normal rank
@@ Property / cites work @@
+A survey of Markov decision models for control of networks of queues
+Normal rank
@@ Property / cites work @@
+Performance Loss Bounds for Approximate Value Iteration with State Aggregation
+Normal rank
@@ Property / cites work @@
+Application of average dynamic programming to inventory systems
+Normal rank
@@ Property / cites work @@
+A Survey of Applications of Markov Decision Processes
+Normal rank
@@ links / mardi / name / links / mardi / name @@
+Publication:330284