A unified approach to time-aggregated Markov decision processes (Q259403): Difference between revisions

@@ Property / cites work @@
+Time aggregated Markov decision processes via standard dynamic programming
+Normal rank
@@ Property / cites work @@
+Recent advances in hierarchical reinforcement learning
+Normal rank
@@ Property / cites work @@
+A New Value Iteration method for the Average Cost Dynamic Programming Problem
+Normal rank
@@ Property / cites work @@
+Q4821526
@@ Property / cites work: Q4821526 / rank @@
+Normal rank
@@ Property / cites work @@
+Semi-markov decision problems and performance sensitivity analysis
+Normal rank
@@ Property / cites work @@
+Q5425954
@@ Property / cites work: Q5425954 / rank @@
+Normal rank
@@ Property / cites work @@
+Perturbation realization, potentials, and sensitivity analysis of Markov processes
+Normal rank
@@ Property / cites work @@
+A time aggregation approach to Markov decision processes
+Normal rank
@@ Property / cites work @@
+Continuous-time Markov decision processes. Theory and applications
+Normal rank
@@ Property / cites work @@
+Q5635253
@@ Property / cites work: Q5635253 / rank @@
+Normal rank
@@ Property / cites work @@
+A basic formula for performance gradient estimation of semi-Markov decision processes
+Normal rank
@@ Property / cites work @@
+Q4315289
@@ Property / cites work: Q4315289 / rank @@
+Normal rank
@@ Property / cites work @@
+Markov decision Processes with fractional costs
@@ Property / cites work: Markov decision Processes with fractional costs / rank @@
+Normal rank
@@ Property / cites work @@
+Q4367948
@@ Property / cites work: Q4367948 / rank @@
+Normal rank
@@ Property / cites work @@
+Incremental Value Iteration for Time-Aggregated Markov-Decision Processes
+Normal rank
@@ Property / cites work @@
+Performance gradient estimation for the very large finite Markov chains
+Normal rank