Simulation-based optimization of Markov decision processes: an empirical process theory approach (Q608432): Difference between revisions

@@ Property / full work available at URL @@
+https://doi.org/10.1016/j.automatica.2010.05.021
+Normal rank
@@ Property / OpenAlex ID @@
+W2071767680
@@ Property / OpenAlex ID: W2071767680 / rank @@
+Normal rank
@@ Property / cites work @@
+Scale-sensitive dimensions, uniform convergence, and learnability
+Normal rank
@@ Property / cites work @@
+Rate of Convergence of Empirical Measures and Costs in Controlled Markov Chains and Transient Optimality
+Normal rank
@@ Property / cites work @@
+Neural Network Learning
@@ Property / cites work: Neural Network Learning / rank @@
+Normal rank
@@ Property / cites work @@
+Q4533362
@@ Property / cites work: Q4533362 / rank @@
+Normal rank
@@ Property / cites work @@
+Conservation Laws, Extended Polymatroids and Multiarmed Bandit Problems; A Polyhedral Approach to Indexable Systems
+Normal rank
@@ Property / cites work @@
+Q5425954
@@ Property / cites work: Q5425954 / rank @@
+Normal rank
@@ Property / cites work @@
+Simulation-based algorithms for Markov decision processes.
+Normal rank
@@ Property / cites work @@
+Dynamic Programming Conditions for Partially Observable Stochastic Systems
+Normal rank
@@ Property / cites work @@
+Uniform Central Limit Theorems
@@ Property / cites work: Uniform Central Limit Theorems / rank @@
+Normal rank
@@ Property / cites work @@
+Q4057976
@@ Property / cites work: Q4057976 / rank @@
+Normal rank
@@ Property / cites work @@
+Decision theoretic generalizations of the PAC model for neural net and other learning applications
+Normal rank
@@ Property / cites work @@
+Simulation‐based Uniform Value Function Estimates of Markov Decision Processes
+Normal rank
@@ Property / cites work @@
+Q2756809
@@ Property / cites work: Q2756809 / rank @@
+Normal rank
@@ Property / cites work @@
+On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies
+Normal rank
@@ Property / cites work @@
+Approximate gradient methods in policy-space optimization of Markov reward processes
+Normal rank
@@ Property / cites work @@
+Q3148833
@@ Property / cites work: Q3148833 / rank @@
+Normal rank
@@ Property / cites work @@
+Q4001821
@@ Property / cites work: Q4001821 / rank @@
+Normal rank
@@ Property / cites work @@
+Concentration of measure and isoperimetric inequalities in product spaces
+Normal rank
@@ Property / cites work @@
+Necessary and Sufficient Conditions for the Uniform Convergence of Means to their Expectations
+Normal rank
@@ Property / cites work @@
+Learning and generalisation. With applications to neural networks.
+Normal rank