Simulation-based optimization of Markov decision processes: an empirical process theory approach
From MaRDI portal
Recommendations
- Simulation‐based Uniform Value Function Estimates of Markov Decision Processes
- Computing optimal policies for Markovian decision processes using simulation
- Simulation-based optimization of Markov reward processes
- A survey of some simulation-based algorithms for Markov decision processes
- Simulation-based algorithms for Markov decision processes.
Cites work
- scientific article; zbMATH DE number 1804129 (Why is no real title available?)
- scientific article; zbMATH DE number 51427 (Why is no real title available?)
- scientific article; zbMATH DE number 3474804 (Why is no real title available?)
- scientific article; zbMATH DE number 1753152 (Why is no real title available?)
- Approximate gradient methods in policy-space optimization of Markov reward processes
- Concentration of measure and isoperimetric inequalities in product spaces
- Conservation Laws, Extended Polymatroids and Multiarmed Bandit Problems; A Polyhedral Approach to Indexable Systems
- Decision theoretic generalizations of the PAC model for neural net and other learning applications
- Dynamic Programming Conditions for Partially Observable Stochastic Systems
- Learning and generalisation. With applications to neural networks.
- Necessary and Sufficient Conditions for the Uniform Convergence of Means to their Expectations
- Neural Network Learning
- On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies
- Rate of Convergence of Empirical Measures and Costs in Controlled Markov Chains and Transient Optimality
- Scale-sensitive dimensions, uniform convergence, and learnability
- Simulation-based algorithms for Markov decision processes.
- Simulation‐based Uniform Value Function Estimates of Markov Decision Processes
- Stochastic learning and optimization. A sensitivity-based approach.
- The concentration of measure phenomenon
- Uniform Central Limit Theorems
Cited in
(10)- Simulation‐based Uniform Value Function Estimates of Markov Decision Processes
- Some limit properties of Markov chains induced by recursive stochastic algorithms
- Empirical dynamic programming
- Efficient PAC learning for episodic tasks with acyclic state spaces
- Simulation-based optimization of Markov reward processes
- Primal-Dual Regression Approach for Markov Decision Processes with General State and Action Spaces
- Relevant states and memory in Markov chain bootstrapping and simulation
- On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies
- Computing optimal policies for Markovian decision processes using simulation
- scientific article; zbMATH DE number 1804129 (Why is no real title available?)
This page was built for publication: Simulation-based optimization of Markov decision processes: an empirical process theory approach
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q608432)