Simulation-based optimization of Markov decision processes: an empirical process theory approach (Q608432): Difference between revisions

From MaRDI portal
Import240304020342 (talk | contribs)
Set profile property.
ReferenceBot (talk | contribs)
Changed an Item
 
(One intermediate revision by one other user not shown)
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1016/j.automatica.2010.05.021 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2071767680 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Scale-sensitive dimensions, uniform convergence, and learnability / rank
 
Normal rank
Property / cites work
 
Property / cites work: Rate of Convergence of Empirical Measures and Costs in Controlled Markov Chains and Transient Optimality / rank
 
Normal rank
Property / cites work
 
Property / cites work: Neural Network Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4533362 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Conservation Laws, Extended Polymatroids and Multiarmed Bandit Problems; A Polyhedral Approach to Indexable Systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5425954 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simulation-based algorithms for Markov decision processes. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Dynamic Programming Conditions for Partially Observable Stochastic Systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Uniform Central Limit Theorems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4057976 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Decision theoretic generalizations of the PAC model for neural net and other learning applications / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simulation‐based Uniform Value Function Estimates of Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2756809 / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Empirical State-Action Frequencies in Markov Decision Processes Under General Policies / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate gradient methods in policy-space optimization of Markov reward processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3148833 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4001821 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Concentration of measure and isoperimetric inequalities in product spaces / rank
 
Normal rank
Property / cites work
 
Property / cites work: Necessary and Sufficient Conditions for the Uniform Convergence of Means to their Expectations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning and generalisation. With applications to neural networks. / rank
 
Normal rank

Latest revision as of 12:10, 3 July 2024

scientific article
Language Label Description Also known as
English
Simulation-based optimization of Markov decision processes: an empirical process theory approach
scientific article

    Statements

    Simulation-based optimization of Markov decision processes: an empirical process theory approach (English)
    0 references
    0 references
    0 references
    0 references
    25 November 2010
    0 references
    Markov decision processes
    0 references
    learning algorithms
    0 references
    Monte Carlo simulation
    0 references
    stochastic control
    0 references
    optimization
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references