Empirical Dynamic Programming (Q2806811): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Changed an Item
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: Learning Algorithms for Markov Decision Processes with Average Cost / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate Fixed Point Iteration with an Application to Infinite Horizon Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Neural Network Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Associative search network: A reinforcement learning associative memory / rank
 
Normal rank
Property / cites work
 
Property / cites work: Functional Approximations and Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate policy iteration: a survey and some new methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q-Learning for Risk-Sensitive Control / rank
 
Normal rank
Property / cites work
 
Property / cites work: The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: A survey of some simulation-based algorithms for Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Performance Guarantees for Empirical Markov Decision Processes with Applications to Multiperiod Inventory Models / rank
 
Normal rank
Property / cites work
 
Property / cites work: CONVERGENCE OF SIMULATION-BASED POLICY ITERATION / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3093180 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5635252 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simulation‐based Uniform Value Function Estimates of Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simulation-based optimization of Markov decision processes: an empirical process theory approach / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic Estimation of the Maximum of a Regression Function / rank
 
Normal rank
Property / cites work
 
Property / cites work: Actor-Critic--Type Learning Algorithms for Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence rate of linear two-time-scale stochastic approximation. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Analysis of recursive stochastic algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2778807 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3096132 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Complexity of Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Stochastic Approximation Method / rank
 
Normal rank
Property / cites work
 
Property / cites work: Using Randomization to Break the Curse of Dimensionality / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic Games / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/153244303768966102 / rank
 
Normal rank
Property / cites work
 
Property / cites work: \({\mathcal Q}\)-learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximations of Dynamic Programs, I / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximations of Dynamic Programs, II / rank
 
Normal rank

Latest revision as of 01:01, 12 July 2024

scientific article
Language Label Description Also known as
English
Empirical Dynamic Programming
scientific article

    Statements

    Empirical Dynamic Programming (English)
    0 references
    0 references
    0 references
    0 references
    19 May 2016
    0 references
    dynamic programming
    0 references
    empirical methods
    0 references
    Markov decision processes
    0 references
    random operators
    0 references
    probabilistic fixed points
    0 references
    simulation
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references