Empirical Dynamic Programming (Q2806811): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
Import241208061232 (talk | contribs)
Normalize DOI.
 
(5 intermediate revisions by 5 users not shown)
Property / DOI
 
Property / DOI: 10.1287/moor.2015.0733 / rank
Normal rank
 
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W2593952959 / rank
 
Normal rank
Property / arXiv ID
 
Property / arXiv ID: 1311.5918 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Learning Algorithms for Markov Decision Processes with Average Cost / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate Fixed Point Iteration with an Application to Infinite Horizon Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Neural Network Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Associative search network: A reinforcement learning associative memory / rank
 
Normal rank
Property / cites work
 
Property / cites work: Functional Approximations and Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximate policy iteration: a survey and some new methods / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q-Learning for Risk-Sensitive Control / rank
 
Normal rank
Property / cites work
 
Property / cites work: The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: A survey of some simulation-based algorithms for Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Performance Guarantees for Empirical Markov Decision Processes with Applications to Multiperiod Inventory Models / rank
 
Normal rank
Property / cites work
 
Property / cites work: CONVERGENCE OF SIMULATION-BASED POLICY ITERATION / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3093180 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5635252 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simulation‐based Uniform Value Function Estimates of Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Simulation-based optimization of Markov decision processes: an empirical process theory approach / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic Estimation of the Maximum of a Regression Function / rank
 
Normal rank
Property / cites work
 
Property / cites work: Actor-Critic--Type Learning Algorithms for Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence rate of linear two-time-scale stochastic approximation. / rank
 
Normal rank
Property / cites work
 
Property / cites work: Analysis of recursive stochastic algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q2778807 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3096132 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Complexity of Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Stochastic Approximation Method / rank
 
Normal rank
Property / cites work
 
Property / cites work: Using Randomization to Break the Curse of Dimensionality / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic Games / rank
 
Normal rank
Property / cites work
 
Property / cites work: 10.1162/153244303768966102 / rank
 
Normal rank
Property / cites work
 
Property / cites work: \({\mathcal Q}\)-learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximations of Dynamic Programs, I / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximations of Dynamic Programs, II / rank
 
Normal rank
Property / DOI
 
Property / DOI: 10.1287/MOOR.2015.0733 / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 23:17, 19 December 2024

scientific article
Language Label Description Also known as
English
Empirical Dynamic Programming
scientific article

    Statements

    Empirical Dynamic Programming (English)
    0 references
    0 references
    0 references
    0 references
    19 May 2016
    0 references
    dynamic programming
    0 references
    empirical methods
    0 references
    Markov decision processes
    0 references
    random operators
    0 references
    probabilistic fixed points
    0 references
    simulation
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references

    Identifiers

    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references