Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm (Q300040): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot
Changed an Item
Property / cites work: Time aggregated Markov decision processes via standard dynamic programming / rank: Normal rank
Property / cites work: Approximate dynamic programming via direct search in the space of value function approximations / rank: Normal rank
Property / cites work: Stability and optimality of a multi-product production and storage system under demand uncertainty / rank: Normal rank
Property / cites work: Accelerating the convergence of value iteration by using partial transition functions / rank: Normal rank
Property / cites work: A New Value Iteration method for the Average Cost Dynamic Programming Problem / rank: Normal rank
Property / cites work: Q2925454 / rank: Normal rank
Property / cites work: Q4257216 / rank: Normal rank
Property / cites work: Q4256521 / rank: Normal rank
Property / cites work: Approximate dynamic programming with a fuzzy parameterization / rank: Normal rank
Property / cites work: A time aggregation approach to Markov decision processes / rank: Normal rank
Property / cites work: Simulation-based algorithms for Markov decision processes. / rank: Normal rank
Property / cites work: Sufficient Classes of Strategies in Discrete Dynamic Programming I: Decomposition of Randomized Strategies and Embedded Models / rank: Normal rank
Property / cites work: LAO*: A heuristic search algorithm that finds solutions with loops / rank: Normal rank
Property / cites work: Probabilistic Relational Planning with First Order Decision Diagrams / rank: Normal rank
Property / cites work: Exact finite approximations of average-cost countable Markov decision processes / rank: Normal rank
Property / cites work: Reducing reinforcement learning to KWIK online regression / rank: Normal rank
Property / cites work: Kernel-based reinforcement learning / rank: Normal rank
Property / cites work: A Distributed Actor-Critic Algorithm and Applications to Mobile Sensor Network Coordination Problems / rank: Normal rank
Property / cites work: Approximate Dynamic Programming / rank: Normal rank
Property / cites work: Q4315289 / rank: Normal rank
Property / cites work: Markov decision Processes with fractional costs / rank: Normal rank
Property / cites work: Incremental Value Iteration for Time-Aggregated Markov-Decision Processes / rank: Normal rank
Property / cites work: An analysis of temporal-difference learning with function approximation / rank: Normal rank
Property / cites work: Lebesgue-Sampling-Based Optimal Control Problems With Time Aggregation / rank: Normal rank
Property / cites work: Performance gradient estimation for the very large finite Markov chains / rank: Normal rank

Revision as of 04:58, 12 July 2024

scientific article
Language: English
Label: Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm
Description: scientific article

    Statements

    Solving average cost Markov decision processes by means of a two-phase time aggregation algorithm (English)
    0 references
    23 June 2016
    0 references
    dynamic programming
    0 references
    Markov decision processes
    0 references
    embedding
    0 references
    time aggregation
    0 references
    stochastic optimal control
    0 references

    Identifiers