A unified approach to time-aggregated Markov decision processes (Q259403): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
Property / cites work
 
Property / cites work: Time aggregated Markov decision processes via standard dynamic programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Recent advances in hierarchical reinforcement learning / rank
 
Normal rank
Property / cites work
 
Property / cites work: A New Value Iteration method for the Average Cost Dynamic Programming Problem / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4821526 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Semi-markov decision problems and performance sensitivity analysis / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5425954 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Perturbation realization, potentials, and sensitivity analysis of Markov processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: A time aggregation approach to Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Continuous-time Markov decision processes. Theory and applications / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5635253 / rank
 
Normal rank
Property / cites work
 
Property / cites work: A basic formula for performance gradient estimation of semi-Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4315289 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov decision Processes with fractional costs / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4367948 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Incremental Value Iteration for Time-Aggregated Markov-Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Performance gradient estimation for the very large finite Markov chains / rank
 
Normal rank

Revision as of 14:27, 11 July 2024

scientific article
Language Label Description Also known as
English
A unified approach to time-aggregated Markov decision processes
scientific article

    Statements

    A unified approach to time-aggregated Markov decision processes (English)
    0 references
    0 references
    0 references
    0 references
    11 March 2016
    0 references
    time aggregation
    0 references
    performance sensitivity
    0 references
    Markov decision process
    0 references
    semi-Markov decision process
    0 references

    Identifiers