Markov decision processes (Q5904001): Difference between revisions

From MaRDI portal
Set OpenAlex properties.
ReferenceBot (talk | contribs)
Changed an Item
 
Property / cites work
 
Property / cites work: Q3241581 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3286740 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Stochastic optimal control. The discrete time case / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4209222 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Discounted Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Contraction Mappings in the Theory Underlying Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5561586 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Finite state Markovian decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: On the Optimality of Myopic Policies in Sequential Decision Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Vector-Valued Dynamic Programming / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3313617 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Performance evaluation and perturbation analysis of discrete event dynamic systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3266141 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5635252 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4739658 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sequential Decision Problems with Expected Utility Criteria. III: Upper and Lower Transience / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence of Dynamic Programming Models / rank
 
Normal rank
Property / cites work
 
Property / cites work: A modified dynamic programming method for Markovian decision problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Letter to the Editor—A Test for Suboptimal Actions in Markovian Decision Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5549539 / rank
 
Normal rank
Property / cites work
 
Property / cites work: An Iterative Aggregation Procedure for Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: On Finding the Maximal Gain for Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Suboptimal policy determination for large-scale Markov decision processes. II: Implementation and numerical evaluation / rank
 
Normal rank
Property / cites work
 
Property / cites work: Some Bounds for Discounted Sequential Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Bounds and Transformations for Discounted Finite Markov Decision Chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Modified Policy Iteration Algorithms for Discounted Markov Decision Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Action Elimination Procedures for Modified Policy Iteration Algorithms / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5615108 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3683893 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5602035 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The Asymptotic Behavior of Undiscounted Value Iteration in Markov Decision Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Iterative Aggregation-Disaggregation Procedures for Discounted Semi-Markov Reward Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Sufficient statistics in the optimum control of stochastic systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Minimizing a Submodular Function on a Lattice / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3912356 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4170121 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3890445 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Suboptimal Design for Large Scale, Multimodule Systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Suboptimal policy determination for large-scale Markov decision processes. I: Description and bounds / rank
 
Normal rank
Property / cites work
 
Property / cites work: Reward Revision for Discounted Markov Decision Problems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Parameter Imprecision in Finite State, Finite Action Dynamic Programs / rank
 
Normal rank
Property / cites work
 
Property / cites work: Reward revision and the average reward Markov decision process / rank
 
Normal rank
Property / cites work
 
Property / cites work: Markov Decision Processes with Imprecise Transition Probabilities / rank
 
Normal rank
Property / cites work
 
Property / cites work: Dynamic programming, Markov chains, and the method of successive approximations / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3034625 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3867541 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3856450 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimality and efficiency. I / rank
 
Normal rank
Property / cites work
 
Property / cites work: Multi-objective infinite-horizon discounted Markov decision processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: A Survey of Applications of Markov Decision Processes / rank
 
Normal rank
Property / cites work
 
Property / cites work: Infinite horizon Markov decision processes with unknown or variable discount factors / rank
 
Normal rank
Property / cites work
 
Property / cites work: Mean, variance and probabilistic criteria in finite Markov decision processes: A review / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximations of Dynamic Programs, I / rank
 
Normal rank
Property / cites work
 
Property / cites work: Approximations of Dynamic Programs, II / rank
 
Normal rank

Latest revision as of 10:12, 20 June 2024

scientific article; zbMATH DE number 4110496
Language Label Description Also known as
English
Markov decision processes
scientific article; zbMATH DE number 4110496

    Statements

    Markov decision processes (English)
    0 references
    0 references
    0 references
    1989
    0 references
    The paper is an introduction to Markov decision processes mainly addressed to possible applicants. Therefore it presents a finite model only, but a broad variety of objectives, algorithms (e.g. aggregation), and extensions (e.g. semi-Markov, partially observed, adaptive multiobjective, and constrained models). Some remarks on possible future research are added.
    0 references
    0 references
    discrete event dynamic systems
    0 references
    introduction
    0 references
    semi-Markov
    0 references
    partially observed
    0 references
    adaptive
    0 references
    multiobjective
    0 references
    constrained models
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references
    0 references