Suboptimal policy determination for large-scale Markov decision processes. I: Description and bounds (Q799497)

From MaRDI portal
Property / cites work (each with normal rank):
    A survey of maintenance models: The control and surveillance of deteriorating systems
    Quality Control under Markovian Deterioration
    Q5561586
    Q3910270
    Convex composite multi-objective nonsmooth programming
    Applications of dynamic programming and other optimization methods in pest management
    Optimal Integrated Control of Univoltine Pest Populations with Age Structure
    Approximations of Dynamic Programs, I
    Approximations of Dynamic Programs, II
    An Iterative Aggregation Procedure for Markov Decision Processes
    Multilayer control of large Markov chains
    Suboptimal Design for Large Scale, Multimodule Systems
    Dynamic programming and stochastic control

Latest revision as of 15:48, 14 June 2024

scientific article

    Statements

    Suboptimal policy determination for large-scale Markov decision processes. I: Description and bounds (English)
    1985
    This paper is the first of two that present and evaluate an approach for determining suboptimal policies for large-scale Markov decision processes (MDPs). Part 1 is devoted to determining bounds that motivate the development of the suboptimal design approach and indicate its quality; Part 2 [see the following review] is concerned with the implementation and evaluation of that approach. The specific MDP considered is the infinite-horizon, expected total discounted cost MDP with finite state and action spaces. The approach can be described as follows. First, the original MDP is approximated by a specially structured MDP. The special structure suggests how to construct associated smaller, more computationally tractable MDPs. The suboptimal policy for the original MDP is then constructed from the solutions of these smaller MDPs. The key feature of this approach is that the state and action space cardinalities of the smaller MDPs are exponential reductions of the state and action space cardinalities of the original MDP.
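The review does not say what the "specially structured MDP" looks like, so the following is only an illustrative sketch and not the paper's construction. It assumes the simplest possible structure: two fully decoupled modules with additive one-step costs and independent transitions. Under that assumption the optimal discounted cost of the product MDP is the sum of the module values, so solving the small modules is enough. All names here (`value_iteration`, `random_module`, `product_mdp`) are hypothetical helpers introduced for the example.

```python
import itertools
import random


def value_iteration(P, c, beta, tol=1e-10):
    """Minimise expected total discounted cost for a finite MDP.

    P[a][s][t] is the probability of moving s -> t under action a,
    c[s][a] is the one-step cost, and beta in (0, 1) is the discount factor.
    Returns (value function, greedy policy).
    """
    n_states, n_actions = len(c), len(c[0])
    v = [0.0] * n_states
    while True:
        q = [[c[s][a] + beta * sum(P[a][s][t] * v[t] for t in range(n_states))
              for a in range(n_actions)]
             for s in range(n_states)]
        v_new = [min(row) for row in q]
        if max(abs(x - y) for x, y in zip(v_new, v)) < tol:
            return v_new, [row.index(min(row)) for row in q]
        v = v_new


def random_module(n_states, n_actions, rng):
    """Build a small random module (assumed data, for illustration only)."""
    P = []
    for _ in range(n_actions):
        rows = []
        for _ in range(n_states):
            w = [rng.random() + 0.01 for _ in range(n_states)]
            total = sum(w)
            rows.append([x / total for x in w])  # each row sums to 1
        P.append(rows)
    c = [[rng.random() for _ in range(n_actions)] for _ in range(n_states)]
    return P, c


def product_mdp(m1, m2):
    """Joint MDP of two decoupled modules: product transitions, summed costs."""
    (P1, c1), (P2, c2) = m1, m2
    S = list(itertools.product(range(len(c1)), range(len(c2))))
    A = list(itertools.product(range(len(c1[0])), range(len(c2[0]))))
    P = [[[P1[a1][s1][t1] * P2[a2][s2][t2] for (t1, t2) in S]
          for (s1, s2) in S]
         for (a1, a2) in A]
    c = [[c1[s1][a1] + c2[s2][a2] for (a1, a2) in A] for (s1, s2) in S]
    return P, c, S, A


rng = random.Random(0)
beta = 0.9
m1 = random_module(3, 2, rng)
m2 = random_module(4, 2, rng)

# Solve the two small MDPs separately.
v1, pi1 = value_iteration(*m1, beta)
v2, pi2 = value_iteration(*m2, beta)

# Solve the joint MDP directly, for comparison only.
P, c, S, A = product_mdp(m1, m2)
v_joint, pi_joint = value_iteration(P, c, beta)

# Largest difference between the joint value and the sum of module values.
gap = max(abs(v_joint[i] - (v1[s1] + v2[s2])) for i, (s1, s2) in enumerate(S))
print(len(S), len(A), gap)
```

The joint state and action counts are products of the module counts, so they grow exponentially with the number of modules while each module stays small. Real problems are rarely this cleanly decoupled, which is presumably why bounds on the quality of the resulting policy are needed.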
    infinite-horizon expected total discounted cost
    suboptimal policies
    large-scale Markov decision processes
    finite state and action spaces