Nonstationary value-iteration and adaptive control of discounted semi-Markov processes (Q1068732): Difference between revisions

From MaRDI portal
Property / cites work (normal rank):
    Dynamic programming and stochastic control
    Nonstationary Markov decision problems with converging parameters
    Q4150452
    Adaptive control of service in queueing systems
    Adaptive control of discounted Markov decision chains
    Q5599448
    The average-optimal adaptive control of a Markov renewal model in presence of an unknown parameter
    Strongly consistent estimation in a controlled Markov renewal model
    Q3875736
    Q5649557
    On Dynamic Programming with Unbounded Rewards
    Estimation and control in Markov chains
    Q5615108
    Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal
    Estimation and control in discounted stochastic dynamic programming
    Q4772533

Revision as of 09:03, 17 June 2024

Language: English
Label: Nonstationary value-iteration and adaptive control of discounted semi-Markov processes
Description: scientific article

    Statements

    Nonstationary value-iteration and adaptive control of discounted semi-Markov processes (English)
    1985
    We consider discounted-reward semi-Markov decision processes with a denumerable state space that depend on unknown parameters. Given that the true parameter value is unknown, we address two problems: (I) give an iterative scheme to determine the maximal total discounted reward, and (II) find an asymptotically discount-optimal (adaptive) policy. Our solutions are inspired by the nonstationary value iteration (NVI) scheme of A. Federgruen and P. J. Schweitzer [J. Optimization Theory Appl. 34, 207-241 (1981; Zbl 0426.90091)], combined with the ideas of M. Schäl [in: Optimization, theory and algorithms, Conf. Confolant/France 1981, Lect. Notes Pure Appl. Math. 86, 239-253 (1983; Zbl 0525.93071)] concerning the "principle of estimation and control" for the adaptive control of semi-Markov processes.
    discounted-reward
    denumerable state space
    semi-Markov decision processes
    unknown parameters
    nonstationary value iteration
    principle of estimation and control
    adaptive control of semi-Markov processes

    Identifiers