Adaptive control of discounted Markov decision chains (Q796461): Difference between revisions

From MaRDI portal
Import240304020342 (talk | contribs)
Set profile property.
ReferenceBot (talk | contribs)
Changed an Item
Property / cites work
 
Property / cites work: Nonstationary Markov decision problems with converging parameters / rank
 
Normal rank
Property / cites work
 
Property / cites work: Dynamic programming and stochastic control / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5615108 / rank
 
Normal rank
Property / cites work
 
Property / cites work: The average-optimal adaptive control of a Markov renewal model in presence of an unknown parameter / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5599448 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q4150452 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q5649557 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Estimation and control in Markov chains / rank
 
Normal rank
Property / cites work
 
Property / cites work: Strongly consistent estimation in a controlled Markov renewal model / rank
 
Normal rank
Property / cites work
 
Property / cites work: Adaptive control of service in queueing systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Optimal adaptive control of priority assignment in queueing systems / rank
 
Normal rank
Property / cites work
 
Property / cites work: Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3881672 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Q3313754 / rank
 
Normal rank
Property / cites work
 
Property / cites work: Convergence analysis of parametric identification methods / rank
 
Normal rank

Revision as of 12:28, 14 June 2024

scientific article
Language Label Description Also known as
English
Adaptive control of discounted Markov decision chains
scientific article

    Statements

    Adaptive control of discounted Markov decision chains (English)
    0 references
    1985
    0 references
    We consider discounted-reward finite-state Markov decision processes which depend on unknown parameters. An adaptive policy inspired by the nonstationary value iteration scheme of \textit{A. Federgruen} and \textit{P. J. Schweitzer} [ibid. 34, 207-241 (1981; Zbl 0426.90091)] is proposed. This policy is briefly compared with the principle of estimation and control recently obtained by \textit{M. Schäl} [Lect. Notes Pure Appl. Math. 86, 239-253 (1983; Zbl 0525.93071)].
    0 references
    discounted-reward finite-state Markov decision processes
    0 references
    adaptive policy
    0 references
    nonstationary value iteration
    0 references

    Identifiers