Adaptive policies for time-varying stochastic systems under discounted criterion (Q1397033): Difference between revisions

From MaRDI portal
RedirectionBot (talk | contribs)
Removed claim: reviewed by (P1447): Item:Q174462
RedirectionBot (talk | contribs)
Changed an Item
Property / reviewed by
 
Property / reviewed by: H. Pragarauskas / rank
 
Normal rank

Revision as of 07:32, 10 February 2024

scientific article
Language Label Description Also known as
English
Adaptive policies for time-varying stochastic systems under discounted criterion
scientific article

    Statements

    Adaptive policies for time-varying stochastic systems under discounted criterion (English)
    0 references
    0 references
    0 references
    16 July 2003
    0 references
    The authors consider a discrete-time controlled Markov system whose evolution is described by the equation \(x_{n+1}= G_n(x_n, a_n,\xi_n)\), \(n= 0,1,\dots\), where the system states \(x_n\) and controls \(a_n\) are elements of Borel spaces and \(\{\xi_n\}\) is a sequence of observable i.i.d. random vectors with unknown distribution. Assuming the convergence of \(G_n\) and estimating the unknown distribution density of \(\xi_n\), an asymptotically optimal control policy for the limit control system is constructed.
    0 references
    0 references
    0 references
    0 references
    0 references
    non-homogeneous Markov control processes
    0 references
    discrete-time stochastic systems
    0 references
    discounted cost criterion
    0 references
    optimal adaptive policy
    0 references