Adaptive control of a partially observed discrete time Markov process (Q1384646): Difference between revisions

From MaRDI portal
Importer (talk | contribs)
Created a new Item
 
Set OpenAlex properties.
 
(4 intermediate revisions by 3 users not shown)
Property / author
 
Property / author: Tyrone E. Duncan / rank
Normal rank
 
Property / author
 
Property / author: Bozenna Pasik-Duncan / rank
Normal rank
 
Property / author
 
Property / author: Łukasz Stettner / rank
Normal rank
 
Property / reviewed by
 
Property / reviewed by: Yuliya S. Mishura / rank
Normal rank
 
Property / author
 
Property / author: Tyrone E. Duncan / rank
 
Normal rank
Property / author
 
Property / author: Bozenna Pasik-Duncan / rank
 
Normal rank
Property / author
 
Property / author: Łukasz Stettner / rank
 
Normal rank
Property / reviewed by
 
Property / reviewed by: Yuliya S. Mishura / rank
 
Normal rank
Property / MaRDI profile type
 
Property / MaRDI profile type: MaRDI publication profile / rank
 
Normal rank
Property / full work available at URL
 
Property / full work available at URL: https://doi.org/10.1007/s002459900077 / rank
 
Normal rank
Property / OpenAlex ID
 
Property / OpenAlex ID: W1991950454 / rank
 
Normal rank
links / mardi / namelinks / mardi / name
 

Latest revision as of 01:30, 20 March 2024

scientific article
Language Label Description Also known as
English
Adaptive control of a partially observed discrete time Markov process
scientific article

    Statements

    Adaptive control of a partially observed discrete time Markov process (English)
    0 references
    0 references
    20 April 1998
    0 references
    The authors consider the Markov process \((X_n,n\in N)\) on the measurable space \((E,{\mathcal E})\), where \(E\) is either a closed subset of \(R^d\) or \(E\) is a countable set. The transition operator depends on an unknown, fixed parameter \(a^0\in {\mathcal A} \subset R^{K_0}\). The process \((X_n,\;n\in N)\) is completely observed in a fixed recurrent domain \(\Gamma\), and is partially observed in \(\Gamma^C\). An admissible control is adapted to the observations, and the cost is desired to minimize the average cost per unit time. The solution of an adaptive control problem is given by constructing an approximately self-optimal strategy. A different form of partial observations is also considered. Namely, in this case there is no subset of complete observations but there is a sequence of stopping times \((\tau_n,\;n\in N)\) such that the random variables \((X_{\tau_n},\;n\in N)\) are independent and identically distributed.
    0 references
    0 references
    0 references
    0 references
    0 references
    stochastic adaptive control
    0 references
    discrete time Markov processes
    0 references
    approximately self-optimal strategy
    0 references
    partial observations
    0 references
    0 references
    0 references
    0 references
    0 references