Optimal, recursive procedures of identification (Q1105556)

From MaRDI portal
Revision as of 18:03, 18 June 2024 by ReferenceBot (talk | contribs) (‎Changed an Item)
scientific article
Language: English
Label: Optimal, recursive procedures of identification
Description: scientific article

    Statements

    Optimal, recursive procedures of identification (English)
    1988
    The paper concerns the parameter estimation of a stochastic process. We consider recursive estimating procedures based on the idea of stochastic approximation: (1) \(p_{s+1}=p_s-\mu_s F_{s+1}(Y_{s+1},p_s)\), where \(\{\mu_s\}\) is a sequence of positive [...] to the other ones when \(\beta\uparrow 1\). Recently a variant of the discounted multi-armed bandit problem has been described by \textit{P. P. Varaiya, J. C. Walrand} and \textit{C. Buyukkoc} [IEEE Trans. Autom. Control 30, 426-439 (1985; Zbl 0566.90096)]. In this variant the k machines are represented by independent Markov chains. It is motivated by priority scheduling in networks. The relation of this variant to the third formulation given above is described.
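
    As a minimal illustration of recursion (1), the sketch below runs the update with two choices that are assumptions made here and are not taken from the paper: the function F(y, p) = p - y and the step size mu_s = 1/(s+1). With these choices the recursion reduces to the running sample mean, which makes the behaviour easy to check. The name `recursive_estimate` is hypothetical, and this is not the optimal procedure studied in the paper.

```python
import random


def recursive_estimate(observations, p0=0.0):
    """Iterate p_{s+1} = p_s - mu_s * F(Y_{s+1}, p_s) over the observations.

    Illustrative assumptions (not from the paper):
      F(y, p) = p - y      -> the root of E[F] is the process mean
      mu_s    = 1 / (s+1)  -> positive, decreasing step sizes
    """
    p = p0
    for s, y in enumerate(observations):
        mu = 1.0 / (s + 1)      # assumed step-size sequence
        p = p - mu * (p - y)    # one step of recursion (1)
    return p


# Synthetic data: noisy observations around an assumed true parameter 2.0.
random.seed(0)
data = [2.0 + random.gauss(0.0, 1.0) for _ in range(1000)]

estimate = recursive_estimate(data)
sample_mean = sum(data) / len(data)
print(estimate, sample_mean)
```

    Because mu_0 = 1, the first step discards the starting value p0, so under these assumed choices the estimate agrees with the ordinary sample mean up to floating-point rounding.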
    parameter estimation
    stochastic approximation