The double kernel method in density estimation (Q914283)

From MaRDI portal
Revision as of 02:36, 5 March 2024 by Import240304020342 (talk | contribs) (Set profile property.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
scientific article
Language Label Description Also known as
English
The double kernel method in density estimation
scientific article

    Statements

    The double kernel method in density estimation (English)
    0 references
    1989
    0 references
    The author considers kernel estimators \[ \hat f_ n(x,h)=(nh)^{- 1}\sum^{n}_{j=1}K((x-X_ j)/h), \] based on i.i.d. samples \((X_ 1,...,X_ n)\) with density f on \({\mathbb{R}}\). The major problem in this context is to find a good smoothing parameter h. Various data-driven bandwidth choices have been suggested, mainly based on \(L_ 2\)-error criteria. This paper is concerned with \(L_ 1\)-convergence and suitable bandwidth choices H. The aim is to find asymptotically optimal procedures, which means \[ E(\int | \hat f_ n(x,H)-f(x)| dx)/\inf_{h>0}E(\int | \hat f_ n(x,h)-f(x)| dx)\to 1,\quad as\quad n\to \infty, \] in analogy to the result of \textit{C. J. Stone}, Ann. Stat. 12, 1285-1297 (1984; Zbl 0599.62052) for the \(L_ 2\)-error based on a CV-bandwidth. Cross validation is not useful in the \(L_ 1\)- case, so the author suggests the following method: Use a second kernel estimator \(\tilde f\) based on a different kernel L. The Fourier transforms of K and L should be different near 0 (e.g. K and L are kernels of different order). Then, choose H as the minimizer of \(\int | \hat f_ n(x,h)-\tilde f_ n(x,h)| dx.\) The following results give a flavour of the theorems obtained: 1.) E(\(\int | \hat f_ n(x,H)-f(x)| dx)\to 0\), \(n\to \infty\), for all densities f on \({\mathbb{R}}.\) 2.) Assume that \(f^{(s-1)}\) is absolutely continuous on \({\mathbb{R}}\) and \(\int \sqrt{f}dx<\infty\), then \[ \limsup_{n\to \infty}\{E \int | \hat f_ n(x,H)-f(x)| dx/\inf_{h>0} E \int | \hat f_ n(x,h)- f(x)| dx\}\leq c(L,K) \] (c-optimality), provided K is a kernel of order s, so that f is in the saturation class of K. 3.) \(H\to 0\), nH\(\to \infty\) and \(\int | \hat f_ n(x,H)-f(x)| dx\to 0\), completely, for an arbitrary density f on \({\mathbb{R}}\).
    0 references
    0 references
    Parzen-Rosenblatt kernel estimate
    0 references
    characteristic functions
    0 references
    density estimation
    0 references
    strong convergence
    0 references
    automatic choice of smoothing factor L1- consistency
    0 references
    kernel estimators
    0 references
    data-driven bandwidth choices
    0 references
    asymptotically optimal procedures
    0 references
    Fourier transforms
    0 references
    0 references