Optimal bandwidth selection in nonparametric regression function estimation (Q1077110)

From MaRDI portal
scientific article

    Statements

    Optimal bandwidth selection in nonparametric regression function estimation (English)
    1985
    Let \((X,Y),(X_1,Y_1),(X_2,Y_2),\ldots\) be i.i.d. \((d+1)\)-dimensional random vectors with \(Y\) real-valued. Consider the problem of estimating the regression function \[ m(x)=E(Y\mid X=x) \] using \((X_1,Y_1),\ldots,(X_n,Y_n)\). In this paper, kernel estimators with a data-driven bandwidth are investigated. The bandwidth selection rule \(\hat h\) is to choose \(h\) to minimize \[ CV(h)=n^{-1}\sum_{j=1}^{n}(Y_j-\hat m_j(X_j))^2\,W(X_j), \] where \[ \hat m_j(x)=(n-1)^{-1}\sum_{i\neq j}h^{-d}K((x-X_i)/h)\,Y_i/\hat f_j(x),\quad \hat f_j(x)=(n-1)^{-1}\sum_{i\neq j}h^{-d}K((x-X_i)/h), \] and \(W(x)\) is a weight function. The authors establish asymptotic optimality for this bandwidth selection rule, which can be interpreted in terms of cross validation. They settle an open problem of \textit{C. J. Stone} [Ann. Stat. 10, 1040-1053 (1982; Zbl 0511.62048)] regarding the optimal rate uniformly over smoothness classes, and show that these selection rules are important in exploratory data analysis.
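The cross-validation rule in the review can be illustrated with a minimal sketch. It is not the authors' code. It assumes \(d=1\), a Gaussian kernel \(K\), an indicator weight function \(W\) on an interior interval, and a search over a finite grid of candidate bandwidths; the function names and the synthetic data are hypothetical and chosen only for illustration.

```python
import math
import random


def gaussian_kernel(u):
    """Standard normal density, used as the kernel K (an assumption)."""
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)


def loo_estimate(xs, ys, j, h):
    """Leave-one-out kernel estimate m_hat_j(X_j) for d = 1.

    The common factor (n-1)^{-1} h^{-d} cancels between the numerator
    and the density estimate f_hat_j, so it is omitted here.
    Returns None if the density estimate is numerically zero.
    """
    num = 0.0
    den = 0.0
    for i, (x_i, y_i) in enumerate(zip(xs, ys)):
        if i == j:
            continue  # leave observation j out
        k = gaussian_kernel((xs[j] - x_i) / h)
        num += k * y_i
        den += k
    if den <= 0.0:
        return None
    return num / den


def cv_score(xs, ys, h, weight):
    """CV(h) = n^{-1} * sum_j (Y_j - m_hat_j(X_j))^2 * W(X_j)."""
    n = len(xs)
    total = 0.0
    for j in range(n):
        w = weight(xs[j])
        if w == 0.0:
            continue  # points with zero weight do not contribute
        m = loo_estimate(xs, ys, j, h)
        if m is None:
            return math.inf  # bandwidth too small to be usable
        total += (ys[j] - m) ** 2 * w
    return total / n


def select_bandwidth(xs, ys, grid, weight):
    """Return the grid value minimizing CV(h) (grid search is an assumption)."""
    return min(grid, key=lambda h: cv_score(xs, ys, h, weight))


def interior_weight(x):
    """Hypothetical weight W: indicator of [0.1, 0.9], to downweight boundaries."""
    return 1.0 if 0.1 <= x <= 0.9 else 0.0


def make_demo_data(n=200, seed=0):
    """Synthetic data: Y = sin(2*pi*X) + noise, X uniform on [0, 1]."""
    rng = random.Random(seed)
    xs = [rng.random() for _ in range(n)]
    ys = [math.sin(2.0 * math.pi * x) + rng.gauss(0.0, 0.3) for x in xs]
    return xs, ys


if __name__ == "__main__":
    xs, ys = make_demo_data()
    grid = [0.01, 0.02, 0.05, 0.1, 0.2, 0.5]
    h_hat = select_bandwidth(xs, ys, grid, interior_weight)
    print("selected bandwidth:", h_hat)
```

In practice the minimization would typically run over a finer grid or a continuous range of \(h\); the coarse grid here only keeps the example short.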
    optimal bandwidth selection
    nonparametric regression function
    estimation
    Rosenblatt-Parzen kernel density estimator
    kernel estimators with a data-driven bandwidth
    bandwidth selection rule
    asymptotic optimality
    cross validation
    exploratory data analysis