Optimal bandwidth selection in nonparametric regression function estimation (Q1077110)
From MaRDI portal
scientific article
Statements
Optimal bandwidth selection in nonparametric regression function estimation (English)
1985
Let \((X,Y),(X_1,Y_1),(X_2,Y_2),\ldots\) be i.i.d. \((d+1)\)-dimensional random vectors with \(Y\) real-valued. Consider the problem of estimating the regression function \[ m(x)=E(Y\mid X=x) \] from \((X_1,Y_1),\ldots,(X_n,Y_n)\). In this paper, kernel estimators with a data-driven bandwidth are investigated. The bandwidth selection rule \(\hat h\) chooses \(h\) to minimize \[ CV(h)=n^{-1}\sum_{j=1}^{n}(Y_j-\hat m_j(X_j))^2\,W(X_j), \] where \[ \hat m_j(x)=(n-1)^{-1}\sum_{i\neq j}h^{-d}K((x-X_i)/h)\,Y_i/\hat f_j(x),\quad \hat f_j(x)=(n-1)^{-1}\sum_{i\neq j}h^{-d}K((x-X_i)/h), \] \(\hat f_j\) is the leave-one-out Rosenblatt-Parzen kernel density estimator, \(K\) is a kernel function, and \(W(x)\) is a weight function. The authors establish asymptotic optimality for this bandwidth selection rule, which can be interpreted in terms of cross validation. They settle an open problem of C. J. Stone [Ann. Stat. 10, 1040-1053 (1982; Zbl 0511.62048)] regarding the optimal rate uniformly over smoothness classes and show that these selection rules are important in exploratory data analysis.
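The cross-validation criterion described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the product Gaussian kernel, the grid search over candidate bandwidths, the default weight \(W\equiv 1\), and the function names `gaussian_kernel`, `cv_score` and `select_bandwidth` are all assumptions made for illustration.

```python
import numpy as np


def gaussian_kernel(u):
    """Product Gaussian kernel K(u); u has shape (..., d). Assumed kernel choice."""
    d = u.shape[-1]
    return np.exp(-0.5 * np.sum(u ** 2, axis=-1)) / (2.0 * np.pi) ** (d / 2.0)


def cv_score(X, Y, h, weight=None):
    """Leave-one-out criterion CV(h) for the kernel regression estimator.

    X: array of shape (n, d), Y: array of shape (n,), h: positive bandwidth.
    weight: optional callable returning W(X_j) for each row; defaults to W = 1.
    """
    n, d = X.shape
    w = np.ones(n) if weight is None else weight(X)
    # Entry [j, i] holds h^{-d} K((X_j - X_i) / h).
    diffs = (X[:, None, :] - X[None, :, :]) / h
    kmat = gaussian_kernel(diffs) / h ** d
    np.fill_diagonal(kmat, 0.0)  # drop the term i = j (leave-one-out)
    num = kmat @ Y / (n - 1)          # numerator of m_j(X_j)
    den = kmat.sum(axis=1) / (n - 1)  # density estimate f_j(X_j)
    # Where the density estimate is zero, fall back to 0 instead of dividing.
    m_loo = np.divide(num, den, out=np.zeros(n), where=den > 0)
    return np.mean((Y - m_loo) ** 2 * w)


def select_bandwidth(X, Y, grid, weight=None):
    """Return the grid value minimizing CV(h), plus all CV scores."""
    scores = np.array([cv_score(X, Y, h, weight) for h in grid])
    return grid[int(np.argmin(scores))], scores
```

A call such as `select_bandwidth(X, Y, np.geomspace(0.01, 1.0, 20))` returns the selected bandwidth together with the CV curve. The pairwise-difference array needs \(O(n^2 d)\) memory, so this form is only suitable for moderate sample sizes.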
optimal bandwidth selection
nonparametric regression function
estimation
Rosenblatt-Parzen kernel density estimator
kernel estimators with a data-driven bandwidth
bandwidth selection rule
asymptotic optimality
cross validation
exploratory data analysis