A partially linear framework for massive heterogeneous data (Q309709)
From MaRDI portal
scientific article
Language | Label | Description | Also known as |
---|---|---|---|
English | A partially linear framework for massive heterogeneous data | scientific article | |
Statements
A partially linear framework for massive heterogeneous data (English)
7 September 2016
The paper under review develops a partially linear framework for modeling massive heterogeneous data, with the aim of extracting features common to all subpopulations while exploring the heterogeneity of each one. The authors propose an aggregation-type estimator for the commonality parameter that attains the same minimax optimal bound and asymptotic distribution as in the case without heterogeneity, provided that the number of subpopulations does not grow too fast. They then give a plug-in estimator for the heterogeneity parameter, which has the same asymptotic distribution as in the case where the commonality information is available. Heterogeneity among a large number of subpopulations is tested using approximation results from V. Chernozhukov et al. [Ann. Stat. 41, No. 6, 2786–2819 (2013; Zbl 1292.62030)]. Finally, a "divide-and-conquer" method based on these results is applied to a subpopulation whose sample size is too large to be processed on a single computer.
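To make the aggregation and plug-in steps concrete, here is a minimal NumPy sketch. It is an illustration under stated assumptions, not the authors' implementation. It assumes each subpopulation j follows y = x'beta_j + f(z) + noise, with f the common component and beta_j the heterogeneous one. It also assumes a Gaussian kernel and one fixed ridge parameter `lam`. All function names, the simulated data, and the tuning values are hypothetical.

```python
import numpy as np


def gaussian_kernel(a, b, bandwidth=0.2):
    """Gaussian kernel matrix between 1-D point sets a and b (assumed kernel)."""
    diff = a[:, None] - b[None, :]
    return np.exp(-diff ** 2 / (2.0 * bandwidth ** 2))


def fit_subpopulation(x, z, y, lam):
    """Partially linear kernel ridge fit on a single subpopulation.

    Minimises (1/n) * ||y - x @ beta - K @ alpha||^2 + lam * alpha' K alpha.
    """
    n = len(y)
    K = gaussian_kernel(z, z)
    inv = np.linalg.inv(K + n * lam * np.eye(n))
    residual_maker = n * lam * inv  # equals I - S, with S the ridge smoother
    beta = np.linalg.solve(x.T @ residual_maker @ x, x.T @ residual_maker @ y)
    alpha = inv @ (y - x @ beta)
    return beta, alpha


def aggregate_f(fits, points):
    """Aggregation step: average the per-subpopulation estimates of f."""
    preds = [gaussian_kernel(points, z) @ alpha for z, alpha in fits]
    return np.mean(preds, axis=0)


def plug_in_beta(x, z, y, fits):
    """Plug-in step: least squares for beta after subtracting the averaged f."""
    f_bar = aggregate_f(fits, z)
    beta, *_ = np.linalg.lstsq(x, y - f_bar, rcond=None)
    return beta


def run_demo(s=10, n=200, lam=1e-4, noise=0.2, seed=0):
    """Simulate s subpopulations and return (RMSE of f, max error in beta)."""
    rng = np.random.default_rng(seed)

    def f_true(t):
        return np.sin(2.0 * np.pi * t)  # hypothetical common component

    data, betas = [], []
    for j in range(s):
        beta_j = np.array([1.0 + 0.5 * j, -1.0])  # hypothetical heterogeneity
        x = rng.normal(size=(n, 2))
        z = rng.uniform(size=n)
        y = x @ beta_j + f_true(z) + noise * rng.normal(size=n)
        data.append((x, z, y))
        betas.append(beta_j)

    # Each subpopulation is fitted separately; only (z, alpha) is kept.
    fits = [(z, fit_subpopulation(x, z, y, lam)[1]) for x, z, y in data]

    grid = np.linspace(0.05, 0.95, 50)
    f_rmse = np.sqrt(np.mean((aggregate_f(fits, grid) - f_true(grid)) ** 2))
    beta_err = max(
        np.max(np.abs(plug_in_beta(x, z, y, fits) - b))
        for (x, z, y), b in zip(data, betas)
    )
    return f_rmse, beta_err
```

Calling `run_demo()` returns the root-mean-square error of the averaged estimate of f on an interior grid, together with the largest absolute error among the plug-in estimates of beta_j. Each subpopulation is fitted independently, so the same pattern could be applied to random splits of a single very large subpopulation, in the spirit of the divide-and-conquer step mentioned in the review.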
heterogeneous data
kernel ridge regression
partially linear model
divide-and-conquer method
massive data