Randomized incomplete \(U\)-statistics in high dimensions (Q2284368)

From MaRDI portal

Jump to:navigation, search

scientific article

Language	Label	Description	Also known as
English	Randomized incomplete \(U\)-statistics in high dimensions	scientific article

Statements

scholarly article

0 references

Randomized incomplete \(U\)-statistics in high dimensions (English)

0 references

0 references

0 references

The Annals of Statistics

0 references

publication date

15 January 2020

0 references

full work available at URL

https://arxiv.org/abs/1712.00771

0 references

https://projecteuclid.org/euclid.aos/1572487388

0 references

The authors consider the problem of statistical inference for the mean vector \(\mathbb{E}h(X_1,\ldots,X_r)\), based on independent and identically distributed data \(X_1,\ldots,X_n\) taking values in a measurable space \((S,\mathcal{S})\), and where \(h:S^r\mapsto\mathbb{R}^d\) is a fixed, symmetric function. Their aim is to develop tools for the setting where \(d\) is possibly much larger than \(n\), but where \(n\) is also large. In this setting, the commonly used \(U\)-statistic \[ \frac{1}{|I_{n,r}|}\sum_{(i_1,\ldots,i_r)\in I_{n,r}}h(X_{i_1},\ldots,X_{i_r})\,, \] where \(I_{n,r}\) is the set of all \(r\)-tuples in \(\{1,\ldots,n\}\), suffers from problems of computational scalability. As a solution to this, the authors propose two randomized incomplete \(U\)-statistics, where the average is taken over only a randomly chosen subset of \(I_{n,r}\), rather than over all elements of this set. The first of these uses Bernoulli sampling (or, equivalently, sampling without replacement), and the second uses sampling with replacement. Under some assumptions (such as some boundedness assumptions), Gaussian approximation results are established for these randomized incomplete \(U\)-statistics, with an explicit rate of convergence, in both the nondegenerate and degenerate cases. Since the limiting Gaussian distribution here has a covariance matrix depending on the unknown underlying distribution, fully data-dependent bootstrap techniques are developed which make these results applicable. The paper concludes with a simulation study investigating this framework in the setting of testing for pairwise independence of elements of a high-dimensional random vector using several well-known statistics from the literature.

0 references

0 references

zbMATH Keywords

incomplete \(U\)-statistics

0 references

randomized inference

0 references

Gaussian approximation

0 references

bootstrap

0 references

divide and conquer

0 references

Bernoulli sampling

0 references

sampling with replacement

0 references

MaRDI profile type

MaRDI publication profile

0 references

On the bootstrap of \(U\) and \(V\) statistics

0 references

A consistent test of independence based on a sign covariance related to Kendall's tau

0 references

Incomplete Generalized <i>U</i>‐Statistics for Food Risk Assessment

0 references

Some asymptotic theory for the bootstrap

0 references

Some properties of incomplete <i>U</i>-statistics

0 references

0 references

Reduced U-statistics and the Hodges-Lehmann estimator

0 references

Gaussian and bootstrap approximations for high-dimensional U-statistics and their applications

0 references

Jackknife multiplier bootstrap: finite sample approximations to the \(U\)-process supremum with applications

0 references

Randomized incomplete \(U\)-statistics in high dimensions

0 references

Gaussian approximations and multiplier bootstrap for maxima of sums of high-dimensional random vectors

0 references

Central limit theorems and bootstrap in high dimensions

0 references

0 references

Random quadratic forms and the bootstrap for \(U\)-statistics

0 references

Pairwise Independence of Jointly Dependent Variables

0 references

Distribution-free tests of independence in high dimensions

0 references

A Class of Statistics with Asymptotically Normal Distribution

0 references

A Non-Parametric Test of Independence

0 references

On weighted \(U\)-statistics for stationary processes.

0 references

Consistency of the generalized bootstrap for degenerate \(U\)-statistics

0 references

Generalized bootstrap for studentized U-statistics: A rank statistic approach

0 references

The asymptotic distributions of incomplete U-statistics

0 references

A Scalable Bootstrap for Massive Data

0 references

Testing independence in high dimensions with sums of rank correlations

0 references

Asymptotic distributions for weighted \(U\)-statistics

0 references

0 references

Large-Sample Theory for the Bergsma-Dassios Sign Covariance

0 references

Asymptotic distributions of weighted \(U\)-statistics of degree 2

0 references

On the asymptotic behavior of weighted \(U\)-statistics

0 references

Asymptotic distribution of symmetric statistics

0 references

Asymptotic normality of permutation statistics derived from weighted sums of bivariate functions

0 references

Measuring and testing dependence by correlation of distances

0 references

Asymptotic Statistics

0 references

Weak convergence and empirical processes. With applications to statistics

0 references

Weighted bootstrap for \(U\)-statistics

0 references

Testing Mutual Independence in High Dimension via Distance Covariance

0 references

Divide and Conquer Kernel Ridge Regression: A Distributed Algorithm with Minimax Optimal Rates

0 references

Identifiers

zbMATH Open document ID

0 references

10.1214/18-AOS1773

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

zbMATH DE Number

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:2284368

Retrieved from "https://portal.mardi4nfdi.de/w/index.php?title=Item:Q2284368&oldid=36551598"