Optimal rates for independence testing via $U$-statistic permutation tests (Q130903)

scientific article; zbMATH DE number 7438256

Optimal rates for independence testing via $U$-statistic permutation tests

Language	Label	Description	Also known as
English	Optimal rates for independence testing via $U$-statistic permutation tests	scientific article; zbMATH DE number 7438256	Optimal rates for independence testing via $U$-statistic permutation tests

Statements

instance of

scholarly article

0 references

publication date

15 January 2020

0 references

3 December 2021

0 references

arXiv ID

2001.05513

0 references

arXiv classification

math.ST

0 references

stat.ME

0 references

stat.ML

0 references

stat.TH

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

0 references

10.48550/arXiv.2001.05513

0 references

10.1214/20-AOS2041

0 references

MaRDI profile type

MaRDI publication profile

0 references

title

Optimal rates for independence testing via $U$-statistic permutation tests (English)

0 references

zbMATH Open document ID

1504.62060

0 references

published in

The Annals of Statistics

0 references

full work available at URL

https://arxiv.org/abs/2001.05513

0 references

https://projecteuclid.org/journals/annals-of-statistics/volume-49/issue-5/Optimal-rates-for-independence-testing-via-U-statistic-permutation-tests/10.1214/20-AOS2041.full

0 references

review text

The authors study the problem of independence testing in a general framework, where the data consists of independent copies of a pair $(X,Y)$ taking values in a separable measure space $(\mathcal{X},\mathcal{Y})$, equipped with a $\sigma$-finite measure $\mu$. Assuming that the joint distribution of $(X,Y)$ has a density $f$ with respect to $\mu$, one may define a measure of dependence $D(f)$, given by the squared $L^2(\mu)$ distance between the joint density and the product of its marginal densities. This satisfies the natural requirement that $D(f)=0$ if and only if $X$ and $Y$ are independent. However, Theorem 1 reveals that it is not possible to construct a valid independence test with nontrivial power against all alternatives satisfying a lower bound on $D(f)$. This motivates to introduce classes satisfying an additional Sobolev-type smoothness condition as well as boundedness conditions on the joint and marginal densities. The first main goal of this work is determination of the minimax separation rate of independence testing over these classes, and to this end, a new permutation test of independence based on a $U$-statistic estimator of $D(f)$ is defined. Further, Theorem 2 in Section 3 provides a very general upper bound on the separation rate of independence testing. Note that the framework is broad enough to include both discrete and absolutely continuous data, as well as data that may take values in infinite-dimensional spaces. The authors show how the bound can be simplified in many special cases, and, in Section 4, how to construct adaptive versions of their tests that incur only a small loss in effective sample size. Moreover, in Section 5, matching lower bounds in several instances is provided, allowing to conclude that suggested USP test attains the minimax optimal separation rate for independence testing in such settings. In Section 6, an approximation to the power function of the test at local alternatives is elucidated, thereby providing a very detailed description of its properties. Numerical properties are studied in Section 7. Suggested methodology is implemented in the \texttt{R} package \texttt{USP}.

0 references

zbMATH DE Number

7438256

0 references

zbMATH Keywords

independence testing

0 references

minimax separation rates

0 references

permutation tests

0 references

Stein's method