Dimension-agnostic inference using cross U-statistics
Abstract: Classical asymptotic theory for statistical inference usually involves calibrating a statistic by fixing the dimension \(d\) while letting the sample size \(n\) increase to infinity. Recently, much effort has been dedicated towards understanding how these methods behave in high-dimensional settings, where \(d\) and \(n\) both increase to infinity together. This often leads to different inference procedures, depending on the assumptions about the dimensionality, leaving the practitioner in a bind: given a dataset with 100 samples in 20 dimensions, should they calibrate by assuming \(n \gg d\), or \(d/n \approx 0.2\)? This paper considers the goal of dimension-agnostic inference: developing methods whose validity does not depend on any assumption on \(d\) versus \(n\). We introduce an approach that uses variational representations of existing test statistics along with sample splitting and self-normalization to produce a refined test statistic with a Gaussian limiting distribution, regardless of how \(d\) scales with \(n\). The resulting statistic can be viewed as a careful modification of degenerate U-statistics, dropping diagonal blocks and retaining off-diagonal blocks. We exemplify our technique for some classical problems, including one-sample mean and covariance testing, and show that our tests have minimax rate-optimal power against appropriate local alternatives. In most settings, our cross U-statistic matches the high-dimensional power of the corresponding (degenerate) U-statistic up to a \(\sqrt{2}\) factor.
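To make the recipe in the abstract concrete, here is a minimal sketch of the one-sample mean-testing instance: split the sample, project the first half onto the mean direction estimated from the second half, and studentize. The function name cross_mean_test, the even 50/50 split, and the one-sided rejection rule are illustrative assumptions, not the authors' exact implementation.

```python
# Sketch of a cross U-statistic for H0: E[X] = 0, assuming an even
# sample split and a one-sided rejection rule (illustrative choices).
import numpy as np
from scipy.stats import norm

def cross_mean_test(X, alpha=0.05, seed=0):
    """Dimension-agnostic test of H0: E[X] = 0.

    X : (n, d) array of i.i.d. observations.
    Returns (statistic, p_value, reject).
    """
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    idx = rng.permutation(n)
    half = n // 2
    X1, X2 = X[idx[:half]], X[idx[half:]]

    # Project the first half onto the direction estimated from the
    # second half: this retains the cross (off-diagonal) blocks of the
    # degenerate U-statistic while dropping the diagonal blocks.
    direction = X2.mean(axis=0)
    f = X1 @ direction

    # Self-normalize: the studentized mean of the projections is
    # asymptotically N(0, 1) regardless of how d scales with n.
    T = np.sqrt(half) * f.mean() / f.std(ddof=1)
    p_value = norm.sf(T)  # one-sided: the alternative pushes T upward
    return T, p_value, p_value < alpha

# Example: null data in the regime from the abstract (n = 100, d = 20).
if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.standard_normal((100, 20))
    print(cross_mean_test(X))
```

Under the alternative, the projections have expectation \(\|\mu\|^2 \ge 0\), which is why a one-sided rejection region is the natural choice in this sketch.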
Cites work
- Scientific article; zbMATH DE number 3551712 (title unavailable)
- Scientific article; zbMATH DE number 1964693 (title unavailable)
- Scientific article; zbMATH DE number 889593 (title unavailable)
- A feasible high dimensional randomization test for the mean vector
- A kernel two-sample test
- A modern maximum-likelihood theory for high-dimensional logistic regression
- A new test for multivariate normality
- A note on data-splitting for the evaluation of significance levels
- A note on testing the covariance matrix for large dimension
- A one-sample test for normality with kernel methods
- A review of 20 years of naive tests of significance for high-dimensional mean vectors and covariance matrices
- A test for the mean vector with fewer observations than the dimension
- A two-sample test for high-dimensional data with applications to gene-set testing
- Asymptotic behavior of M-estimators of \(p\) regression parameters when \(p^2/n\) is large. I: Consistency
- Asymptotic behavior of M-estimators of \(p\) regression parameters when \(p^2/n\) is large. II: Normal approximation
- Asymptotic normality of a consistent estimator of maximum mean discrepancy in Hilbert space
- Asymptotic normality of quadratic estimators
- Bootstrapping and sample splitting for high-dimensional, assumption-lean inference
- Can we trust the bootstrap in high-dimensions? The case of linear models
- Central limit theorem for integrated square error of multivariate nonparametric density estimators
- Central limit theorems and bootstrap in high dimensions
- Central limit theorems for classical likelihood ratio tests for high-dimensional normal distributions
- Classification accuracy as a proxy for two-sample testing
- Conditional Distance Correlation
- Conditional mean and quantile dependence testing in high dimension
- Distribution and quantile functions, ranks and signs in dimension \(d\): a measure transportation approach
- Distribution-Free Consistent Independence Tests via Center-Outward Ranks and Signs
- Distribution-free predictive inference for regression
- Estimation and Inference of Heterogeneous Treatment Effects using Random Forests
- Estimation of integrated squared density derivatives
- Exact and Approximate Stepdown Methods for Multiple Hypothesis Testing
- Exact bounds on the closeness between the Student and standard normal distributions
- Gaussian universal likelihood ratio testing
- Goodness-of-fit Testing in High Dimensional Generalized Linear Models
- High-dimensional probability. An introduction with applications in data science
- Interaction screening for ultrahigh-dimensional data
- Minimax Euclidean separation rates for testing convex hypotheses in \(\mathbb{R}^{d}\)
- Minimax optimality of permutation tests
- Modification of some goodness-of-fit statistics to yield asymptotically normal null distributions
- Monge-Kantorovich depth, quantiles, ranks and signs
- Multinomial goodness-of-fit based on \(U\)-statistics: high-dimensional asymptotic and minimax optimality
- Multivariate Rank-Based Distribution-Free Nonparametric Testing Using Measure Transportation
- Non-asymptotic minimax rates of testing in signal detection
- Nonparametric goodness-of-fit testing under Gaussian models
- On some test criteria for covariance matrix
- On the power of conditional independence testing under model-X
- Optimal hypothesis testing for high dimensional covariance matrices
- Quantifying uncertainty in random forests via confidence intervals and hypothesis tests
- Robust multivariate nonparametric tests via projection averaging
- Some properties of incomplete U-statistics
- Split sample methods for constructing confidence intervals for binomial and Poisson parameters
- Student's t-Test Under Symmetry Conditions
- Testing Statistical Hypotheses
- Tests for high-dimensional covariance matrices
- Tests for high-dimensional regression coefficients with factorial designs
- The Berry-Esseen bound for Student's statistic
- The Holdout Randomization Test for Feature Selection in Black Box Models
- The Large-Sample Distribution of the Likelihood Ratio for Testing Composite Hypotheses
- Two-Sample Test of High Dimensional Means Under Dependence
- Universal inference