Dimension-agnostic inference using cross U-statistics
From MaRDI portal
Publication:6178581
DOI10.3150/23-BEJ1613arXiv2011.05068OpenAlexW4388506996MaRDI QIDQ6178581FDOQ6178581
Authors: Ilmun Kim, Aaditya Ramdas
Publication date: 16 January 2024
Published in: Bernoulli (Search for Journal in Brave)
Abstract: Classical asymptotic theory for statistical inference usually involves calibrating a statistic by fixing the dimension while letting the sample size increase to infinity. Recently, much effort has been dedicated towards understanding how these methods behave in high-dimensional settings, where and both increase to infinity together. This often leads to different inference procedures, depending on the assumptions about the dimensionality, leaving the practitioner in a bind: given a dataset with 100 samples in 20 dimensions, should they calibrate by assuming , or ? This paper considers the goal of dimension-agnostic inference; developing methods whose validity does not depend on any assumption on versus . We introduce an approach that uses variational representations of existing test statistics along with sample splitting and self-normalization to produce a refined test statistic with a Gaussian limiting distribution, regardless of how scales with . The resulting statistic can be viewed as a careful modification of degenerate U-statistics, dropping diagonal blocks and retaining off-diagonal blocks. We exemplify our technique for some classical problems including one-sample mean and covariance testing, and show that our tests have minimax rate-optimal power against appropriate local alternatives. In most settings, our cross U-statistic matches the high-dimensional power of the corresponding (degenerate) U-statistic up to a factor.
Full work available at URL: https://arxiv.org/abs/2011.05068
Cites Work
- Central limit theorems and bootstrap in high dimensions
- Title not available (Why is that?)
- Estimation and Inference of Heterogeneous Treatment Effects using Random Forests
- Conditional Distance Correlation
- Testing Statistical Hypotheses
- Bootstrapping and sample splitting for high-dimensional, assumption-lean inference
- Optimal hypothesis testing for high dimensional covariance matrices
- A note on testing the covariance matrix for large dimension
- Quantifying uncertainty in random forests via confidence intervals and hypothesis tests
- Title not available (Why is that?)
- High-dimensional probability. An introduction with applications in data science
- A kernel two-sample test
- The Large-Sample Distribution of the Likelihood Ratio for Testing Composite Hypotheses
- A new test for multivariate normality
- Central limit theorem for integrated square error of multivariate nonparametric density estimators
- Distribution-free predictive inference for regression
- On the power of conditional independence testing under model-X
- Asymptotic behavior of M-estimators of p regression parameters when \(p^ 2/n\) is large. I. Consistency
- Nonparametric goodness-of-fit testing under Gaussian models
- Non-asymptotic minimax rates of testing in signal detection
- A two-sample test for high-dimensional data with applications to gene-set testing
- A one-sample test for normality with kernel methods
- Title not available (Why is that?)
- Two-Sample Test of High Dimensional Means Under Dependence
- Tests for high-dimensional covariance matrices
- Distribution-Free Consistent Independence Tests via Center-Outward Ranks and Signs
- A test for the mean vector with fewer observations than the dimension
- Exact and Approximate Stepdown Methods for Multiple Hypothesis Testing
- The Berry-Esseen bound for Student's statistic
- On some test criteria for covariance matrix
- Student's t-Test Under Symmetry Conditions
- Central limit theorems for classical likelihood ratio tests for high-dimensional normal distributions
- Interaction screening for ultrahigh-dimensional data
- A note on data-splitting for the evaluation of significance levels
- Asymptotic behavior of M estimators of p regression parameters when \(p^ 2/n\) is large. II: Normal approximation
- Tests for high-dimensional regression coefficients with factorial designs
- Split Sample Methods for Constructing Confidence Intervals for Binomial and Poisson Parameters
- Asymptotic normality of quadratic estimators
- Modification of some goodness-of-fit statistics to yield asymptotically normal null distributions
- Multivariate Rank-Based Distribution-Free Nonparametric Testing Using Measure Transportation
- Some properties of incomplete U-statistics
- Estimation of integrated squared density derivatives
- Distribution and quantile functions, ranks and signs in dimension \(d\): a measure transportation approach
- Monge-Kantorovich depth, quantiles, ranks and signs
- A review of 20 years of naive tests of significance for high-dimensional mean vectors and covariance matrices
- Conditional mean and quantile dependence testing in high dimension
- Minimax Euclidean separation rates for testing convex hypotheses in \(\mathbb{R}^{d}\)
- Exact bounds on the closeness between the Student and standard normal distributions
- Can we trust the bootstrap in high-dimensions? The case of linear models
- Asymptotic normality of a consistent estimator of maximum mean discrepancy in Hilbert space
- The Holdout Randomization Test for Feature Selection in Black Box Models
- Goodness-of-fit Testing in High Dimensional Generalized Linear Models
- Robust multivariate nonparametric tests via projection averaging
- A modern maximum-likelihood theory for high-dimensional logistic regression
- Universal inference
- Multinomial goodness-of-fit based on \(U\)-statistics: high-dimensional asymptotic and minimax optimality
- Classification accuracy as a proxy for two-sample testing
- Minimax optimality of permutation tests
- A feasible high dimensional randomization test for the mean vector
- Gaussian universal likelihood ratio testing
Cited In (1)
This page was built for publication: Dimension-agnostic inference using cross U-statistics
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6178581)