Multinomial goodness-of-fit based on U-statistics: high-dimensional asymptotic and minimax optimality
Publication:2301048
Abstract: We consider multinomial goodness-of-fit tests in the high-dimensional regime where the number of bins increases with the sample size. In this regime, Pearson's chi-squared test can suffer from low power due to the substantial bias as well as the high variance of its statistic. To resolve these issues, we introduce a family of U-statistics for multinomial goodness-of-fit and study their asymptotic behavior in high dimensions. Specifically, we establish conditions under which the considered U-statistic is asymptotically Poisson or Gaussian, and investigate its power function under each asymptotic regime. Furthermore, we introduce a class of weights for the U-statistic that results in minimax rate-optimal tests.
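The contrast drawn in the abstract can be illustrated with a minimal sketch. It assumes, as one natural choice rather than the paper's exact definition, a U-statistic with kernel h(x, y) = sum_k w_k (1[x=k] - pi_k)(1[y=k] - pi_k), where pi is the null distribution and w_k are hypothetical multiplicative weights. Under this assumption the statistic is an unbiased estimator of sum_k w_k (p_k - pi_k)^2 and can be computed from the bin counts alone. The function names below are illustrative and do not come from the publication.

```python
from itertools import combinations


def pearson_chi2(counts, null_probs):
    """Pearson's chi-squared statistic: sum_k (N_k - n*pi_k)^2 / (n*pi_k)."""
    n = sum(counts)
    return sum((c - n * q) ** 2 / (n * q) for c, q in zip(counts, null_probs))


def u_statistic(counts, null_probs, weights=None):
    """Weighted U-statistic computed from bin counts (assumed kernel, see lead-in).

    Uses the identity
      sum_{i != j} (1[X_i=k] - pi_k)(1[X_j=k] - pi_k)
        = (N_k - n*pi_k)^2 - N_k*(1 - 2*pi_k) - n*pi_k^2,
    which removes the diagonal terms responsible for the bias.
    """
    n = sum(counts)
    if n < 2:
        raise ValueError("need at least two observations")
    if weights is None:
        weights = [1.0] * len(counts)  # unweighted version (hypothetical default)
    total = 0.0
    for c, q, w in zip(counts, null_probs, weights):
        total += w * ((c - n * q) ** 2 - c * (1 - 2 * q) - n * q * q)
    return total / (n * (n - 1))


def u_statistic_pairwise(sample, null_probs, weights=None):
    """Brute-force version: average the kernel over all unordered pairs."""
    n = len(sample)
    d = len(null_probs)
    if weights is None:
        weights = [1.0] * d
    total = 0.0
    for x, y in combinations(sample, 2):
        for k in range(d):
            total += weights[k] * ((x == k) - null_probs[k]) * ((y == k) - null_probs[k])
    return total / (n * (n - 1) / 2)


if __name__ == "__main__":
    sample = [0, 0, 1, 2, 2, 2, 3]           # toy data: bin labels in {0, 1, 2, 3}
    null = [0.25, 0.25, 0.25, 0.25]          # uniform null distribution
    counts = [sample.count(k) for k in range(len(null))]
    print("Pearson chi-squared:", pearson_chi2(counts, null))
    print("U-statistic (counts):", u_statistic(counts, null))
    print("U-statistic (pairs): ", u_statistic_pairwise(sample, null))
```

Setting `weights` to `[1 / q for q in null]` mirrors the normalization used by Pearson's statistic. Whether that choice belongs to the class of weights studied in the paper is not something this sketch establishes.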
Recommendations
- Asymptotic approximations for the distributions of multinomial goodness-of-fit statistics
- Asymptotic distributions for goodness-of-fit statistics in a sequence of multinomial models
- Asymptotic approximations for the distributions of the multinomial goodness-of-fit statistics under local alternatives
- On the asymptotic properties of a certain class of goodness-of-fit tests associated with multinomial distributions
- A general family of limited information goodness-of-fit statistics for multinomial data
- On the intermediate asymptotic efficiency of goodness-of-fit tests in multinomial distributions
- Improvement of approximations for the distributions of multinomial goodness-of-fit statistics under nonlocal alternatives
- scientific article; zbMATH DE number 4166374
Cites work
- scientific article; zbMATH DE number 3126886
- scientific article; zbMATH DE number 3723610
- scientific article; zbMATH DE number 47948
- scientific article; zbMATH DE number 3087287
- A Warning on the Use of Chi-Squared Statistics With Frequency Tables With Small Expected Cell Counts
- A two-sample test for high-dimensional data with applications to gene-set testing
- An automatic inequality prover and instance optimal identity testing
- Asymptotic Distribution of The \(\chi^2\) Criterion when the Number of Observations and Number of Groups Increase Simultaneously
- Asymptotic normality and efficiency for certain goodness-of-fit tests
- Central limit theorem for integrated square error of multivariate nonparametric density estimators
- Central limit theorems for multinomial sums
- Chi-squared goodness of fit tests with applications
- Collision-based Testers are Optimal for Uniformity and Closeness
- Double asymptotics for the chi-square statistic
- Geometry of goodness-of-fit testing in high dimensional low sample size modelling
- Goodness-of-Fit Tests for Large Sparse Multinomial Distributions
- Hypothesis testing for densities and high-dimensional multinomials: sharp local minimax rates
- Hypothesis testing for high-dimensional multinomials: a selective review
- Mathematical Statistics
- Testing Statistical Hypotheses
- The log-likelihood ratio for sparse multinomial mixtures
- The matching, birthday and the strong birthday problem: a contemporary review
- Two moments suffice for Poisson approximations: The Chen-Stein method
- Univariate Discrete Distributions
Cited in (5)
- Asymptotically independent U-statistics in high-dimensional testing
- Poisson limit theorems for the Cressie-Read statistics
- Dimension-agnostic inference using cross U-statistics
- Geometry of goodness-of-fit testing in high dimensional low sample size modelling
- Testing for practically significant dependencies in high dimensions via bootstrapping maxima of \(U\)-statistics