Asymptotic normality of Gini correlation in high dimension with applications to the K-sample problem
From MaRDI portal
Publication:6184886
Abstract: The categorical Gini correlation proposed by Dang et al. is a dependence measure to characterize independence between categorical and numerical variables. The asymptotic distributions of the sample correlation under dependence and independence have been established when the dimension of the numerical variable is fixed. However, its asymptotic behavior for high dimensional data has not been explored. In this paper, we develop the central limit theorem for the Gini correlation in the more realistic setting where the dimensionality of the numerical variable is diverging. We then construct a powerful and consistent test for the -sample problem based on the asymptotic normality. The proposed test not only avoids computation burden but also gains power over the permutation procedure. Simulation studies and real data illustrations show that the proposed test is more competitive to existing methods across a broad range of realistic situations, especially in unbalanced cases.
Cites work
- A consistent multivariate test of association based on ranks of distances
- A new Gini correlation between quantitative and qualitative variables
- A nonparametric approach to high-dimensional \(k\)-sample comparison problems
- A nonparametric two-sample test applicable to high dimensional data
- A rank-based Cramér-von-Mises-type test for two samples
- A test for the two-sample problem based on empirical characteristic functions
- Asymptotic distributions of high-dimensional distance correlation inference
- Asymptotic normality of interpoint distances for high-dimensional data with applications to the two-sample problem
- Brownian distance covariance
- Consistent distribution-free \(K\)-sample and independence tests for univariate random variables
- DISCO analysis: A nonparametric extension of analysis of variance
- Distance covariance in metric spaces
- Distance-based and RKHS-based dependence metrics in high dimension
- Energy statistics: a class of statistics based on distances
- Hypothesis testing in the presence of multiple samples under density ratio models
- Interpoint distance based two sample tests in high dimension
- K-Sample Analogues of the Kolmogorov-Smirnov and Cramer-V. Mises Tests
- Measuring and testing dependence by correlation of distances
- Non-parametric \(k\)-sample tests: density functions vs distribution functions
- Nonparametric \(K\)-sample tests via dynamic slicing
- On a new multivariate two-sample test.
- Testing homogeneity for multiple nonnegative distributions with excess zero observations
- The Kolmogorov-Smirnov, Cramer-von Mises Tests
- The distance correlation \(t\)-test of independence in high dimension
- Two-sample test statistics for measuring discrepancies between two multivariate probability density functions using kernel-based density estimates
Cited in
(3)
This page was built for publication: Asymptotic normality of Gini correlation in high dimension with applications to the \(K\)-sample problem
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6184886)