Large-scale kernel methods for independence testing
Abstract: Representations of probability measures in reproducing kernel Hilbert spaces provide a flexible framework for fully nonparametric hypothesis tests of independence, which can capture any type of departure from independence, including nonlinear associations and multivariate interactions. However, these approaches come with a computational cost that is at least quadratic in the number of observations, which can be prohibitive in many applications. Arguably, it is exactly in such large-scale datasets that capturing any type of dependence is of interest, so striking a favourable tradeoff between computational efficiency and test performance for kernel independence tests would have a direct impact on their applicability in practice. In this contribution, we provide an extensive study of the use of large-scale kernel approximations in the context of independence testing, contrasting block-based, Nyström and random Fourier feature approaches. Through a variety of synthetic data experiments, it is demonstrated that our novel large-scale methods give comparable performance with existing methods whilst using significantly less computation time and memory.
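To make the random Fourier feature idea mentioned in the abstract concrete, the following is a minimal sketch and not the authors' implementation. It assumes scalar observations, a Gaussian kernel with a hypothetical `bandwidth` parameter, and a permutation null distribution chosen here purely for simplicity; the paper's own calibration of the test may differ. All function names are illustrative. The statistic is the squared Frobenius norm of the empirical cross-covariance between the two feature maps, which costs O(n D^2) for n observations and D features instead of the O(n^2) needed to form full kernel matrices. Only the Python standard library is used so that the example stays self-contained; a vectorised array library would be the usual choice in practice.

```python
import math
import random


def rff_features(values, freqs, phases):
    """Random Fourier features whose inner products approximate a Gaussian kernel."""
    scale = math.sqrt(2.0 / len(freqs))
    return [[scale * math.cos(w * v + b) for w, b in zip(freqs, phases)]
            for v in values]


def center_columns(rows):
    """Subtract each column mean so that feature products estimate covariances."""
    n = len(rows)
    means = [sum(col) / n for col in zip(*rows)]
    return [[v - m for v, m in zip(row, means)] for row in rows]


def cross_cov_stat(fx, fy):
    """Squared Frobenius norm of the empirical feature cross-covariance matrix."""
    n = len(fx)
    cols_y = list(zip(*fy))
    total = 0.0
    for col_x in zip(*fx):
        for col_y in cols_y:
            cov = sum(a * b for a, b in zip(col_x, col_y)) / n
            total += cov * cov
    return total


def rff_independence_test(xs, ys, num_features=10, bandwidth=1.0,
                          num_perms=99, seed=0):
    """Return (statistic, permutation p-value) for paired scalar samples."""
    rng = random.Random(seed)

    def embed(values):
        # Frequencies drawn from N(0, 1 / bandwidth^2) correspond to the
        # Gaussian kernel exp(-(u - v)^2 / (2 * bandwidth^2)).
        freqs = [rng.gauss(0.0, 1.0 / bandwidth) for _ in range(num_features)]
        phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in range(num_features)]
        return center_columns(rff_features(values, freqs, phases))

    fx, fy = embed(xs), embed(ys)
    observed = cross_cov_stat(fx, fy)

    # Assumption: the null is simulated by shuffling the pairing of the rows.
    exceed = 0
    for _ in range(num_perms):
        shuffled = fy[:]
        rng.shuffle(shuffled)
        if cross_cov_stat(fx, shuffled) >= observed:
            exceed += 1
    return observed, (1 + exceed) / (1 + num_perms)


if __name__ == "__main__":
    data_rng = random.Random(1)
    xs = [data_rng.gauss(0.0, 1.0) for _ in range(200)]
    # Nonlinear dependence with (near) zero linear correlation.
    ys = [x * x + 0.1 * data_rng.gauss(0.0, 1.0) for x in xs]
    print(rff_independence_test(xs, ys))
```

Because the feature maps are computed once and only the row pairing is shuffled, each permutation reuses the same features, so the total cost stays linear in the number of observations.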
Recommendations
- Kernel methods for measuring independence
- Kernel-based tests for joint independence
- Testing independence by nonparametric kernel method
- Kernel methods for independence measurement with coefficient constraints
- Testing the independence of sets of large-dimensional variables
- Testing independence among a large number of high-dimensional random vectors
- Large-scale simultaneous testing using kernel density estimation
- Testing for independence of large dimensional vectors
Cites work
- scientific article; zbMATH DE number 6378135
- scientific article; zbMATH DE number 3673370
- scientific article; zbMATH DE number 3719745
- scientific article; zbMATH DE number 5055767
- 10.1162/153244303768966085
- A Hilbert Space Embedding for Distributions
- A kernel two-sample test
- Algorithmic Learning Theory
- Algorithms for learning kernels based on centered alignment
- Approximation theorems of mathematical statistics
- Brownian distance covariance
- Distance covariance in metric spaces
- Equivalence of distance-based and RKHS-based statistics in hypothesis testing
- FastMMD: Ensemble of Circular Discrepancy for Efficient Two-Sample Test
- Feature selection via dependence maximization
- Measuring and testing dependence by correlation of distances
- Nonlinear canonical analysis and independence tests
- Nonlinear measures of association with kernel canonical correlation analysis and applications
- On the bootstrap of \(U\) and \(V\) statistics
- Scattered Data Approximation
- Support Vector Machines
- Theory of Reproducing Kernels
- Two-sample test statistics for measuring discrepancies between two multivariate probability density functions using kernel-based density estimates
Cited in (17)
- The exact equivalence of distance and kernel methods in hypothesis testing
- Validation of association
- A kernel- and optimal transport- based test of independence between covariates and right-censored lifetimes
- A survey of some recent developments in measures of association
- Tests of mutual or serial independence of random vectors with applications
- A new coefficient of correlation
- Nonparametric Independence Tests: Space Partitioning and Kernel Approaches
- scientific article; zbMATH DE number 7750671
- The Chi-Square Test of Distance Correlation
- A Kernel Log-Rank Test of Independence for Right-Censored Data
- Statistical dependence: beyond Pearson's \(\rho\)
- Large-scale simultaneous testing using kernel density estimation
- Multivariate tests of independence based on a new class of measures of independence in reproducing kernel Hilbert space
- A fast algorithm for computing distance correlation
- New HSIC-based tests for independence between two stationary multivariate time series
- The Binary Expansion Randomized Ensemble Test
- Testing semiparametric model-equivalence hypotheses based on the characteristic function