Same but different: distance correlations between topological summaries
From MaRDI portal
Publication:5118380
Abstract: Persistent homology allows us to create topological summaries of complex data. In order to analyse these statistically, we need to choose a topological summary and a relevant metric space in which this topological summary exists. While different summaries may contain the same information (as they come from the same persistence module), they can lead to different statistical conclusions since they lie in different metric spaces. The best choice of metric will often be application-specific. In this paper we discuss distance correlation, which is a non-parametric tool for comparing data sets that can lie in completely different metric spaces. In particular we calculate the distance correlation between different choices of topological summaries. We compare some different topological summaries for a variety of random models of underlying data via the distance correlation between the samples. We also give examples of performing distance correlation between topological summaries and other scalar measures of interest, such as a paired random variable or a parameter of the random model used to generate the underlying data. This article is meant to be expository in style, and will include the definitions of standard statistical quantities in order to be accessible to non-statisticians.
Recommendations
- Statistical topological data analysis using persistence landscapes
- A persistence landscapes toolbox for topological statistics
- Stochastic convergence of persistence landscapes and silhouettes
- Stochastic convergence of persistence landscapes and silhouettes
- Functional summaries of persistence diagrams
Cites work
- A persistence landscapes toolbox for topological statistics
- Cone fields and topological sampling in manifolds with bounded curvature
- Distance covariance in metric spaces
- Fréchet means for distributions of persistence diagrams
- Geometry helps to compare persistence diagrams
- Hypothesis testing for topological data analysis
- Kernel method for persistence diagrams via kernel embedding and weight factor
- Measuring and testing dependence by correlation of distances
- Positive definite metric spaces
- Principal component analysis of persistent homology rank functions with case studies of spatial point patterns, sphere packing and colloids
- Statistical topological data analysis using persistence landscapes
Cited in
(8)- Embeddings of persistence diagrams into Hilbert spaces
- Describing topology on the set of persistence diagrams
- Signatures, Lipschitz-Free Spaces, and Paths of Persistence Diagrams
- A new measure for the attitude to mobility of Italian students and graduates: a topological data analysis approach
- Graph pseudometrics from a topological point of view
- Persistence curves: a canonical framework for summarizing persistence diagrams
- Nonembeddability of persistence diagrams with \(p>2\) Wasserstein metric
- The space of persistence diagrams on \(n\) points coarsely embeds into Hilbert space
This page was built for publication: Same but different: distance correlations between topological summaries
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5118380)