Hypothesis testing for topological data analysis
From MaRDI portal
Abstract: Persistent homology is a vital tool for topological data analysis. Previous work has developed some statistical estimators for characteristics of collections of persistence diagrams. However, tools that provide statistical inference for observations that are persistence diagrams are limited. Specifically, there is a need for tests that can assess the strength of evidence against a claim that two samples arise from the same population or process. We propose the use of randomization-style null hypothesis significance tests (NHST) for these situations. The test is based on a loss function that comprises pairwise distances between the elements of each sample and all the elements in the other sample. We use this method to analyze a range of simulated and experimental data. Through these examples we experimentally explore the power of the p-values. Our results show that the randomization-style NHST based on pairwise distances can distinguish between samples from different processes, which suggests that its use for hypothesis tests upon persistence diagrams is reasonable. We demonstrate its application on a real dataset of fMRI data of patients with ADHD.
Recommendations
Cites work
- scientific article; zbMATH DE number 46578 (Why is no real title available?)
- scientific article; zbMATH DE number 1782877 (Why is no real title available?)
- scientific article; zbMATH DE number 2103273 (Why is no real title available?)
- scientific article; zbMATH DE number 962738 (Why is no real title available?)
- A statistical approach to persistent homology
- Confidence sets for persistence diagrams
- Could Fisher, Jeffreys and Neyman have agreed on testing? (With comments and a rejoinder).
- Describing high-order statistical dependence using ``concurrence topology, with application to functional MRI brain data
- Exploring uses of persistent homology for statistical analysis of landmark-based shape data
- Extending hypothesis testing with persistent homology to three or more groups
- Fréchet means for distributions of persistence diagrams
- Permutation \(p\)-values should never be zero: calculating exact \(p\)-values when permutations are randomly drawn
- Persistent homology transform for modeling shapes and surfaces
- Principal component analysis of persistent homology rank functions with case studies of spatial point patterns, sphere packing and colloids
- Probability measures on the space of persistence diagrams
- Randomization tests. With CD-ROM.
- Statistical topological data analysis using persistence landscapes
Cited in
(27)- Virtual persistence diagrams, signed measures, Wasserstein distances, and Banach spaces
- Tropical sufficient statistics for persistent homology
- Persistent homology based goodness-of-fit tests for spatial tessellations
- Persistent homology for analyzing environmental lake monitoring data
- Multiple hypothesis testing with persistent homology
- Statistical inference for persistent homology applied to simulated fMRI time series data
- scientific article; zbMATH DE number 7206834 (Why is no real title available?)
- Improving homology estimates with random walks
- Modelling persistence diagrams with planar point processes, and revealing topology with bagplots
- Extending hypothesis testing with persistent homology to three or more groups
- An introduction to persistent homology for time series
- What Are Higher-Order Networks?
- TDAstats
- scientific article; zbMATH DE number 7646157 (Why is no real title available?)
- Topology-driven goodness-of-fit tests in arbitrary dimensions
- Vector summaries of persistence diagrams for permutation-based hypothesis testing
- Feature Detection and Hypothesis Testing for Extremely Noisy Nanoparticle Images using Topological Data Analysis
- Topological data analysis on simple English Wikipedia articles
- The persistence landscape and some of its properties
- Functional summaries of persistence diagrams
- On the topological data analysis extensions and comparisons
- Comparison of persistence diagrams
- Same but different: distance correlations between topological summaries
- Metrics and Stabilization in One Parameter Persistence
- A Bayesian framework for persistent homology
- A random persistence diagram generator
- Bayesian topological learning for classifying the structure of biological networks
This page was built for publication: Hypothesis testing for topological data analysis
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q141945)