Hybrid Wasserstein distance and fast distribution clustering
Publication:2283573
DOI: 10.1214/19-EJS1639
zbMATH Open: 1435.62249
arXiv: 1812.11026
OpenAlex: W2995577776
MaRDI QID: Q2283573
FDO: Q2283573
Authors: Isabella Verdinelli, Larry Wasserman
Publication date: 3 January 2020
Published in: Electronic Journal of Statistics
Abstract: We define a modified Wasserstein distance for distribution clustering which inherits many of the properties of the Wasserstein distance but which can be estimated easily and computed quickly. The modified distance is the sum of two terms. The first term, which has a closed form, measures the location-scale differences between the distributions. The second term is an approximation that measures the remaining distance after accounting for location-scale differences. We consider several forms of approximation with our main emphasis being a tangent space approximation that can be estimated using nonparametric regression. We evaluate the strengths and weaknesses of this approach on simulated and real examples.
Full work available at URL: https://arxiv.org/abs/1812.11026
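The abstract says the first term of the modified distance has a closed form and measures location-scale differences, but it does not state the formula. As an illustration only, the sketch below assumes that this term is the standard squared 2-Wasserstein distance between two Gaussians with the same means and covariances as the data, namely ||m1 - m2||^2 + tr(S1 + S2 - 2 (S1^{1/2} S2 S1^{1/2})^{1/2}). The function names `psd_sqrt`, `gaussian_w2_squared` and `location_scale_term` are hypothetical and are not taken from the paper. The second (residual) term is not sketched here.

```python
import numpy as np


def psd_sqrt(a):
    """Symmetric square root of a positive semidefinite matrix (eigendecomposition)."""
    a = np.asarray(a, dtype=float)
    w, v = np.linalg.eigh((a + a.T) / 2.0)   # symmetrize to guard against round-off
    w = np.clip(w, 0.0, None)                # remove tiny negative eigenvalues
    return (v * np.sqrt(w)) @ v.T


def gaussian_w2_squared(mu1, cov1, mu2, cov2):
    """Squared 2-Wasserstein distance between N(mu1, cov1) and N(mu2, cov2).

    Assumption: this Gaussian closed form stands in for the location-scale
    term; the abstract does not give the paper's exact expression.
    """
    mu1 = np.atleast_1d(np.asarray(mu1, dtype=float))
    mu2 = np.atleast_1d(np.asarray(mu2, dtype=float))
    cov1 = np.atleast_2d(np.asarray(cov1, dtype=float))
    cov2 = np.atleast_2d(np.asarray(cov2, dtype=float))
    root1 = psd_sqrt(cov1)
    cross = psd_sqrt(root1 @ cov2 @ root1)
    location = np.sum((mu1 - mu2) ** 2)
    scale = np.trace(cov1 + cov2 - 2.0 * cross)
    return float(location + scale)


def location_scale_term(x, y):
    """Plug-in estimate from two samples of shape (n, d) and (m, d)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    return gaussian_w2_squared(
        x.mean(axis=0), np.cov(x, rowvar=False),
        y.mean(axis=0), np.cov(y, rowvar=False),
    )
```

For example, `gaussian_w2_squared([0.0, 0.0], np.eye(2), [3.0, 4.0], np.eye(2))` reduces to the squared distance between the means, because the covariance part vanishes when the two covariances are equal. The eigendecomposition-based square root is used so that the sketch depends only on NumPy.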
Recommendations
- Robust clustering tools based on optimal transportation
- Wasserstein distance on finite spaces: statistical inference and algorithms
- Inference for empirical Wasserstein distances on finite spaces
- Fuzzy clustering of distributional data with automatic weighting of variable components
- Clustering, factor discovery and optimal transport
Mathematics Subject Classification
- Classification and discrimination; cluster analysis (statistical aspects) (62H30)
- Learning and adaptive systems in artificial intelligence (68T05)
Cites Work
- Title not available
- Nonparametric functional data analysis. Theory and practice.
- Robust clustering tools based on optimal transportation
- Title not available
- An Exact Distribution-Free Test Comparing Two Multivariate Distributions based on Adjacency
- A class of Wasserstein metrics for probability distributions
- Fréchet means and Procrustes analysis in Wasserstein space
- Energy statistics: a class of statistics based on distances
- Title not available
- A fixed-point approach to barycenters in Wasserstein space
- Limit laws of the empirical Wasserstein distance: Gaussian distributions
- Functional mixed effects models
- A linear optimal transportation framework for quantifying and visualizing variations in sets of images
- Sharp asymptotic and finite-sample rates of convergence of empirical measures in Wasserstein distance
- Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations
- On the Bures-Wasserstein distance between positive definite matrices
- Multivariate kernel smoothing and its applications
- A population background for nonparametric density-based clustering
- Inference for empirical Wasserstein distances on finite spaces
- Optimal transport: fast probabilistic approximation with exact solvers
- Title not available
- Preconditioning of optimal transport
Cited In (15)
- Wasserstein discriminant analysis
- Linear regression for numeric symbolic variables: a least squares approach based on Wasserstein distance
- Hierarchical clustering with optimal transport
- Multisource Single-Cell Data Integration by MAW Barycenter for Gaussian Mixture Models
- Minimax confidence intervals for the sliced Wasserstein distance
- Central limit theorems for semi-discrete Wasserstein distances
- Wasserstein distance on finite spaces: statistical inference and algorithms
- Fast Discrete Distribution Clustering Using Wasserstein Barycenter With Sparse Support
- Clustering, factor discovery and optimal transport
- On clustering uncertain and structured data with Wasserstein barycenters and a geodesic criterion for the number of clusters
- Co-clustering algorithms for distributional data with automated variable weighting
- On the use of Wasserstein distance in the distributional analysis of human decision making under uncertainty
- Covariance-based soft clustering of functional data based on the Wasserstein-Procrustes metric
- Title not available
- Network consensus in the Wasserstein metric space of probability measures