k-variance: a clustered notion of variance

From MaRDI portal
Publication:5089735

DOI10.1137/20M1385895zbMATH Open1493.62257arXiv2012.06958OpenAlexW3112942112MaRDI QIDQ5089735FDOQ5089735


Authors: Justin Solomon, Kristjan Greenewald, H. N. Nagaraja Edit this on Wikidata


Publication date: 15 July 2022

Published in: SIAM Journal on Mathematics of Data Science (Search for Journal in Brave)

Abstract: We introduce k-variance, a generalization of variance built on the machinery of random bipartite matchings. K-variance measures the expected cost of matching two sets of k samples from a distribution to each other, capturing local rather than global information about a measure as k increases; it is easily approximated stochastically using sampling and linear programming. In addition to defining k-variance and proving its basic properties, we provide in-depth analysis of this quantity in several key cases, including one-dimensional measures, clustered measures, and measures concentrated on low-dimensional subsets of mathbbRn. We conclude with experiments and open problems motivated by this new way to summarize distributional shape.


Full work available at URL: https://arxiv.org/abs/2012.06958




Recommendations




Cites Work


Cited In (1)





This page was built for publication: \(k\)-variance: a clustered notion of variance

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5089735)