Randomized near-neighbor graphs, giant components and applications in data science

From MaRDI portal
Publication:3299443




Abstract: If we pick n random points uniformly in [0,1]d and connect each point to its knearest neighbors, then it is well known that there exists a giant connected component with high probability. We prove that in [0,1]d it suffices to connect every point to cd,1loglogn points chosen randomly among its cd,2lognnearest neighbors to ensure a giant component of size no(n) with high probability. This construction yields a much sparser random graph with simnloglogn instead of simnlogn edges that has comparable connectivity properties. This result has nontrivial implications for problems in data science where an affinity matrix is constructed: instead of picking the knearest neighbors, one can often pick kllk random points out of the knearest neighbors without sacrificing efficiency. This can massively simplify and accelerate computation, we illustrate this with several numerical examples.



Cites work



Describes a project that uses

Uses Software





This page was built for publication: Randomized near-neighbor graphs, giant components and applications in data science

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q3299443)