Spectral clustering in the Gaussian mixture block model

From MaRDI portal
Publication:6509901

arXiv2305.00979MaRDI QIDQ6509901FDOQ6509901


Authors: Shuangping Li, Tselil Schramm Edit this on Wikidata



Abstract: Gaussian mixture block models are distributions over graphs that strive to model modern networks: to generate a graph from such a model, we associate each vertex i with a latent feature vector uiinmathbbRd sampled from a mixture of Gaussians, and we add edge (i,j) if and only if the feature vectors are sufficiently similar, in that langleui,ujanglegeau for a pre-specified threshold au. The different components of the Gaussian mixture represent the fact that there may be different types of nodes with different distributions over features -- for example, in a social network each component represents the different attributes of a distinct community. Natural algorithmic tasks associated with these networks are embedding (recovering the latent feature vectors) and clustering (grouping nodes by their mixture component). In this paper we initiate the study of clustering and embedding graphs sampled from high-dimensional Gaussian mixture block models, where the dimension of the latent feature vectors doinfty as the size of the network noinfty. This high-dimensional setting is most appropriate in the context of modern networks, in which we think of the latent feature space as being high-dimensional. We analyze the performance of canonical spectral clustering and embedding algorithms for such graphs in the case of 2-component spherical Gaussian mixtures, and begin to sketch out the information-computation landscape for clustering and embedding in these models.













This page was built for publication: Spectral clustering in the Gaussian mixture block model

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6509901)