On the Analysis of a Label Propagation Algorithm for Community Detection

From MaRDI portal
Publication:5507403

DOI10.1007/978-3-642-35668-1_18zbMATH Open1351.68303arXiv1210.3735OpenAlexW1831343635MaRDI QIDQ5507403FDOQ5507403


Authors: Kishore Kothapalli, Vivek B. Sardeshmukh, Sriram Pemmaraju Edit this on Wikidata


Publication date: 19 December 2016

Published in: Distributed Computing and Networking (Search for Journal in Brave)

Abstract: This paper initiates formal analysis of a simple, distributed algorithm for community detection on networks. We analyze an algorithm that we call extsc{Max-LPA}, both in terms of its convergence time and in terms of the "quality" of the communities detected. extsc{Max-LPA} is an instance of a class of community detection algorithms called extit{label propagation} algorithms. As far as we know, most analysis of label propagation algorithms thus far has been empirical in nature and in this paper we seek a theoretical understanding of label propagation algorithms. In our main result, we define a clustered version of er random graphs with clusters V1,V2,...,Vk where the probability p, of an edge connecting nodes within a cluster Vi is higher than p, the probability of an edge connecting nodes in distinct clusters. We show that even with fairly general restrictions on p and p (p=Omega(frac1n1/4epsilon) for any epsilon>0, p=O(p2), where n is the number of nodes), extsc{Max-LPA} detects the clusters V1,V2,...,Vn in just two rounds. Based on this and on empirical results, we conjecture that extsc{Max-LPA} can correctly and quickly identify communities on clustered er graphs even when the clusters are much sparser, i.e., with p=fracclognn for some c>1.


Full work available at URL: https://arxiv.org/abs/1210.3735




Recommendations




Cited In (15)





This page was built for publication: On the Analysis of a Label Propagation Algorithm for Community Detection

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5507403)