Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Peng, Chengbina; b | Zhang, Zhihuac | Wong, Ka-Chund | Zhang, Xianglianga; * | Keyes, David E.a
Affiliations: [a] King Abdullah University of Science and Technology, Thuwal, Saudi Arabia | [b] Ningbo Institute of Industrial Technology, Ningbo, Zhejiang, China | [c] Shanghai Jiao Tong University, Shanghai, China | [d] City University of Hong Kong, Hong Kong, China
Correspondence: [*] Corresponding author: Xiangliang Zhang, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Post Box: 2925, Kingdom of Saudi Arabia. Tel.: +966 12 808 0313; E-mail: [email protected].
Abstract: Community detection in graphs is widely used in social and biological networks, and the stochastic block model is a powerful probabilistic tool for describing graphs with community structures. However, in the era of “big data”, traditional inference algorithms for such a model are increasingly limited due to their high time complexity and poor scalability. In this paper, we propose a multi-stage maximum likelihood approach to recover the latent parameters of the stochastic block model, in time linear with respect to the number of edges. We also propose a parallel algorithm based on message passing. Our algorithm can overlap communication and computation, providing speedup without compromising accuracy as the number of processors grows. For example, to process a real-world graph with about 1.3 million nodes and 10 million edges, our algorithm requires about 6 seconds on 64 cores of a contemporary commodity Linux cluster. Experiments demonstrate that the algorithm can produce high quality results on both benchmark and real-world graphs. An example of finding more meaningful communities is illustrated consequently in comparison with a popular modularity maximization algorithm.
Keywords: Stochastic block model, parallel computing, community detection
DOI: 10.3233/IDA-163156
Journal: Intelligent Data Analysis, vol. 21, no. 6, pp. 1463-1485, 2017
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]