Improved community structure discovery algorithm based on combined clique percolation method and K-means algorithm

Research on the community structure of networks is beneficial for understanding the structure of networks, analyzing their characteristics and discovering the rules hidden in these networks. To address issues from previous community mining algorithms, such as the low rate of convergence and high tim...

Full description

Saved in:
Bibliographic Details
Published inPeer-to-peer networking and applications Vol. 13; no. 6; pp. 2224 - 2233
Main Authors Zhou, Zhou, Xiao, Zhuopeng, Deng, WeiHong
Format Journal Article
LanguageEnglish
Published New York Springer US 01.11.2020
Springer Nature B.V
Subjects
Online AccessGet full text
ISSN1936-6442
1936-6450
DOI10.1007/s12083-020-00902-9

Cover

More Information
Summary:Research on the community structure of networks is beneficial for understanding the structure of networks, analyzing their characteristics and discovering the rules hidden in these networks. To address issues from previous community mining algorithms, such as the low rate of convergence and high time complexity, this study proposes an improved community structure discovery algorithm named CPMK-Means algorithm. The main idea of this algorithm can be summarised as follows. The clique percolation method (CPM) algorithm generates the maximum number of cliques by combining depth-first search with breadth-first search so that the number of cluster centres is determined. Then, the k centres are selected based on the principle of the maximum degree of centres and minimum similarity between different centres. Afterwards, nodes in the network are assigned to the communities formed by the k centres, and the iterations are performed repeatedly until the centres become stable. Finally, the overlapping communities are merged. Experiments are carried out on standard data sets Football and Collins to evaluate the performance of the CPMK-Means algorithm. Results indicate that the CPMK-Means algorithm can achieve better community mining and higher execution efficiency compared with other algorithms. Furthermore, it is superior to other algorithms in terms of precision, recall, accuracy, F-measure and separation.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1936-6442
1936-6450
DOI:10.1007/s12083-020-00902-9