Improved community structure discovery algorithm based on combined clique percolation method and K-means algorithm

Research on the community structure of networks is beneficial for understanding the structure of networks, analyzing their characteristics and discovering the rules hidden in these networks. To address issues from previous community mining algorithms, such as the low rate of convergence and high tim...

Full description

Saved in:

Bibliographic Details
Published in	Peer-to-peer networking and applications Vol. 13; no. 6; pp. 2224 - 2233
Main Authors	Zhou, Zhou, Xiao, Zhuopeng, Deng, WeiHong
Format	Journal Article
Language	English
Published	New York Springer US 01.11.2020 Springer Nature B.V
Subjects	Algorithms Communications Engineering Computer Communication Networks Engineering Information Systems and Communication Service Networks Percolation Signal,Image and Speech Processing Standard data Community structure Execution efficiency Parallel computing Big data Mining algorithm
Online Access	Get full text
ISSN	1936-6442 1936-6450
DOI	10.1007/s12083-020-00902-9

Cover

More Information
Summary:	Research on the community structure of networks is beneficial for understanding the structure of networks, analyzing their characteristics and discovering the rules hidden in these networks. To address issues from previous community mining algorithms, such as the low rate of convergence and high time complexity, this study proposes an improved community structure discovery algorithm named CPMK-Means algorithm. The main idea of this algorithm can be summarised as follows. The clique percolation method (CPM) algorithm generates the maximum number of cliques by combining depth-first search with breadth-first search so that the number of cluster centres is determined. Then, the k centres are selected based on the principle of the maximum degree of centres and minimum similarity between different centres. Afterwards, nodes in the network are assigned to the communities formed by the k centres, and the iterations are performed repeatedly until the centres become stable. Finally, the overlapping communities are merged. Experiments are carried out on standard data sets Football and Collins to evaluate the performance of the CPMK-Means algorithm. Results indicate that the CPMK-Means algorithm can achieve better community mining and higher execution efficiency compared with other algorithms. Furthermore, it is superior to other algorithms in terms of precision, recall, accuracy, F-measure and separation.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	1936-6442 1936-6450
DOI:	10.1007/s12083-020-00902-9