OClustR: A new graph-based algorithm for overlapping clustering

Clustering is a Data Mining technique, which has been widely used in many practical applications. From these applications, there are some, like social network analysis, topic detection and tracking, information retrieval, categorization of digital libraries, among others, where objects may belong to...

Full description

Saved in:
Bibliographic Details
Published inNeurocomputing (Amsterdam) Vol. 121; pp. 234 - 247
Main Authors Pérez-Suárez, Airel, Martínez-Trinidad, José F., Carrasco-Ochoa, Jesús A., Medina-Pagola, José E.
Format Journal Article
LanguageEnglish
Published Amsterdam Elsevier B.V 09.12.2013
Elsevier
Subjects
Online AccessGet full text
ISSN0925-2312
1872-8286
DOI10.1016/j.neucom.2013.04.025

Cover

More Information
Summary:Clustering is a Data Mining technique, which has been widely used in many practical applications. From these applications, there are some, like social network analysis, topic detection and tracking, information retrieval, categorization of digital libraries, among others, where objects may belong to more than one cluster; however, most clustering algorithms build disjoint clusters. In this work, we introduce OClustR, a new graph-based clustering algorithm for building overlapping clusters. The proposed algorithm introduces a new graph-covering strategy and a new filtering strategy, which together allow to build overlapping clusterings more accurately than those built by previous algorithms. The experimental evaluation, conducted over several standard collections, showed that our proposed algorithm builds less clusters than those built by the previous related algorithms. Additionally, OClustR builds clusters with overlapping closer to the real overlapping in the collections than the overlapping generated by other clustering algorithms. •A new overlapping clustering algorithm, called OClustR, is proposed.•OClustR introduces a new graph-covering strategy and a new filtering strategy.•OClustR outperforms state-of-the-art algorithms in several collections.•OClustR solves the limitations of the state-of-the-art algorithms.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0925-2312
1872-8286
DOI:10.1016/j.neucom.2013.04.025