Design and analysis of text document clustering using salp swarm algorithm

In the technological era, exponential increase of unorganized text documents offers increased difficulties retrieving the most relevant data. The document clustering is a most prominent technique that transforms unorganized contents into organized contents in the form of clusters. The recognition te...

Full description

Saved in:
Bibliographic Details
Published inThe Journal of supercomputing Vol. 78; no. 14; pp. 16197 - 16213
Main Authors Ponnusamy, Muruganantham, Bedi, Pradeep, Suresh, Tamilarasi, Alagarsamy, Aravindhan, Manikandan, R., Yuvaraj, N.
Format Journal Article
LanguageEnglish
Published New York Springer US 01.09.2022
Springer Nature B.V
Subjects
Online AccessGet full text
ISSN0920-8542
1573-0484
DOI10.1007/s11227-022-04525-0

Cover

More Information
Summary:In the technological era, exponential increase of unorganized text documents offers increased difficulties retrieving the most relevant data. The document clustering is a most prominent technique that transforms unorganized contents into organized contents in the form of clusters. The recognition technique always undergoes clustering of text documents with misleading or redundant information that degrades document clustering quality. In this study, a salp swarm algorithm (SSA) is used for clustering the text documents. The study is improved with a similarity and a distance-based measurements as an objective function in the clustering domain. The experimental validation is conducted to show the efficacy of SSA-based similarity distance measurement that prominently improves the quality of clustering the text documents. The comparison with existing methods shows that the proposed SSA offers better clustering of text documents in accuracy, sensitivity, specificity, and f -measure.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0920-8542
1573-0484
DOI:10.1007/s11227-022-04525-0