MutaSwarmClus: enhancing data clustering efficiency with mutation-enhanced swarm algorithm

Data clustering is a fundamental technique in data mining, pivotal for various applications such as statistical analysis and data compression. Traditional clustering algorithms often struggle with noisy or high-dimensional datasets, hindering their efficacy in addressing real-world challenges. In re...

Full description

Saved in:
Bibliographic Details
Published inCluster computing Vol. 28; no. 3; p. 188
Main Authors Malik, Saleem, Patro, S. Gopal Krishna, Mahanty, Chandrakanta, Lasisi, Ayodele, Al-sareji, Osamah J.
Format Journal Article
LanguageEnglish
Published New York Springer US 01.06.2025
Springer Nature B.V
Subjects
Online AccessGet full text
ISSN1386-7857
1573-7543
1573-7543
DOI10.1007/s10586-024-04822-8

Cover

More Information
Summary:Data clustering is a fundamental technique in data mining, pivotal for various applications such as statistical analysis and data compression. Traditional clustering algorithms often struggle with noisy or high-dimensional datasets, hindering their efficacy in addressing real-world challenges. In response, this research introduces MutaSwarmClus, a novel hybrid metaheuristic algorithm that combines Mouth Brooding Fish (MBF), Ant Colony Optimization (ACO), and mutation operators to enhance clustering quality. MutaSwarmClus intends to adaptively control the exploration and exploitation phases of the solution space, solve issues with local optima and changes in the distribution of available data. Moreover, it incorporates an Iterated Local Search (ILS) to refine solutions and avoiding getting stuck in local optima. MutaSwarmClus therefore increases the robustness of the clustering process by incorporating controlled randomness through mutation operators in order to handle noisy and outlier data points well. According to the contributions analysis, the proposed algorithm improves the clustering solution with the combined system of MBF, ACO, and mutation operators, which enables the mechanism of exploration and exploitation in the process of information search. As shown through the results of experimental studies, MutaSwarmClus has high performance when used with various benchmarks, and outperforms or performs as well as or better than compared to other clustering algorithms such as K-means, ALO, Hybrid ALO, and MBF. It achieves an average error rate of only 10%, underscoring its accuracy in clustering tasks. The utilization of MutaSwarmClus offers a solution to the existing problems in clustering large datasets in terms of scalability, efficiency and accuracy. Possible directions for future work can continue to optimize the model parameters of the algorithm and study its adaptability in dynamic conditions and with large amounts of data.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1386-7857
1573-7543
1573-7543
DOI:10.1007/s10586-024-04822-8