Big data based analytic model to predict and classify breast cancer using improved fractional rough fuzzy K‐means clustering and labeled ensemble classifier algorithm

Breast cancer is a very dangerous disease that mainly affects women. It is a deadliest disease that highly affects the women's life. Therefore, it is necessary to predict and classify this deadly disease for early diagnosis. There exist numerous data mining techniques for early prediction and c...

Full description

Saved in:
Bibliographic Details
Published inConcurrency and computation Vol. 34; no. 10
Main Authors K, Srikanth, Zahoor Ul Huq, S., Siva Kumar, A. P.
Format Journal Article
LanguageEnglish
Published Hoboken, USA John Wiley & Sons, Inc 01.05.2022
Wiley Subscription Services, Inc
Subjects
Online AccessGet full text
ISSN1532-0626
1532-0634
DOI10.1002/cpe.6715

Cover

More Information
Summary:Breast cancer is a very dangerous disease that mainly affects women. It is a deadliest disease that highly affects the women's life. Therefore, it is necessary to predict and classify this deadly disease for early diagnosis. There exist numerous data mining techniques for early prediction and classification of this disease. The big data based analytical model provides the better solution for storing, manipulating, and analyzing a great number of mammographic images. In this article, a new improved fractional rough fuzzy K‐means clustering strategy is considered for disease prediction. Then, a new Tunicate Swarm Algorithm (TSA) is introduced to optimize the weight parameters. TSA is a bio‐inspired metaheuristic optimization approach. Finally, the labeled ensemble classifier (LEC) is utilized for classifying the stages of breast cancer as malignant and benign. Here, the data is randomly generated from breast cancer Wisconsin dataset (diagnosis) obtainable on UCI machine learning repository. The proposed strategy is compared with different existing strategies, like Logistic Regression Classifier, Random Forest Classifier. From the analysis, it is observed that the proposed big data based analytical model using LEC provides 99.3% accuracy that is very high when compared to the accuracy of existing approaches.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1532-0626
1532-0634
DOI:10.1002/cpe.6715