New hybrid features extracted from US images for breast cancer classification

Bibliographic Details
Published in Scientific Reports Vol. 15; no. 1; pp. 25690 - 15
Main Authors Tăbăcaru, Gigi; Moldovanu, Simona; Munteanu, Dan; Barbu, Marian
Format Journal Article
Language English
Published London Nature Publishing Group UK 16.07.2025
Nature Publishing Group
Nature Portfolio
ISSN 2045-2322
DOI 10.1038/s41598-025-09554-2

Summary: Artificial intelligence (AI) and image processing play a vital role in classifying breast cancer (BC) lesions as benign or malignant. The novelty of this paper lies in computing original hybrid features (HF) by integrating textural and shape features of BC lesions into a polynomial regression, and in classifying them with two different Automated Machine Learning (AutoML) tools. Because the obtained data are original, they were first analyzed with violin plots. To compute the hybrid features, Haralick textural features and Hu moments were combined through polynomial regression. Two AutoML tools, PyCaret and TPOT (Tree-based Pipeline Optimization Tool), were used, and the optimal model for the hybrid features was identified during the tuning process. The experimental results indicated that, for the HF composed of entropy and Hu moments, PyCaret selected the AdaBoost Classifier (ADB) as the optimal classifier, achieving an accuracy of 91.4%. TPOT employed a Multilayer Perceptron Classifier, which provided an accuracy of 90.6%. These findings identified the most effective features for classifying benign and malignant BC. Enhancing computational efficiency reduces the risk of overfitting; hence, the bagging, boosting, and stacking Ensemble Machine Learning (EML) techniques were proposed to validate the obtained results. The study’s originality lies in the HF’s capacity to represent and capture the lesion’s texture and shape accurately, much as a physician does when diagnosing BC.
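The summary does not give the formula used to combine entropy with the Hu moments, so the following is only a minimal sketch of one possible reading, not the authors' method. It assumes an 8-bit grayscale ultrasound patch and a binary lesion mask, computes a GLCM entropy (horizontal neighbours only, 8 grey levels, both choices arbitrary) and the first Hu moment with NumPy, and then forms a degree-2 polynomial expansion of the pair. The function names `glcm_entropy`, `first_hu_moment`, and `hybrid_features` are hypothetical.

```python
import numpy as np


def glcm_entropy(img, levels=8):
    """Entropy (in bits) of a horizontal-neighbour grey-level co-occurrence matrix.

    Assumption: `img` is a 2-D array of 8-bit intensities at least 2 pixels wide.
    """
    # Quantise intensities in [0, 255] down to `levels` grey levels.
    q = (img.astype(np.float64) / 256.0 * levels).astype(int)
    q = np.clip(q, 0, levels - 1)
    # Count how often grey level i has grey level j as its right-hand neighbour.
    glcm = np.zeros((levels, levels), dtype=np.float64)
    np.add.at(glcm, (q[:, :-1].ravel(), q[:, 1:].ravel()), 1.0)
    p = glcm / glcm.sum()
    nz = p[p > 0]  # skip empty cells so log2 is defined
    return float(-(nz * np.log2(nz)).sum())


def first_hu_moment(mask):
    """First Hu invariant (eta20 + eta02) of a non-empty binary lesion mask."""
    ys, xs = np.nonzero(mask)
    m00 = float(len(xs))  # area in pixels
    xc, yc = xs.mean(), ys.mean()  # centroid
    mu20 = ((xs - xc) ** 2).sum()
    mu02 = ((ys - yc) ** 2).sum()
    # Second-order central moments are normalised by m00 ** (1 + 2/2).
    return float((mu20 + mu02) / m00 ** 2)


def hybrid_features(entropy, hu1):
    """Hypothetical hybrid vector: degree-2 polynomial terms of (entropy, hu1)."""
    return np.array([entropy, hu1, entropy ** 2, entropy * hu1, hu1 ** 2])
```

Under these assumptions, one such vector per lesion could then be passed to any classifier, for example the AutoML tools named in the summary; that step is omitted here.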