A fine-grained image classification algorithm based on self-supervised learning and multi-feature fusion of blood cells

Leukemia is a prevalent and widespread blood disease, and its early diagnosis is crucial for effective patient treatment. Diagnosing leukemia types heavily relies on pathologists’ morphological examination of blood cell images. However, this process is tedious and time-consuming, and the diagnostic...

Full description

Saved in:
Bibliographic Details
Published inScientific reports Vol. 14; no. 1; pp. 22964 - 15
Main Authors Jia, Nan, Guo, Jingxia, Li, Yan, Tang, Siyuan, Xu, Li, Liu, Liang, Xing, Junfeng
Format Journal Article
LanguageEnglish
Published London Nature Publishing Group UK 03.10.2024
Nature Publishing Group
Nature Portfolio
Subjects
Online AccessGet full text
ISSN2045-2322
2045-2322
DOI10.1038/s41598-024-74753-2

Cover

More Information
Summary:Leukemia is a prevalent and widespread blood disease, and its early diagnosis is crucial for effective patient treatment. Diagnosing leukemia types heavily relies on pathologists’ morphological examination of blood cell images. However, this process is tedious and time-consuming, and the diagnostic results are subjective, leading to potential misdiagnosis and underdiagnosis. This paper proposes a blood cell image classification method that combines MAE with an enhanced Vision Transformer to tackle these challenges. Initially, pre-training occurs on two datasets, TMAMD and Red4, using the MAE self-supervised learning algorithm. Subsequently, the pre-training weights are transferred to our improved model.This paper introduces feature fusion of the outputs from each layer of the Transformer encoder to maximize the utilization of features extracted from lower layers, such as color, contour, and texture of blood cells, along with deeper semantic features. Furthermore, the dynamic margins for the subcenter Arcface Loss function are employed to enhance the model’s fine-grained feature representation by achieving inter-class dispersion and intra-class aggregation. Models trained using our method achieved state-of-the-art results on both the TMAMD dataset and Red4 dataset, with classification accuracies of 93.51% and 81.41%, respectively. This achievement is expected to be a valuable reference for physicians in their clinical diagnoses.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:2045-2322
2045-2322
DOI:10.1038/s41598-024-74753-2