A fine-grained image classification algorithm based on self-supervised learning and multi-feature fusion of blood cells

Bibliographic Details
Published in	Scientific Reports Vol. 14; no. 1; pp. 22964 - 15
Main Authors	Jia, Nan; Guo, Jingxia; Li, Yan; Tang, Siyuan; Xu, Li; Liu, Liang; Xing, Junfeng
Format	Journal Article
Language	English
Published	London: Nature Publishing Group UK, 03.10.2024; Nature Publishing Group; Nature Portfolio
Subjects	639/705/1042; 692/699/67/1990/283; Algorithms; Blood; Blood Cells; Cell fusion; Classification; Humanities and Social Sciences; Humans; Image Interpretation, Computer-Assisted - methods; Image processing; Image Processing, Computer-Assisted - methods; Leukemia; Leukemia - diagnosis; Leukemia - pathology; multidisciplinary; Science; Science (multidisciplinary); Supervised Machine Learning; Training
ISSN	2045-2322
DOI	10.1038/s41598-024-74753-2

Summary:	Leukemia is a prevalent blood disease, and its early diagnosis is crucial for effective patient treatment. Diagnosing leukemia types relies heavily on pathologists' morphological examination of blood cell images. However, this process is tedious and time-consuming, and the diagnostic results are subjective, leading to potential misdiagnosis and underdiagnosis. This paper proposes a blood cell image classification method that combines MAE (masked autoencoder) with an enhanced Vision Transformer to tackle these challenges. First, the model is pre-trained on two datasets, TMAMD and Red4, using the MAE self-supervised learning algorithm. The pre-trained weights are then transferred to the improved model. The paper fuses the outputs of every layer of the Transformer encoder so that features extracted by lower layers, such as the color, contour, and texture of blood cells, are used alongside deeper semantic features. Furthermore, a sub-center ArcFace loss with dynamic margins is employed to enhance the model's fine-grained feature representation by achieving inter-class dispersion and intra-class aggregation. Models trained using this method achieved state-of-the-art results on both the TMAMD and Red4 datasets, with classification accuracies of 93.51% and 81.41%, respectively. This achievement is expected to be a valuable reference for physicians in their clinical diagnoses.
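The loss mentioned in the summary can be illustrated with a minimal pure-Python sketch of a sub-center ArcFace loss for a single sample. This is not the authors' implementation. The function names, the default `scale`, and in particular the margin rule in `dynamic_margins` (rarer classes get larger margins, via class count raised to -0.25 and rescaled) are assumptions made for illustration; the summary does not say how the margins are computed.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def dynamic_margins(class_counts, m_min=0.2, m_max=0.5):
    """Hypothetical rule: rarer classes get larger angular margins.

    Uses count ** -0.25, rescaled linearly into [m_min, m_max].
    """
    raw = [c ** -0.25 for c in class_counts]
    lo, hi = min(raw), max(raw)
    if hi == lo:
        return [m_max for _ in raw]
    return [m_min + (r - lo) / (hi - lo) * (m_max - m_min) for r in raw]


def subcenter_arcface_loss(embedding, subcenters, label, margins, scale=30.0):
    """Cross-entropy over margin-adjusted cosine logits for one sample.

    subcenters[c][k] is the k-th sub-center vector of class c.
    """
    e = l2_normalize(embedding)
    # Per class, keep only the cosine to the closest sub-center.
    cosines = [
        max(dot(e, l2_normalize(w)) for w in centers) for centers in subcenters
    ]
    logits = []
    for c, cos_c in enumerate(cosines):
        if c == label:
            # Add the class margin to the angle of the true class only.
            # Simplification: the theta + margin > pi case is not handled.
            theta = math.acos(max(-1.0, min(1.0, cos_c)))
            cos_c = math.cos(theta + margins[c])
        logits.append(scale * cos_c)
    # Numerically stable log-sum-exp for softmax cross-entropy.
    top = max(logits)
    log_sum = top + math.log(sum(math.exp(z - top) for z in logits))
    return log_sum - logits[label]


# Toy example: two classes, two sub-centers each, 2-D embeddings.
margins = dynamic_margins([1000, 50])
subcenters = [[[1.0, 0.0], [0.9, 0.1]], [[0.0, 1.0], [0.1, 0.9]]]
loss = subcenter_arcface_loss([0.8, 0.2], subcenters, label=0, margins=margins)
```

The margin makes the true-class logit harder to satisfy, which pushes embeddings of the same class closer together and classes further apart. Taking the maximum over sub-centers lets one class cover several visual modes.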