| Literature DB >> 34249819 |
Min Zhou1,2, Kefei Wu1,2, Lisha Yu2, Mengdi Xu3,4, Junjun Yang5, Qing Shen3,4, Bo Liu3,4, Lei Shi3,4, Shuang Wu3,4, Bin Dong1, Hansong Wang1,6, Jiajun Yuan1,7, Shuhong Shen2, Liebin Zhao1,6.
Abstract
Leukemia is the most common malignancy affecting children. The morphologic analysis of bone marrow smears is an important initial step for diagnosis. Recent publications demonstrated that artificial intelligence is able to classify blood cells but a long way from clinical use. A total of 1,732 bone marrow images were used for the training of a convolutional neural network (CNN). New techniques of deep learning were integrated and an end-to-end leukemia diagnosis system was developed by using raw images without pre-processing. The system creatively imitated the workflow of a hematologist by detecting and excluding uncountable and crushed cells, then classifying and counting the remain cells to make a diagnosis. The performance of the CNN in classifying WBCs achieved an accuracy of 82.93%, precision of 86.07% and F1 score of 82.02%. And the performance in diagnosing acute lymphoid leukemia achieved an accuracy of 89%, sensitivity of 86% and specificity of 95%. The system also performs well at detecting the bone marrow metastasis of lymphoma and neuroblastoma, achieving an average accuracy of 82.93%. This is the first study which included a wider variety of cell types in leukemia diagnosis, and achieved a relatively high performance in real clinical scenarios.Entities:
Keywords: artificial intelligence; computer-aided diagnose; deep learning; leukemia; morphological diagnosis
Year: 2021 PMID: 34249819 PMCID: PMC8264256 DOI: 10.3389/fped.2021.693676
Source DB: PubMed Journal: Front Pediatr ISSN: 2296-2360 Impact factor: 3.418
Figure 1Representative images of the classified cell types.
Figure 2Dataset split and distribution of categories.
The cell types and total numbers of cells of each class in the dataset.
| Granulocyte | 0 | Myeloblast | 830 |
| 1 | Promyelocyte | 1,101 | |
| 2 | Neutrophilic myelocyte | 1,398 | |
| 3 | Neutrophilic metamyelocyte | 1,244 | |
| 4 | Neutrophilic granulocyte band form | 1,784 | |
| 5 | Neutrophilic granulocyte segmented from | 2,004 | |
| 6 | Eosinophil | 352 | |
| Erythroid | 7 | Pro-erythroblast | 337 |
| 8 | Basophilic erythroblast | 530 | |
| 9 | Polychromatic erythroblast | 2,983 | |
| 10 | Orthochromatic erythroblast | 1,439 | |
| Lymphocyte | 11 | Lymphoblast | 1,905 |
| 12 | Lymphocyte | 2,558 | |
| 13 | Lymphoma cell | 900 | |
| Monocyte | 14 | Monoblast | 1,410 |
| 15 | Promonocyte | 520 | |
| 16 | Monocyte | 544 | |
| Megakaryocyte | 17 | Megakaryoblast | 630 |
| 18 | Promegakaryocyte | 185 | |
| Others | 19 | Neuroblastoma cell | 1,201 |
| 20 | Uncountable cells | 310 | |
The uncountable cells included reticular cells, mast cells, naked nuclei and so on, which are usually not involved in the diagnosis of leukemia.
Figure 3The overall modeling framework containing two modules: detection module and classification module. In the detection module, we train detection model using RetinaNet method to detect all the WBCs in bone marrow images. The classification module contains two stage. In the first stage, we develop a countable cell classification model to discriminate crush white blood cell which would not be counted by hematologists. Then in the second stage, the detected countable white blood cells are submitted to classification model for WBC classification.
Figure 4The ROC curve of the second model stage for discriminating countable and uncountable WBCs.
The performance of the deep learning model for the classification of types of WBCs.
| ResNext101_32*8d swsl | 0.8149 | 0.8441 | 0.8149 | 0.9856 |
| ResNext50_32*4dswsl | 0.7982 | 0.8327 | 0.7982 | 0.9844 |
| Resnet50 | 0.8073 | 0.8023 | 0.8074 | 0.9813 |
| Ensemble model | 0.8293 | 0.8567 | 0.8293 | 0.9870 |
Figure 5The confusion matrix of the ensemble model for WBC classification (Refer to Table 1 for cell types of each class).
Figure 6The PR curves of the differential count of 20 types of cells. (A) Cell classes with an AP over 0.9. (B) Cell classes with an AP from 0.8 to 0.9. (C) Cell classes with an AP under 0.8.
Comparison of the classification accuracy of different researches in WBC detection.
| MoradiAmin et al. ( | Real world | 958 | 2 | ALL | Feature extraction, c-means | >90% (disease level) |
| Bigorra et al. ( | Real world | 916 | 3 | ALL, AML | Feature extraction | >74% (cell level) |
| Choi et al. ( | Real world | 2,174 | 10 | ALL, AML | CNN | 97.06% (cell level) |
| Shafique et al. ( | ALL-IDB | 108+260 | 2 | ALL | CNN | >95% (disease level) |
| Moshavash et al. ( | ALL-IDB | 108+260 | 2 | ALL | Segmentation, feature extraction, SVM | 89.81% (disease level) |
| Qin et al. ( | Real world | 92,480 | 40 | ALL, AML | CNN | 76.84% (cell level) |
| Rehman et al. ( | Real world | / | 2 | ALL | Segmentation, CNN | 97.78% (disease level) |
| Boudu et al. ( | Real world | 7,468 | 6 | ALL, AML | Feature extraction | 85.8% (cell level) |
| Shahin et al. ( | ALL-IDB | 108+260 | 2 | ALL | Feature extraction, CNN | 96.1% (cell level) |
| Anwar et al. ( | ALL-IDB | 108+260 | 2 | ALL | CNN | 99.5% (disease level) |
| Gehlot et al. ( | TCIA | 15,114 | 2 | ALL | CNN | 94.8% (F1 score, cell level) |
| Zhang et al. ( | BCCD | 5,000 | 6 | / | CNN, HOG, SVM | >95% (cell level) |