| Literature DB >> 35193533 |
Karsten Wendt1, Jan Moritz Middeke2, Jan-Niklas Eckardt3, Tim Schmittmann1, Sebastian Riechert1, Michael Kramer2, Anas Shekh Sulaiman2, Katja Sockel2, Frank Kroschinsky2, Johannes Schetelig2, Lisa Wagenführ2, Ulrich Schuler2, Uwe Platzbecker4, Christian Thiede2, Friedrich Stölzel2, Christoph Röllig2, Martin Bornhäuser2,5,6.
Abstract
BACKGROUND: Acute promyelocytic leukemia (APL) is considered a hematologic emergency due to high risk of bleeding and fatal hemorrhages being a major cause of death. Despite lower death rates reported from clinical trials, patient registry data suggest an early death rate of 20%, especially for elderly and frail patients. Therefore, reliable diagnosis is required as treatment with differentiation-inducing agents leads to cure in the majority of patients. However, diagnosis commonly relies on cytomorphology and genetic confirmation of the pathognomonic t(15;17). Yet, the latter is more time consuming and in some regions unavailable.Entities:
Keywords: Acute myeloid leukemia; Acute promyelocytic leukemia; Artificial intelligence; Bone marrow smear; Deep learning
Mesh:
Year: 2022 PMID: 35193533 PMCID: PMC8864866 DOI: 10.1186/s12885-022-09307-8
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Fig. 1Workflow of the multi-step deep learning model for APL recognition. We identified patients with APL, non-APL AML and healthy bone marrow donors by retrospective chart review. Representative images of bone marrow smears (BMS) were labeled according to diagnosis. After image preprocessing, transformation and augmentation, initial cell border proposals were given by the Faster Region-based Convolutional Neural Net (FRCNN) that were manually corrected on an online segmentation and annotation platform based on the VGG image annotator tool. The FRCNN was trained iteratively to improve cell border proposals. Segmented cells were manually labeled according to cell type (myeloblasts, promyelocytes) and Auer rods. Convolutional neural nets were then implemented on the automatically segmented cells for binary classification of individual cell types and features. Their output was used to train an ensemble neural net for the binary classification between APL and non-APL AML or APL and healthy bone marrow donor samples
Patient characteristics
| parameter | non-APL AML | APL | Bone marrow donors |
|---|---|---|---|
| N | 1 048 | 58 | 236 |
| Age, median (IQR) | 57 (49–67) | 50.5 (42.25–58) | 31 (25–39) |
| Male | 45.5 | 45.7 | 70 |
| Female | 54.5 | 54.3 | 30 |
| de novo | 77.5 | 90.5 | / |
| sAML | 13.6 | 0 | / |
| tAML | 8.9 | 9.5 | / |
| Favorable | 34.1 | / | / |
| intermediate | 44.7 | / | / |
| adverse | 21.2 | / | / |
| WBC in GPt/l, median (IQR) | 14 (2.7–45.2) | 1.3 (0.7–6.4) | / |
| Hb in mmol/l, median (IQR) | 5.8 (5.0–6.8) | 6.3 (5.3–6.9) | / |
| Hb in g/dl, median (IQR) | 9.3 (8.1–11.0) | 10.1 (8.5–11.1) | / |
| Plt in GPt/l, median (IQR) | 56 (31–103) | 27 (18–57) | / |
| PB blasts, median (IQR) | 27 (6–62) | 14.75 (1.3–63) | / |
| BM blasts, median (IQR) | 63.5 (41.5–80) | 63 (53.5–76) | / |
Patient characteristics of non-APL AML, APL and control (bone marrow donors) groups. AML type was defined according to the WHO 2016 classification
sAML Secondary AML, tAML Therapy-associated AML, WBC white blood cell count, Hb Hemoglobin, Plt Platelet count, PB Peripheral blood, BM Bone marrow, N Number, IQR interquartile range
Fig. 2Examples of automated segmentation and occlusion sensitivity mapping. A Faster Recurrent Neural Network (FRCNN) was used for cell segmentation. First, it was trained by human example and after iterative learning, automated cell detection was performed (A). Segmented cells show a yellow elliptic border. With respect to explainable artificial intelligence, we used occlusion sensitivity mapping to retrace the decision-making process of the convolutional neural nets in image-level recognition (B). In occlusion sensitivity mapping, parts of the image are iteratively blocked from evaluation by the neural network and performance is measured. If the blocked part of the image is highly important for correct classifications, performance will drop accordingly. This process is iteratively repeated for the entire image. The result can be visualized in the sense that highly important image areas are highlighted (yellow/green) while less important or negligible areas are shaded (blue/purple)
Fig. 3Performance of convolutional neural nets for binary cell type classifications. Since end-to-end image-level classification did not show satisfactory results in preliminary testing, we used cell-level recognition with convolutional neural nets as a proxy. Relevant cell types and features for the distinction between non-APL AML, APL and healthy bone marrow, i. e. myeloblasts, promyelocytes, and Auer rods, were labeled manually and CNNs were trained. The performance of individual CNNs for the detection of myeloblasts (A), promyelocytes (B), and Auer rods (C) on the respective testing sets (that were rigorously withheld from training) is displayed as area under the receiver operating curve (AUROC) using three-fold cross-validation (cv 0, 1, 2 illustrated in light blue, orange, and green). std. dev. – standard deviation of the mean; TPR – true positive rate; FPR – false positive rate
Fig. 4Performance of the ensemble neural net for APL image-level recognition. Performance metrics for the binary classification of APL vs. healthy bone marrow donor samples (top row) and APL vs. non-APL AML samples (bottom row) were calculated as areas under the curve for precision-recall curves (A, C) and the receiver operating characteristic (B, D) using threefold cross-validation (cv 0, 1, 2 illustrated in light blue, orange, and green) and averaging results (Macro avg, dark blue). Calculations were performed in Python. std. dev. – standard deviation of the mean; TPR – true positive rate; FPR – false positive rate