| Literature DB >> 34067726 |
Georg Steinbuss1,2, Mark Kriegsmann2,3, Christiane Zgorzelski2, Alexander Brobeil2, Benjamin Goeppert2, Sascha Dietrich1, Gunhild Mechtersheimer2, Katharina Kriegsmann1.
Abstract
The diagnosis and the subtyping of non-Hodgkin lymphoma (NHL) are challenging and require expert knowledge, great experience, thorough morphological analysis, and often additional expensive immunohistological and molecular methods. As these requirements are not always available, supplemental methods supporting morphological-based decision making and potentially entity subtyping are required. Deep learning methods have been shown to classify histopathological images with high accuracy, but data on NHL subtyping are limited. After annotation of histopathological whole-slide images and image patch extraction, we trained and optimized an EfficientNet convolutional neuronal network algorithm on 84,139 image patches from 629 patients and evaluated its potential to classify tumor-free reference lymph nodes, nodal small lymphocytic lymphoma/chronic lymphocytic leukemia, and nodal diffuse large B-cell lymphoma. The optimized algorithm achieved an accuracy of 95.56% on an independent test set including 16,960 image patches from 125 patients after the application of quality controls. Automatic classification of NHL is possible with high accuracy using deep learning on histopathological images and routine diagnostic applications should be pursued.Entities:
Keywords: CLL/SLL; CNN; DLBCL; artificial intelligence; deep learning; histopathology
Year: 2021 PMID: 34067726 PMCID: PMC8156071 DOI: 10.3390/cancers13102419
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Tumor annotation and generation of image patches. Representative tissue microarray core of a diffuse large B-cell lymphoma without (A) and with annotation (B)—yellow outline, as well as after image patche creation (C)—red squares. The image patches were subsequently saved as .png files. Tile numbers (Tile 1, Tile 2, etc.) are shown in a gray box within each red square in this example. Scale bars (black line): 100 µm.
Figure 2Examples of image patches from annotated areas. Representative image patches from control lymph nodes (A), small lymphocytic lymphoma/chronic lymphocytic leukemia (B), and diffuse large B-cell lymphoma (C) are shown. Magnification: each image 100 × 100 µm (395 × 395 px).
Number of extracted image patches per group.
| Group | SLL/CLL | DLBCL | LN Lung | LN Colon | LN Pancreas |
|---|---|---|---|---|---|
| Total cases, | 129 | 119 | 64 | 230 | 87 |
| Training set, ~60% of cases | |||||
| Cases, | 78 | 80 | 34 | 134 | 52 |
| Image patches, | |||||
| Total | 11,404 | 8625 | 5064 | 18,488 | 7004 |
| Minimum | 4 | 7 | 33 | 4 | 2 |
| Maximum | 231 | 278 | 238 | 222 | 245 |
| Mean | 146 | 108 | 149 | 138 | 135 |
| Median | 149 | 99.5 | 150 | 142 | 139 |
| Validation set, ~20% of cases | |||||
| Cases, | 22 | 18 | 15 | 57 | 14 |
| Image patches, | |||||
| Total | 3086 | 1815 | 2436 | 7870 | 1387 |
| Minimum | 3 | 13 | 115 | 24 | 18 |
| Maximum | 214 | 329 | 251 | 255 | 242 |
| Mean | 140 | 101 | 162 | 138 | 99 |
| Median | 146.5 | 88 | 156 | 140 | 80.5 |
| Test set, ~20% of cases | |||||
| Cases, | 29 | 21 | 15 | 39 | 21 |
| Image patches, | |||||
| Total | 4631 | 2236 | 2393 | 4966 | 2734 |
| Minimum | 18 | 22 | 105 | 17 | 48 |
| Maximum | 226 | 225 | 265 | 184 | 237 |
| Mean | 160 | 106 | 160 | 127 | 130 |
| Median | 189 | 103 | 162 | 132 | 131 |
DLBCL: diffuse large B-cell-lymphoma, LN: lymph node, SLL/CLL: small lymphocytic lymphoma/chronic lymphatic leukemia.
Figure 3Training and validation accuracy of the model with the highest validation accuracy as per EfficientNet.
Figure 4Confusion matrix of the best-performing model in terms of the test data at the patch (left) and case level (right). The lower panels exhibit the balanced accuracy (BACC). CLL: chronic lymphocytic leukemia, DLBCL: diffuse large B-cell lymphoma, LN: lymph node.
Balanced accuracy (BACC) given different case quality control (CQC) thresholds and patch quality control thresholds (PQC).
| CQC Threshold | 50% | 60% | 70% | 80% | 90% | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| PQC | PQC not | BACC (%) | CQC not met (%) | BACC (%) | CQC not met (%) | BACC (%) | CQC not met (%) | BACC (%) | CQC not met (%) | BACC (%) | CQC not met (%) |
|
| 0.54 | 84.48 | 0.8 | 88.61 | 5.6 | 89.55 | 8 | 94.12 | 17.6 | 95.56 | 24.8 |
|
| 3.44 | 83.74 | 0 | 87.67 | 4.8 | 89.55 | 8 | 94.12 | 16.8 | 93.75 | 22.4 |
|
| 6.43 | 85.25 | 0.8 | 85.99 | 3.2 | 89.64 | 7.2 | 94.74 | 15.2 | 93.75 | 21.6 |
|
| 10.05 | 85.32 | 0 | 86.79 | 4 | 89.64 | 7.2 | 92.42 | 11.2 | 93.75 | 20.8 |
|
| 15.74 | 85.25 | 0.8 | 85.24 | 2.4 | 89.64 | 6.4 | 91.3 | 9.6 | 93.75 | 18.4 |
Figure 5SmoothGrad heatmaps of exemplary patches that were classified correctly. For each class, the upper plot shows the original image patch while the lower plot shows the patch overlaid with the SmoothGrad heatmap with respect to the class of the patch. High SmoothGrad activity scores can be seen in areas overlaid with single cells, as well as in lung LN with extracellular anthracosis. This confirms that the algorithm classified the image patches on the basis of cellular and extracellular morphological structures. DLBCL: diffuse large B-cell-lymphoma, LN: lymph node, SLL/CLL: small lymphocytic lympho-ma/chronic lymphatic leukemia. Magnification: each image 100 × 100 µm (395 × 395 px).