| Literature DB >> 32599959 |
Valentina Gaidano1,2, Valerio Tenace3, Nathalie Santoro4, Silvia Varvello5, Alessandro Cignetti5, Giuseppina Prato6, Giuseppe Saglio1,5, Giovanni De Rosa4, Massimo Geuna4.
Abstract
The immunophenotype is a key element to classify B-cell Non-Hodgkin Lymphomas (B-NHL); while it is routinely obtained through immunohistochemistry, the use of flow cytometry (FC) could bear several advantages. However, few FC laboratories can rely on a long-standing practical experience, and the literature in support is still limited; as a result, the use of FC is generally restricted to the analysis of lymphomas with bone marrow or peripheral blood involvement. In this work, we applied machine learning to our database of 1465 B-NHL samples from different sources, building four artificial predictive systems which could classify B-NHL in up to nine of the most common clinico-pathological entities. Our best model shows an overall accuracy of 92.68%, a mean sensitivity of 88.54% and a mean specificity of 98.77%. Beyond the clinical applicability, our models demonstrate (i) the strong discriminatory power of MIB1 and Bcl2, whose integration in the predictive model significantly increased the performance of the algorithm; (ii) the potential usefulness of some non-canonical markers in categorizing B-NHL; and (iii) that FC markers should not be described as strictly positive or negative according to fixed thresholds, but they rather correlate with different B-NHL depending on their level of expression.Entities:
Keywords: artificial intelligence; classification; flow cytometry; lymphoma; machine learning; non-hodgkin
Year: 2020 PMID: 32599959 PMCID: PMC7352227 DOI: 10.3390/cancers12061684
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Classification trees of Model I (a), Model II (b), Model III (c), and Model IV (d). Values reported inside decision nodes represent threshold values. Percentage values reported in leaf nodes represent the prediction probability, i.e., how likely the achieved class is the correct one.
Figure 2Confusion matrices and corresponding overall accuracies for the classifiers associated to Model I (a), Model II (b), Model III (c), and Model IV (d).
Accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) for each B-NHL subclass for Model I to IV. All values are percentages.
| B-NHL | Accuracy | Sensitivity | Specificity | PPV | NPV |
|---|---|---|---|---|---|
|
| |||||
|
| 99.45 | 66.67 | 99.73 | 66.67 | 99.73 |
|
| 95.90 | 97.01 | 94.96 | 94.21 | 97.42 |
|
| 96.18 | 80.00 | 99.04 | 93.62 | 96.56 |
|
| 96.45 | 86.00 | 98.10 | 87.76 | 97.79 |
|
| 99.45 | 83.32 | 99.71 | 83.32 | 99.71 |
|
| 96.45 | 46.67 | 98.57 | 58.32 | 97.75 |
|
| 97.81 | 85.70 | 98.54 | 78.26 | 99.12 |
|
| 94.28 | 84.09 | 95.67 | 72.54 | 97.78 |
|
| 99.45 | 60.00 | 100.00 | 100.00 | 99.45 |
|
| |||||
|
| 99.43 | 75.00 | 99.71 | 75.00 | 99.71 |
|
| 98.87 | 98.20 | 99.46 | 99.39 | 98.42 |
|
| 95.76 | 87.26 | 97.32 | 85.70 | 97.65 |
|
| 95.76 | 82.00 | 98.03 | 87.23 | 97.07 |
|
| 98.87 | 86.67 | 99.40 | 86.67 | 99.40 |
|
| 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
|
| 96.62 | 90.70 | 97.43 | 82.98 | 98.70 |
|
| |||||
|
| 99.15 | 50.00 | 99.71 | 66.67 | 99.43 |
|
| 96.06 | 96.40 | 95.73 | 95.26 | 96.76 |
|
| 95.76 | 81.81 | 98.32 | 90.00 | 96.71 |
|
| 96.34 | 88.00 | 97.70 | 86.26 | 98.03 |
|
| 98.03 | 53.32 | 100.00 | 100.00 | 97.98 |
|
| 98.03 | 95.23 | 98.20 | 76.92 | 99.70 |
|
| 95.20 | 86.04 | 96.46 | 77.07 | 98.04 |
|
| |||||
|
| 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
|
| 99.26 | 94.12 | 100.00 | 100.00 | 99.17 |
|
| 92.70 | 89.79 | 94.31 | 89.79 | 94.31 |
|
| 94.89 | 92.31 | 95.92 | 90.00 | 96.90 |
|
| 98.54 | 100.00 | 98.51 | 50.00 | 100.00 |
|
| 98.54 | 87.50 | 99.21 | 87.50 | 99.21 |
|
| 91.23 | 65.00 | 95.73 | 72.21 | 94.12 |
BL—Burkitt Lymphoma, B-NH—B-cell Non-Hodgkin Lymphomas, CLL—Chronic Lymphocytic Leukemia, DLBCL—Diffuse Large B-cell Lymphoma, FCL—Follicular Cell Lymphoma, HCL—Hairy Cell Leukemia, LPL—Lymphoplasmacytic Lymphoma, MCL—Mantle Cell Lymphoma, MZL—Marginal Zone Lymphoma, NPV—Negative Predictive Value, PPV—Positive Predictive Value, SL—Splenic Lymphoma.
Figure 3Top 10 markers in order of importance selected by Model I (a), Model II (b), Model III (c), and Model IV (d).
Figure 4Distribution of CD5 (a), MIB1(b), Bcl2 (c) and CS (d) in all the samples of the database, versus B-NHL subclass. HCL and SL samples are not represented in MIB1 and Bcl2 plots because samples from these two classes have not been consistently evaluated for these two markers. Each box has a central mark that indicates the median of the distribution, whereas bottom and top edges of the box indicate the 25th and 75th percentiles, respectively. The whiskers extend to the most extreme non-outliers data points. Circle markers indicate outliers. CS stands for cell size.
Figure 5List of surface markers associated with the percentage of samples characterized for each marker. The blue dotted line indicates the threshold (50%) above which markers were considered.
Distribution of samples in B-NHL categories. Samples are furtherly divided according to their origin: blood samples (PB/BM) and non-blood samples (lymph nodes and tissue biopsies or FNA, pleural and peritoneal effusions, liquor and bronchoalveolar lavage).
| B-NHL Category | Total Samples | Blood Samples | Non-Blood Samples |
|---|---|---|---|
| CLL | 670 | 602 | 68 |
| FCL | 199 | 43 | 156 |
| SL | 19 | 17 | 2 |
| MZL | 174 | 94 | 80 |
| DLBCL | 220 | 25 | 195 |
| LPL | 60 | 53 | 7 |
| MCL | 83 | 51 | 32 |
| HCL | 26 | 26 | 0 |
| BL | 14 | 4 | 10 |