| Literature DB >> 35566407 |
Jan Kulis1, Łukasz Wawrowski2, Łukasz Sędek3, Łukasz Wróbel4, Łukasz Słota1, Vincent H J van der Velden5, Tomasz Szczepański1, Marek Sikora4.
Abstract
Flow cytometry technique (FC) is a standard diagnostic tool for diagnostics of B-cell precursor acute lymphoblastic leukemia (BCP-ALL) assessing the immunophenotype of blast cells. BCP-ALL is often associated with underlying genetic aberrations, that have evidenced prognostic significance and can impact the disease outcome. Since the determination of patient prognosis is already important at the initial phase of BCP-ALL diagnostics, we aimed to reveal specific genetic aberrations by finding specific multiple antigen expression patterns with FC immunophenotyping. The FC immunophenotype data were analysed using machine learning methods (gradient boosting, decision trees, classification rules). The obtained results were verified with the use of repeated cross-validation. The t(12;21)/ETV6-RUNX1 aberration occurs more often when blasts present high expression of CD10, CD38, low CD34, CD45 and specific low expression of CD81. The t(v;11q23)/KMT2A is associated with positive NG2 expression and low CD10, CD34, TdT and CD24. Hyperdiploidy is associated with CD123, CD66c and CD34 expression on blast cells. In turn, high expression of CD81, low expression of CD45, CD22 and lack of CD123 and NG2 indicates that none of the studied aberrations is present. Detecting aberrations in pediatric BCP-ALL, based on the expression of multiple markers, can be done with decent efficiency.Entities:
Keywords: acute lymphoblastic leukaemia; classification; cytogenetics; flow cytometry; knowledge discovery; machine learning
Year: 2022 PMID: 35566407 PMCID: PMC9100578 DOI: 10.3390/jcm11092281
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.964
Figure 1Example of 2–dimensional visualisation illustrating good separation of patients with different genetic aberrations based on phenotypic features.
Figure 2Example of 2–dimensional visualisation illustrating poor separation of patients with different genetic aberrations based on phenotypic features.
Descriptive statistics of antigen expression.
| Attribute | Min. | Median | Max. | Attribute | Min. | Median | Max. |
|---|---|---|---|---|---|---|---|
| NG2 | 0 | 0 | 1 | CD81 | 0 | 3 | 25 |
| CD45 | 0 | 2 | 10 | cyIgM | 1 | 2 | 20 |
| CD34 | 0 | 4 | 51 | CD123 | 0 | 0 | 9 |
| CD10 | 0 | 27 | 164 | CD22 | 0 | 2 | 14 |
| CD20 | 0 | 0 | 15 | CD66c | 0 | 1 | 87 |
| CD38 | 0 | 2 | 30 | CD33 | 0 | 0 | 2 |
| CD24 | 0 | 22 | 151 | CD9 | 0 | 3 | 58 |
| TdT | 1 | 4 | 29 | CD13 | 0 | 2 | 37 |
| CD15 + CD65 | 0 | 0 | 3 |
Analysed data sets characteristics—decision class distribution.
| Decision Problem | Primary Class (1) | sec. Class (0) | Examples |
|---|---|---|---|
| t(12;21)/ETV6-RUNX1 | 52 | 422 | 474 |
| t(x;11q23)/KMT2A | 17 | 588 | 695 |
| hyperdiploidy | 124 | 378 | 502 |
| no aberration | 257 | 190 | 447 |
| t(12;21) vs. hyperdip. | 49 | 121 | 170 |
Figure 3Descriptive and predictive analysis workflow.
Figure 4Receiver operating characteristic analysis for t(12;21)/ETV6-RUNX1 and t(x;11q23)/KMT2A prediction.
Classification efficiency (%).
| Decision Problem | Sensitivity | Specificity | PPV | NPV |
|---|---|---|---|---|
| t(12;21)/ETV6-RUNX1 | 68.1 (5.7) | 88.4 (3.1) | 42.8 (4,7) | 95.8 (6.2) |
| t(v;11q23)/KMT2A | 59.4 (7.5) | 98.9 (1.1) | 66.0 (15.4) | 98.8 (0.2) |
| hyperdiploidy | 78.5 (4.1) | 81.4 (2.3) | 58.1 (1.8) | 92.1 (1.2) |
| no aberration | 92.6 (1.7) | 41.4 (4.8) | 68.2 (1.3) | 80.1 (2.2) |
| t(12;21) vs. hyperdip. | 81.4 (5.2) | 91.6 (4.3) | 81.5 (6.7) | 92.1 (1.9) |
Figure 5F1 values for classifiers with increasing number of features (mean +/− standard deviation). Each successive classifier uses one more feature than the previous one (i.e., classifier 5 uses all the feature of classifier 4 and the feature ranked 5th in the feature importance ranking).
Classification efficiency of RPART algorithm (%).
| Decision Problem | Sensitivity | Specificity | PPV | NPV |
|---|---|---|---|---|
| t(12;21)/ETV6-RUNX1 | 73 | 99 | 95 | 97 |
| t(v;11q23)/KMT2A | 71 | 100 | 100 | 99 |
| hyperdiploidy | 86 | 96 | 86 | 96 |
| no abberation | 91 | 89 | 92 | 88 |
| t(12;21) vs. hyperdip. | 81 | 98 | 95 | 92 |
Figure 6Odds ratio values for the for the premise of the positive rules with successive elementary conditions added.