| Literature DB >> 31185999 |
Yocanxóchitl Perfecto-Avalos1, Alejandro Garcia-Gonzalez2, Ana Hernandez-Reynoso3, Gildardo Sánchez-Ante4, Carlos Ortiz-Hidalgo5, Sean-Patrick Scott2, Rita Q Fuentes-Aguilar1, Ricardo Diaz-Dominguez2, Grettel León-Martínez6, Verónica Velasco-Vales7, Mara A Cárdenas-Escudero8, José A Hernández-Hernández2, Arturo Santos2, José R Borbolla-Escoboza9, Luis Villela10,11.
Abstract
BACKGROUND: Diffuse large B-cell lymphoma (DLBCL) is classified into germinal center-like (GCB) and non-germinal center-like (non-GCB) cell-of-origin groups, entities driven by different oncogenic pathways with different clinical outcomes. DLBCL classification by immunohistochemistry (IHC)-based decision tree algorithms is a simpler reported technique than gene expression profiling (GEP). There is a significant discrepancy between IHC-decision tree algorithms when they are compared to GEP.Entities:
Keywords: COO identification; DLBCL; IHC algorithm; Linear discriminant analysis; Machine learning
Mesh:
Year: 2019 PMID: 31185999 PMCID: PMC6560900 DOI: 10.1186/s12967-019-1951-y
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Performance metrics of classification of IHC-decision tree algorithms and LDA
| Algorithm | Antibody combination | Acc | Sens | Spec | PPV | NPV | LR+ | LR− | |
|---|---|---|---|---|---|---|---|---|---|
| IHC-decision trees | Nyman | 3,5 | 0.72 | 0.52 | 0.91 | 0.84 | 0.67 | 5.56 | 0.53 |
| Colomo | 1,2,5 | 0.78 | 0.71 | 0.84 | 0.81 | 0.75 | 4.56 | 0.34 | |
| Hans | 1,2,5 | 0.85 | 0.91 | 0.78 | 0.80 | 0.91 | 4.21 | 0.11 | |
| Hans* | 1,5 | 0.82 | 0.94 | 0.70 | 0.75 | 0.92 | 3.14 | 0.09 | |
| Choi | 1,2,3,4,5 | 0.88 | 0.94 | 0.84 | 0.84 | 0.93 | 5.70 | 0.08 | |
| Choi* | 1,3,4,5 | 0.79 | 0.74 | 0.83 | 0.80 | 0.77 | 4.30 | 0.31 | |
| VY3 | 1,2,3 | 0.88 | 0.92 | 0.84 | 0.85 | 0.92 | 5.92 | 0.09 | |
| VY4 | 1,2,3,4 | 0.88 | 0.93 | 0.84 | 0.85 | 0.92 | 5.80 | 0.09 | |
| Linear discriminant analysis | As in Hans* | 1,5 | 0.84 | 0.77 | 0.91 | 0.89 | 0.81 | 8.59 | 0.25 |
| As in Nyman | 3,5 | 0.77 | 0.81 | 0.74 | 0.75 | 0.81 | 3.10 | 0.25 | |
| As in VY3 | 1,2,3 | 0.89 | 0.87 | 0.91 | 0.90 | 0.88 | 9.19 | 0.15 | |
| As in Hans/Colomo | 1,2,5 | 0.87 | 0.86 | 0.88 | 0.87 | 0.87 | 7.25 | 0.16 | |
| – | 1,4,5 | 0.87 | 0.81 | 0.92 | 0.90 | 0.84 | 9.93 | 0.20 | |
| As in VY4 | 1,2,3,4 | 0.87 | 0.84 | 0.90 | 0.89 | 0.86 | 8.24 | 0.17 | |
| As in Choi* | 1,3,4,5 | 0.88 | 0.86 | 0.91 | 0.90 | 0.87 | 9.09 | 0.16 | |
| As in Choi | 1,2,3,4,5 | 0.89 | 0.87 | 0.91 | 0.90 | 0.88 | 9.23 | 0.14 |
The upper section corresponds to the performance of the IHC-decision tree algorithms. Lower section corresponds to equivalent combinations of antibodies, but with LDA classification, this includes the rest of combinations not reported by IHC-decision tree algorithms. Choi, VY3, and VY4 algorithms reached the most considerable accuracy, representing the most balanced options of sensibility and specificity, with similar performance metrics
Numeric tags 1 = CD10, 2 = BCL6, 3 = FOXP1, 4 = GCTE1, and 5 = MUM1
Acc: accuracy; Sens: sensitivity; Spec: specificity; PPV: positive predictive value; NPV: negative predictive values; LR+: likelihood ratio for positive test results; LR−: likelihood ratio for negative test result
Coefficients of linear discriminant functions (LDF) derived from LDA for all possible combination of antibodies
| Antibody combination | Sens | Spec | COO | Constant | Antibody | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | ||||||
| CD10 | BCL6 | FOXP1 | GCET1 | MUM1 | ||||||
| As in Nyman | 3,5 | 0.81 | 0.74 | GCB | − 0.57 | 2.47 | 1.11 | |||
| Non-GCB | − 3.29 | 4.38 | 5.06 | |||||||
| As in Hans and | 1,2,5 | 0.86 | 0.88 | GCB | − 4.21 | 5.01 | 7.00 | 1.05 | ||
| As in Colomo | Non-GCB | − 3.09 | 0.20 | 4.99 | 5.69 | |||||
| As in Hans* | 1,5 | 0.77 | 0.91 | GCB | − 2.29 | 6.53 | 2.11 | |||
| Non-GCB | − 2.12 | 1.28 | 6.45 | |||||||
| As in Choi | 1,2,3,4,5 | 0.87 | 0.91 | GCB | − 4.80 | 4.59 | 6.31 | 0.71 | 3.28 | 1.02 |
| Non-GCB | − 3.98 | − 0.41 | 3.80 | 3.75 | 1.24 | 4.73 | ||||
| As in Choi* | 1,3,4,5 | 0.86 | 0.91 | GCB | − 3.34 | 5.67 | 1.78 | 4.03 | 1.67 | |
| Non-GCB | − 3.46 | 0.24 | 4.39 | 1.69 | 5.12 | |||||
| As in VY3 | 1,2,3 | 0.87 | 0.91 | GCB | − 4.18 | 4.86 | 6.99 | 0.64 | ||
| Non-GCB | − 2.90 | − 0.74 | 4.58 | 4.61 | ||||||
| As in VY4 | 1,2,3,4 | 0.84 | 0.90 | GCB | − 4.75 | 4.49 | 6.44 | 0.91 | 3.26 | |
| Non-GCB | − 2.97 | − 0.86 | 4.40 | 4.70 | 1.12 | |||||
| – | 1,4,5 | 0.81 | 0.92 | GCB | − 3.14 | 6.01 | 3.93 | 2.22 | ||
| Non-GCB | − 2.23 | 1.09 | 1.44 | 6.49 | ||||||
Columns (Antibody) show the coefficient associated with each antibody for GCB classification (Sensibility performance) and non-GCB classification (Specificity performance). The first column remarks the combinations of antibodies as were used by IHC-decision trees. BCL6 and CD10 in the LDF are strongly related to the classification of GCB cases, whereas the inclusion of MUM1 in any algorithm is related to non-GCB detection, getting the most considerable value in the LDF
Numeric Tags 1 = CD10, 2 = BCL6, 3 = FOXP1, 4 = GCTE1, and 5 = MUM1
Metrics of IHC-decision tree and machine learning algorithms
| Algorithm | Antibody combination | Acc | Sens | Spec | PPV | NPV | LR+ | LR− | |
|---|---|---|---|---|---|---|---|---|---|
| IHC-decision tree | Nyman | 3,5 | 0.79 | 0.65 | 0.95 | 0.93 | 0.71 | 12.47 | 0.37 |
| Colomo | 1,2,5 | 0.84 | 0.77 | 0.91 | 0.91 | 0.79 | 8.98 | 0.25 | |
| Hans | 1,2,5 | 0.89 | 0.95 | 0.83 | 0.86 | 0.94 | 5.52 | 0.06 | |
| Hans* | 1,5 | 0.86 | 0.95 | 0.76 | 0.81 | 0.94 | 3.94 | 0.06 | |
| Choi | 1,2,3,4,5 | 0.93 | 1.00 | 0.84 | 0.87 | 1.00 | 6.44 | 0.00 | |
| Choi* | 1,3,4,5 | 0.83 | 0.79 | 0.86 | 0.86 | 0.79 | 5.73 | 0.24 | |
| VY3 | 1,2,3 | 0.90 | 0.97 | 0.83 | 0.86 | 0.96 | 5.61 | 0.04 | |
| VY4 | 1,2,3,4 | 0.90 | 0.97 | 0.83 | 0.86 | 0.96 | 5.61 | 0.04 | |
| Machine learning | PV | 1,3,4,5 | 0.94 | 0.95 | 0.93 | 0.94 | 0.95 | 13.8 | 0.05 |
| ANN | 1,2,3,4,5 | 0.94 | 0.95 | 0.93 | 0.94 | 0.95 | 13.8 | 0.05 | |
| BS | 1,2,3,4,5 | 0.94 | 0.95 | 0.93 | 0.94 | 0.95 | 13.8 | 0.05 | |
| SVM | 1,2,3,4,5 | 0.94 | 0.97 | 0.91 | 0.92 | 0.96 | 11.23 | 0.04 | |
| SVM | 1,2,3,4 | 0.94 | 0.97 | 0.91 | 0.92 | 0.96 | 11.23 | 0.04 |
Metrics correspondent to eight IHC-decision tree algorithms and the best five machine learning algorithms are shown, cases of the VY subset were classified. Numeric Tags 1= CD10, 2 = BCL6, 3 = FOXP1, 4 = GCTE1, and 5 = MUM1. IHC-decision tree algorithms could not overcome any of the remarkable metrics obtained for the best five machine learning algorithms
Acc: accuracy; Sens: sensitivity; Spec: specificity; PPV: positive predictive value; NPV: negative predictive values; LR+: likelihood ratio for positive test results; LR−: likelihood ratio for negative test result; PV: Perfecto–Villela; ANN: artificial neural networks; BS: Bayesian simple; SVM: support vector machine
Fig. 1Performance and agreement comparison of IHC-decision tree and machine learning algorithms. a Accuracy ranking. Machine learning (gray bars) and IHC algorithms (white bars) were ordered by accuracy. PV, ANN (1,2,3,4,5), BS (1,2,3,4,5), SVM (1,2,3,4,5), and SVM (1,2,3,4) algorithms showed the highest accuracy, whereas Hans ranked in 21th place. b ROC space. Machine learning algorithms (gray markers), particularly PV (blue marker), ANN (1,2,3,4,5), BS (1,2,3,4,5), SVM (1,2,3,4,5), and SVM (1,2,3,4) allocated in the far left-hand side of the graph, suggesting a better performance when compared with IHC-decision tree algorithms (white markers). Nyman, Colomo and Choi* showed a more conservative performance. c Agreement heatmap. Scale represents moderate (0.41 ≤ κ ≤ 0.60), good (0.61 ≤ κ ≤ 0.80), to very good agreement (κ > 0.81) with red, black and green, respectively. Machine learning algorithms provided an almost perfect agreement with GEP, being ANN (1,2,3,4,5), BS (1,2,3,4,5), PV, SVM (1,2,3,4,5), and SVM (1,2,3,4) with the highest values (all with κ = 0.88, P < 0.001). A very good agreement within machine learning algorithms was observed (κ: 0.77–1.00). The concordance between IHC-decision tree algorithms was from moderate to good (κ: 0.41–0.79), except for Choi having a very good agreement with both VY3 and VY4 (κ = 0.95, P < 0.001). Numeric Tags 1 = CD10, 2 = BCL6, 3 = FOXP1, 4 = GCTE1, and 5 = MUM1. PV: Perfecto–Villela; B: Bayesian; BS: Bayesian simple; BN: Naïve Bayesian; ANN: artificial neural networks; SVM: support vector machines
Fig. 2Survival analysis comparison of IHC-decision tree and machine learning algorithms. Overall survival of patients classified by Hans and the best five machine learning algorithms is shown. GCB (black) overall survival was significatively better than non-GCB (grey) cases for both VY subset and clinical sample set, except for Hans when used in clinical sample set. Numeric Tags 1 = CD10, 2 = BCL6, 3 = FOXP1, 4 = GCTE1, and 5 = MUM1. PV: Perfecto–Villela; BS: Bayesian simple; ANN: artificial neural networks; SVM: support vector machines