| Literature DB >> 34888226 |
Luca Pasquini1,2, Antonio Napolitano3, Martina Lucignani3, Emanuela Tagliente3, Francesco Dellepiane2, Maria Camilla Rossi-Espagnet2,4, Matteo Ritrovato5, Antonello Vidiri6, Veronica Villani7, Giulio Ranazzi8, Antonella Stoppacciaro8, Andrea Romano2, Alberto Di Napoli2,9, Alessandro Bozzao2.
Abstract
Radiomic models outperform clinical data for outcome prediction in high-grade gliomas (HGG). However, lack of parameter standardization limits clinical applications. Many machine learning (ML) radiomic models employ single classifiers rather than ensemble learning, which is known to boost performance, and comparative analyses are lacking in the literature. We aimed to compare ML classifiers to predict clinically relevant tasks for HGG: overall survival (OS), isocitrate dehydrogenase (IDH) mutation, O-6-methylguanine-DNA-methyltransferase (MGMT) promoter methylation, epidermal growth factor receptor vIII (EGFR) amplification, and Ki-67 expression, based on radiomic features from conventional and advanced magnetic resonance imaging (MRI). Our objective was to identify the best algorithm for each task. One hundred fifty-six adult patients with pathologic diagnosis of HGG were included. Three tumoral regions were manually segmented: contrast-enhancing tumor, necrosis, and non-enhancing tumor. Radiomic features were extracted with a custom version of Pyradiomics and selected through Boruta algorithm. A Grid Search algorithm was applied when computing ten times K-fold cross-validation (K=10) to get the highest mean and lowest spread of accuracy. Model performance was assessed as AUC-ROC curve mean values with 95% confidence intervals (CI). Extreme Gradient Boosting (xGB) obtained highest accuracy for OS (74,5%), Adaboost (AB) for IDH mutation (87.5%), MGMT methylation (70,8%), Ki-67 expression (86%), and EGFR amplification (81%). Ensemble classifiers showed the best performance across tasks. High-scoring radiomic features shed light on possible correlations between MRI and tumor histology.Entities:
Keywords: genetics; glioblastoma; high-grade glioma (HGG); machine learning; radiomics; survival
Year: 2021 PMID: 34888226 PMCID: PMC8649764 DOI: 10.3389/fonc.2021.601425
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Radiomic workflow followed in the present study.
Figure 2Machine learning classifiers tested in the present study. Non-ensemble learners included KNeighbors, logistic regressor, and decision tree. Ensemble learners included boosting, stacking, and bagging classifiers.
Number of patients and label distributions for label-sequence combination.
| ADC | FLAIR | MPRAGE | rCBV | T1 | T2 | |
|---|---|---|---|---|---|---|
| SURV12 (0/1) | 134 (65/69) | 140 (68/72) | 138 (66/72) | 93 (45/48) | 122 (61/61) | 122 (60/62) |
| MGMT (0/1) | 110 (41/69) | 115 (43/72) | 114 (42/72) | 80 (33/47) | 100 (39/61) | 102 (39/63) |
| IDH (0/1) | 86 (71/15) | 89 (74/15) | 89 (74/15) | 60 (51/9) | 77 (63/14) | 78 (65/13) |
| KI67 (0/1) | 100 (18/82) | 106 (21/85) | 103 (22/81) | 77 (16/61) | 97 (17/80) | 94 (16/78) |
| EGFR (0/1) | 65 (21/44) | 69 (23/46) | 66 (23/43) | 49 (16/33) | 65 (22/43) | 62 (20/42) |
Surv12 best results (reported as mean ± standard deviation).
| ROI | SEQ | xGB | GB | RF | LR | ST | KN | DT | AB | ST_ABC | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| NET | ADC | Acc% | 71,8 ± 10 | 68,8 ± 11,4 | 67,9 ± 6,5 | 46,3 ± 5,4 | 71 ± 9 | 61,2 ± 12,3 | 59,2 ± 11,7 | 73,6 ± 9,3 | 64,2 ± 12,6 |
| NET | ADC | Roc % | 71,8 ± 9,7 | 69,1 ± 11,1 | 67,9 ± 6,5 | 46,3 ± 5,4 | 71 ± 9 | 61,2 ± 12,3 | 59,2 ± 11,7 | 73,6 ± 9,3 | 64,2 ± 12,6 |
| NET | FLAIR | Acc % | 72,1 ± 13,7 | 67,4 ± 9,9 | 71,6 ± 8,4 | 62 ± 13,6 | 69 ± 12 | 54,3 ± 15 | 59 ± 13,7 | 68,9 ± 7 | 62,3 ± 14 |
| NET | FLAIR | Roc % | 72,4 ± 14 | 67 ± 11 | 72,1 ± 7,6 | 62,3 ± 13,7 | 69 ± 12,2 | 53,9 ± 14,8 | 58,8 ± 13 | 69,5 ± 7,7 | 59 ± 13 |
| NEC | T2 | Acc % | 74,5 ± 11 | 65,8 ± 12,6 | 67 ± 16,7 | 58,7 ± 14,3 | 73,6 ± 9 | 52,3 ± 15,2 | 60,7 ± 11,4 | 72,7 ± 9,5 | 58,1 ± 13,9 |
| NEC | T2 | Roc % | 74,2 ± 10,9 | 65 ± 11,2 | 66,4 ± 17 | 58,8 ± 14,4 | 73 ± 9,4 | 52 ± 14,9 | 59 ± 11 | 72,5 ± 9,6 | 56,3 ± 14,3 |
Figure 3Best ROC curves for Surv12 prediction: (A) AB classifier with ADC sequence on NET ROI; (B) xGB classifier with T2 sequence on NEC ROI; (C) xGB classifier with FLAIR sequence on NET ROI.
MGMT best results (reported as mean ± standard deviation).
| ROI | SEQ | xGB | GB | RF | LR | ST | KN | DT | AB | ST_ABC | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| CET | FLAIR | Acc % | 63,3 ± 11,3 | 68,1 ± 13,4 | 70.7 ± 9,3 | 65,5 ± 11,4,4 | 67,9 ± 15,7 | 52,2 ± 12,7 | 59,4 ± 14,4 | 70,8 ± 14,1 | 64,5 ± 15,7 |
| CET | FLAIR | Roc % | 62,8 ± 11,7 | 66,8 ± 13,4 | 63,4 ± 12,2 | 59 ± 10,6 | 67 ± 16,8 | 51,4 ± 13,3 | 55,5 ± 12,1 | 68,8 ± 14,6 | 62 ± 14,2 |
Figure 4Best ROC curve for MGMT prediction: AB classifier with FLAIR sequence on CET ROI.
IDH best results (reported as mean ± standard deviation).
| ROI | SEQ | xGB | GB | RF | LR | ST | KN | DT | AB | ST_ABC | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| NET | rCBV | Acc % | 83,5 ± 12,8 | 82,8 ± 12 | 76,2 ± 16,2 | 77,3 ± 14,4 | 86,7 ± 11,8 | 69,2 ± 17,5 | 78,7 ± 14,5 | 87,5 ± 11,9 | 82,8 ± 12,4 |
| NET | rCBV | Roc % | 83,2 ± 12,8 | 82 ± 13,5 | 78,3 ± 15,5 | 78 ± 14,7 | 85,8 ± 12,3 | 69 ± 18,3 | 78,3 ± 15 | 86,7 ± 12 | 82 ± 12,4 |
| NET | T1 | Acc % | 80,2 ± 14 | 81 ± 13,8 | 80 ± 12,5 | 68,7 ± 12 | 84,2 ± 15 | 66 ± 21 | 75,2 ± 13,7 | 85,9 ± 14 | 80,9 ± 12 |
| NET | T1 | Roc % | 79,4 ± 15 | 80,7 ± 15 | 78,2 ± 12,3 | 67,9 ± 11,4 | 83 ± 14,7 | 66,7 ± 21,2 | 76,3 ± 14,5 | 85,8 ± 14,9 | 80 ± 13 |
| CET | T2 | Acc % | 80,2 ± 14 | 81 ± 13,8 | 80 ± 12,5 | 68,7 ± 12 | 84,2 ± 15 | 66 ± 21 | 75,2 ± 13,7 | 85,9 ± 14 | 80,9 ± 12 |
| CET | T2 | Roc % | 79,4 ± 15 | 80,7 ± 15 | 78,2 ± 12,3 | 67,9 ± 11,4 | 83 ± 14,7 | 66,7 ± 21,2 | 76,3 ± 14,5 | 85,8 ± 14,9 | 80 ± 13 |
| NEC | T2 | Acc % | 77,4 ± 9,8 | 77,9 ± 11 | 79 ± 11 | 70,3 ± 12,5 | 79,2 ± 10,7 | 69,3 ± 14,3 | 75,8 ± 12,6 | 80,8 ± 10,2 | 79,5 ± 9,5 |
| NEC | T2 | Roc % | 76,6 ± 10 | 77 ± 10 | 78 ± 11,2 | 70,7 ± 12,6 | 78,9 ± 9,7 | 70 ± 14,9 | 77,5 ± 12,9 | 80,5 ± 10,6 | 78,4 ± 9 |
Figure 5Best ROC curves for IDH prediction: (A) AB classifier with rCBV sequence on NET ROI; (B) AB classifier with T2 sequence on CET ROI; (C) AB classifier with T2 sequence on NEC ROI; (D) ST classifier with T1 sequence on NET ROI.
KI67 best results (reported as mean ± standard deviation).
| ROI | SEQ | xGB | GB | RF | LR | ST | KN | DT | AB | ST_ABC | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| CET | ADC | Acc % | 82,3 ± 8,4 | 81,6 ± 9,7 | 83,9 ± 9,8 | 63,7 ± 13,6 | 82,6 ± 10,5 | 67,5 ± 10 | 76,5 ± 12 | 86 ± 10,6 | 83 ± 8,2 |
| CET | ADC | Roc % | 64,6 ± 15 | 64,5 ± 17,3 | 67,5 ± 18,9 | 50,8 ± 17,5 | 63,2 ± 17,8 | 60 ± 15,7 | 60 ± 19 | 70 ± 20 | 64,4 ± 17 |
Figure 6Best ROC curve for KI67 prediction: AB classifier with ADC sequence on CET ROI.
EGFR best results (reported as mean ± standard deviation).
| ROI | SEQ | xGB | GB | RF | LR | ST | KN | DT | AB | ST_ABC | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| CET | rCBV | Acc % | 69,8 ± 15,1 | 75,4 ± 15 | 73,1 ± 16 | 64,3 ± 16,3 | 72,9 ± 14,3 | 61,3 ± 21,4 | 66,7 ± 19,4 | 81 ± 13,8 | 66,5 ± 18,7 |
| CET | rCBV | Roc % | 63,9 ± 19,5 | 64,6 ± 18,5 | 64,7 ± 20 | 62,2 ± 21,8 | 65,7 ± 18,9 | 63,4 ± 23,3 | 59,4 ± 23,2 | 74,3 ± 17,3 | 62,6 ± 20 |
| CET | T2 | Acc % | 76,4 ± 15,2 | 74,7 ± 15 | 76,4 ± 16 | 60,8 ± 18,8 | 76 ± 17,8 | 59,7 ± 20,4 | 61,3 ± 18,7 | 77,8 ± 13,8 | 71,5 ± 16 |
| CET | T2 | Roc % | 70,4 ± 22,7 | 69,7 ± 19,8 | 76,3 ± 17 | 65,4 ± 15,7 | 69,8 ± 22,8 | 60,2 ± 19,5 | 55,7 ± 20,4 | 74,1 ± 17,6 | 65,6 ± 20,6 |
Figure 7Best ROC curves for EGFR prediction: (A) AB classifier with rCBV sequence on CET ROI; (B) AB classifier with T2 sequence on CET ROI.