| Literature DB >> 30979965 |
David Molina-García1, Luis Vera-Ramírez2, Julián Pérez-Beteta3, Estanislao Arana4, Víctor M Pérez-García3.
Abstract
Many studies have built machine-learning (ML)-based prognostic models for glioblastoma (GBM) based on radiological features. We wished to compare the predictive performance of these methods to human knowledge-based approaches. 404 GBM patients were included (311 discovery and 93 validation). 16 morphological and 28 textural descriptors were obtained from pretreatment volumetric postcontrast T1-weighted magnetic resonance images. Different prognostic ML methods were developed. An optimized linear prognostic model (OLPM) was also built using the four significant non-correlated parameters with individual prognosis value. OLPM achieved high prognostic value (validation c-index = 0.817) and outperformed ML models based on either the same parameter set or on the full set of 44 attributes considered. Neural networks with cross-validation-optimized attribute selection achieved comparable results (validation c-index = 0.825). ML models using only the four outstanding parameters obtained better results than their counterparts based on all the attributes, which presented overfitting. In conclusion, OLPM and ML methods studied here provided the most accurate survival predictors for glioblastoma to date, due to a combination of the strength of the methodology, the quality and volume of the data used and the careful attribute selection. The ML methods studied suffered overfitting and lost prognostic value when the number of parameters was increased.Entities:
Mesh:
Year: 2019 PMID: 30979965 PMCID: PMC6461644 DOI: 10.1038/s41598-019-42326-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of the performance and number of attributes used for the different models studied in this paper.
| Model | Number of parameters | c-indexes | |
|---|---|---|---|
| Discovery | Validation | ||
| Cox | 4 | 0.735 | 0.744 |
| Best linear (OLPM) | 4 | 0.771 | 0.817 |
| NN with CV | 4 | 0.791 | 0.825 |
| NN | 4 | 0.740 | 0.751 |
| RFF_SVM | 4 | 0.747 | 0.783 |
| libSVM | 4 | 0.739 | 0.756 |
| RT | 4 | 0.696 | 0.681 |
| NN | 44 | 0.794 | 0.746 |
| RFF_SVM | 44 | 0.801 | 0.766 |
| libSVM | 44 | 0.752 | 0.700 |
| RT | 44 | 0.741 | 0.630 |
Cox, best optimized linear prognosis model and machine learning-based approaches are included. Results are listed for both the discovery and validation cohorts. C-indexes in the validation group over 0.8 are boldfaced.
Figure 1Kaplan-Meier curves obtained for the OPML and the best ML method (NN with CV) in the discovery (A,C respectively) and validation (B,D respectively) cohorts.
Figure 2Comparison of the predictive value (c-index) and number of variables for the models developed in this paper versus representative models from the literature. Previous approaches are shown in yellow, with different symbols corresponding to different studies. ML methods described in this paper are shown in red and linear models in blue. Results are given for the best models in each reference and for the validation groups when available.
Summary of patient characteristics, MR imaging and volumetric parameters for the groups of patients considered in the study.
| Discovery cohort | Validation cohort | ||
|---|---|---|---|
| Patient characteristics | Number of patients (censored) | 311 (27) | 93 (12) |
| Age (years) median (range) | 63 (19–86) | 62 (14–86) | |
| Sex {male (M), Female (F)} | 44% F; 56% M | 47% F; 53% M | |
| Survival (months) median (range) | 12.76 (0.13–82.97) | 11.77 (0.72–59.20) | |
| Type of resection (total, subtotal or biopsy) | 149 Total (47.91%) | 17 Total (18.28%) | |
| 113 Subtotal (36.33%) | — | ||
| 49 Biopsy (15.76%) | 8 Biopsy (8.60%) | ||
| — | 68 Unknown (73.12%) | ||
| Type of treatment (Chemotherapy (CT) and Radiotherapy (RT)) | 241 CT + RT (77.49%) | 74 CT + RT (79.57%) | |
| 27 RT alone (8.68%) | 4 RT alone (4.30%) | ||
| 5 CT alone (1.61%) | 4 CT alone (4.30%) | ||
| 38 no treatment (12.22%) | 11 no treatment (11.83%) | ||
| MRI characteristics | Pixel spacing (mm) mean (range) | 0.81 (0.46–1.09) | 0.90 (0.45–1.06) |
| Slice thickness (mm) mean (range) | 1.48 (1.00–2.00) | 1.41 (0.90–2.00) | |
| Spacing between slices (mm) mean (range) | 1.10 (0.50–2.00) | 1.36 (0.70–2.00) | |
| Number of slices mean (range) | 174 (80–360) | 150 (72–305) | |
| Volumetric parameters | Tumor volume (cm3) mean (range) | 33.14 (0.48–132.54) | 41.82 (2.47–116.12) |
| CE volume (cm3) mean (range) | 19.64 (0.44–90.06) | 24.90 (2.46–90.95) | |
| Necrotic volume (cm3) mean (range) | 13.50 (0.03–89.31) | 16.92 (0.00–69.20) | |
| CE rim width (cm) mean (range) | 0.57 (0.22–1.65) | 0.63 (0.24–1.25) | |
| Maximum diameter (cm) mean (range) | 5.11 (1.30–11.09) | 5.71 (2.55–9.80) | |
| Total surface (cm2) mean (range) | 67.27 (3.00–226.32) | 83.13 (13.27–196.04) | |
| Surface regularity mean (range) | 0.62 (0.24–0.99) | 0.57 (0.30–0.83) | |