| Literature DB >> 33772334 |
Marta Ferreira1, Pierre Lovinfosse2, Johanne Hermesse3, Marjolein Decuypere4, Caroline Rousseau5,6, François Lucia7,8, Ulrike Schick7,8, Caroline Reinhold9, Philippe Robin10, Mathieu Hatt8, Dimitris Visvikis8, Claire Bernard2, Ralph T H Leijenaar11,12, Frédéric Kridelka4, Philippe Lambin12,13, Patrick E Meyer14, Roland Hustinx15.
Abstract
PURPOSE: To test the performances of native and tumour to liver ratio (TLR) radiomic features extracted from pre-treatment 2-[18F] fluoro-2-deoxy-D-glucose ([18F]FDG) PET/CT and combined with machine learning (ML) for predicting cancer recurrence in patients with locally advanced cervical cancer (LACC).Entities:
Keywords: Cervical cancer; Disease-free survival; Machine learning; Radiomics; [18F]FDG PET/CT
Mesh:
Substances:
Year: 2021 PMID: 33772334 PMCID: PMC8440288 DOI: 10.1007/s00259-021-05303-5
Source DB: PubMed Journal: Eur J Nucl Med Mol Imaging ISSN: 1619-7070 Impact factor: 9.236
Patient’s characteristics
| CHU Liège (Scanner A) | CHU Brest and ICO St Herbain (Scanner B) | Total | CHU Mcgill (Scanner C) | |
|---|---|---|---|---|
| Number of patients | 89 | 51 | 140 | 18 |
| Age (median and range in years) | 50 (23–76) | 52 (23–82) | 51 (23–82) | 50 (28–86) |
| FIGO (%) | ||||
| IB1-IB2 | 18% | 12% | 16% | 6% |
| IIA-IIB | 66% | 58% | 64% | 56% |
| IIIA-IIIB | 12% | 18% | 14% | 33% |
| IVA | 3% | 12% | 6% | 6% |
| Histology (% of SCC) | ||||
| 87% | 82% | 85% | 89% | |
| LN metastasis | ||||
| % of patients | 19% | 16% | 18% | 28% |
| Recurrence (%) | 21% | 35% | 26% | 50% |
Fig. 1Radiomics pipeline
Hazard ratios (HR) with 95% confidence intervals and P values of clinical features, as well as TV, MTV, SUV Max and TLG after performing a univariate Cox proportional hazard model to predict DFS. The Youden Index was used to find a threshold for each predictor, plot Kaplan-Meier curves and evaluate the performance of each individual feature in predicting DFS in the test set. AUC, recall, precision and F1-score were used as DFS performance metrics
| Precision | Recall | AUC | HR (95% CI) | Feature | |||
|---|---|---|---|---|---|---|---|
| 0.32 | 0.33 | 0.2 | 1 | 0.6 | 0.14 | 1.23 (0.94–1.6) | FIGO |
| 0.15 | 0.4 | 0.4 | 0.4 | 0.64 | 0.33 | 1.14 (0.87–1.5) | Histology |
| 0.67 | 0.22 | 0.25 | 0.2 | 0.54 | 0.15 | 1.22 (0.93–1.6) | Metastasis |
| 0.027 | 0.44 | 0.5 | 0.4 | 0.61 | 0.84 | 0.97 (0.71–1.3) | Age |
| 0.17 | 0 | 0 | 0 | 0.48 | 0.01 | 1.4 (1.1–1.9) | TV |
| 0.23 | 0.36 | 0.24 | 0.8 | 0.48 | 0.65 | 1.06 (0.81–1.4) | MTV 50% |
| 0.87 | 0.22 | 0.15 | 0.4 | 0.43 | 0.1 | 1.25 (0.96–1.6) | SUV MAX |
| 0.67 | 0.15 | 0.13 | 0.2 | 0.43 | 0.32 | 1.14 (0.88–1.5) | TLG 50% |
Features which were significant in univariate and multivariate analysis. For each of the features, we show the hazard ratios (HR) with 95% confidence intervals and P values after performing the univariate Cox proportional hazard model. Additionally, we also show the AUC, recall, precision, F1-score and P value of KM curve. The threshold to measure the last metrics was defined using the Youden index of the training data set ROC curve. Feature’s names are described as in the supplementary data B with the additional information of the discretisation width and features origin (OR or TLR)
| Precision | Recall | AUC | HR (95% CI) | Feature | ||||
|---|---|---|---|---|---|---|---|---|
| 0.79 | 0.21 | 0.14 | 0.4 | 0.46 | 0.034 | 0.00049 | 0.44 (0.28–0.70) | GLDZM_DZNN_0.5 (OR) |
| 0.044 | 0.43 | 0.28 | 0.1 | 0.68 | 0.0023 | 0.00058 | 1.61 (1.20–2.10) | GLSZM_HILAE_0.5 (TLR) |
| 0.47 | 0.13 | 0.1 | 0.2 | 0.43 | 0.0016 | 0.001 | 1.65 (1.20–2.20) | GLDZM_DZV_0.05 (TLR from interpolated images) |
| 0.088 | 0.24 | 0.14 | 0.8 | 0.61 | 0.0022 | 0.001 | 1.72 (1.20–2.40) | Stats_qcod_0.2 (TLR from interpolated images) |
| 0.15 | 0.4 | 0.4 | 0.4 | 0.64 | NA | 0.002 | 0.65 (0.49–0.85) | Histology |
Fig. 2Kaplan-Meier curve of the each individual significant feature in univariate and multivariate analysis, after the Cox proportional hazard model. a GLDZM_DZNN_0.5 (OR) (Threshold = 0.59, log-rank test P value 0.079). b GLSZM_HILAE_0.5 (TLR) (Threshold = 0.07, log-rank test P value 0.044), c GLDZM_DZV_0.05 (TLR from interpolated images) (Threshold = 1.28, log-rank test P value 0.47). d Stats_qcod_0.2 (TLR from interpolated images) (Threshold = 1.18, log-rank test P value 0.088). e Histology (Threshold = 0.5, log-rank test P value 0.15)
Fig. 3Kaplan-Meier curve of the test set for the best OR (a) and TLR (b) model. Red and blue curves represent respectively patients with better and worse prognosis. The log-rank test was used to estimate statistical significance of the difference between survival curves. The P value obtained from the log-rank test is shown in the left down corner of each image. The difference between both the Kaplan-Meier curves is statistical significant (log-rank P value = 0.034 for the OR model and 0.002 for the TLR model)
Variation of Bootstrap mean AUC, F1-score and F2-score, precision, recall and AUCpr values according to the different test data obtained from the different scanners. Table 4 (part a) shows the models using OR radiomics features and Table 4 (part b) those with TLR features. The 95% confidence intervals are in parentheses
| AUC | Precision | Recall | AUCpr | |||
|---|---|---|---|---|---|---|
| a | ||||||
| Mix Scanner A and B | 0.65 (0.47–0.73) | 0.44 (0.25–0.57) | 0.56 (0.32–0.69) | 0.32 (0.18–0.44) | 0.69 (0.40–0.80) | 0.38 (0.22–0.58) |
| Scanner A | 0.54 (0.24–0.67) | 0.37 (0–0.57) | 0.46 (0–0.63) | 0.29 (0–0.5) | 0.55 (0–0.67) | 0.36 (0.15–0.72) |
| Scanner B | 0.81 (0.50–1) | 0.52 (0.25–0.67) | 0.69 (0.36–0.83) | 0.37 (0.17–0.50) | 0.91 (0.50–1) | 0.57 (0.27–1) |
| Scanner C | 0.57 (0.36–0.75) | 0.40 (0.14–0.67) | 0.35 (0.12–0.61) | 0.58 (0.20–1) | 0.32 (0.11–0.56) | 0.61 (0.46–0.82) |
| b | ||||||
| Mix Scanner A and B | 0.78 (0.67–0.88) | 0.49 (0.25–0.67) | 0.56 (0.22–0.74) | 0.42 (0.25–0.60) | 0.63 (0.2–0.8) | 0.53 (0.33–0.72) |
| Scanner A | 0.70 (0.56–0.84) | 0.36 (0–0.57) | 0.41 (0–0.63) | 0.31 (0–0.50) | 0.46 (0–0.67) | 0.39 (0.25–0.65) |
| Scanner B | 0.95 (0.78–1) | 0.67 (0.4–1) | 0.78 (0.45–1) | 0.57 (0.29–1) | 0.89 (0.50–1) | 0.88 (0.58–1) |
| Scanner C | 0.50 (0.37–0.65) | 0.25 (0–0.46) | 0.20 (0–0.38) | 0.46 (0–0.75) | 0.18 (0–0.33) | 0.55 (0.46–0.67) |
Radiomic signatures developed by Altazi et al. The models were tested in the mix test set (test set A + B)
| AUC | |
|---|---|
| Ref [ | |
| G1 | 0.49 |
| G2 | 0.56 |
| G3 | 0.6 |