| Literature DB >> 32161279 |
Marta Bogowicz1,2, Arthur Jochems3, Timo M Deist3, Stephanie Tanadini-Lang4, Shao Hui Huang5, Biu Chan5, John N Waldron5, Scott Bratman5, Brian O'Sullivan5, Oliver Riesterer4,6, Gabriela Studer4,7, Jan Unkelbach4, Samir Barakat3, Ruud H Brakenhoff8, Irene Nauta8, Silvia E Gazzani9, Giuseppina Calareso10, Kathrin Scheckenbach11, Frank Hoebers12, Frederik W R Wesseling12, Simon Keek3, Sebastian Sanduleanu3, Ralph T H Leijenaar3, Marije R Vergeer13, C René Leemans8, Chris H J Terhaard14, Michiel W M van den Brekel15, Olga Hamming-Vrieze16, Martijn A van der Heijden15, Hesham M Elhalawani17, Clifton D Fuller17, Matthias Guckenberger4, Philippe Lambin3.
Abstract
A major challenge in radiomics is assembling data from multiple centers. Sharing data between hospitals is restricted by legal and ethical regulations. Distributed learning is a technique, enabling training models on multicenter data without data leaving the hospitals ("privacy-preserving" distributed learning). This study tested feasibility of distributed learning of radiomics data for prediction of two year overall survival and HPV status in head and neck cancer (HNC) patients. Pretreatment CT images were collected from 1174 HNC patients in 6 different cohorts. 981 radiomic features were extracted using Z-Rad software implementation. Hierarchical clustering was performed to preselect features. Classification was done using logistic regression. In the validation dataset, the receiver operating characteristics (ROC) were compared between the models trained in the centralized and distributed manner. No difference in ROC was observed with respect to feature selection. The logistic regression coefficients were identical between the methods (absolute difference <10-7). In comparison of the full workflow (feature selection and classification), no significant difference in ROC was found between centralized and distributed models for both studied endpoints (DeLong p > 0.05). In conclusion, both feature selection and classification are feasible in a distributed manner using radiomics data, which opens new possibility for training more reliable radiomics models.Entities:
Mesh:
Year: 2020 PMID: 32161279 PMCID: PMC7066122 DOI: 10.1038/s41598-020-61297-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Characteristic of studied cohorts.
| Center | BD2Decide | Design | MD Anderson | PMH | VUmc | USZ | |
|---|---|---|---|---|---|---|---|
| number of patients | 206 | 141 | 110 | 441 | 100 | 176 | |
| 2 years OS | dead | 55 64% | 36 66% | 0 0% | 96 72% | 31 55% | 40 71% |
| alive | 151 36% | 105 34% | 0 0% | 345 28% | 69 45% | 136 29% | |
| unknown | 0 0% | 0 0% | 110 100% | 0 0% | 0 0% | 0 0% | |
| HPV | positive | 33 16% | 0 0% | 98 89% | 274 62% | 23 23% | 58 33% |
| negative | 61 30% | 141 100% | 12 11% | 116 26% | 77 77% | 82 47% | |
| unknown | 112 54% | 0 0% | 0 0% | 51 12% | 0 0% | 36 20% | |
| Head and neck tumor site | oropharynx | 128 62% | 63 45% | 110 100% | 441 100% | 100 100% | 113 64% |
| hypopharynx | 13 6% | 47 33% | 0 0% | 0 0% | 0 0% | 37 21% | |
| larynx | 20 10% | 31 22% | 0 0% | 0 0% | 0 0% | 16 9% | |
| oral cavity | 45 22% | 0 0% | 0 0% | 0 0% | 0 0% | 10 6% |
Figure 1Scheme of the distributed model training. Model training was divided into two parts: feature selection and model fitting. In both parts local statistics were computed at the local repositories and sent to the central server. In the central server the global statistics were estimated and sent back to the local repositories. Finally, the model was tested in a validation cohort.
Figure 2Comparison of feature selection methods based on the area under receiver operating characteristics (AUC). The bars present results from both centralized (light gray and light blue) and distributed (dark grey and dark blue) feature selection together with 95% confidence intervals. No statistically significant difference was observed between the selection methods (DeLong p-value> 0.05).
Figure 3Comparison of nomograms for models obtained using centralized and distributed logistic regression. The coefficients of the models are identical. Example for the model prediction HPV, trained on the cohorts: bd2decide, md anderson, vumc, usz.
Figure 4The receiver operating characteristics of radiomics-based models for HPV prediction. The AUCs are given with 95% confidence interval. No significant difference in ROC was observed between models trained in the centralized and distributed workflow.
Figure 5Comparison of Kaplan-Meier curves for the risk-group split based on the 2 years overall survival models trained centrally and distributed. Both models performed equally well on all validation cohorts. The G-rho test p-values and odds ratio (OR) are shown for comparison.