| Literature DB >> 35109864 |
Lilang Lv1,2, Bowen Xin3, Yichao Hao3, Ziyi Yang4,2,5,6, Junyan Xu4,2,5,6, Lisheng Wang7, Xiuying Wang3, Shaoli Song8,9,10,11, Xiaomao Guo12,13.
Abstract
BACKGROUND: To develop and validate a survival model with clinico-biological features and 18F- FDG PET/CT radiomic features via machine learning, and for predicting the prognosis from the primary tumor of colorectal cancer.Entities:
Keywords: 18F-FDG PET/CT; Prediction; Prognosis; Radiomics; Stage III Colorectal cancer
Mesh:
Substances:
Year: 2022 PMID: 35109864 PMCID: PMC8812058 DOI: 10.1186/s12967-022-03262-5
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Fig. 1Radiomics workflow. Input PET/CT was collected from patients with colorectal cancer (Stage I–IV). Feature engineering was used to extract radiomics features from region of interests in PET/CT images and to select important features via log-rank test and variable hunting. Two random survival forest (RSF) models M1 and M2 were constructed followed by statistical analysis including survival prediction, correlation analysis and individualized interpretation
Fig. 2Methodology and results of feature selection. A Methodology of feature selection: univariate log-rank test was applied to select features with p < 0.05 to form feature set a. Multivariate variable hunting was used to select discriminative feature combination (final feature set b) via five-fold cross validation. B Results of feature selection: the number of selected features in feature selection pipeline
Patients characteristics of the training and validation sets
| Characteristics | Training set (n = 138) | Validation set (n = 58) | P value* |
|---|---|---|---|
| CEA | 18.022 ± 41.604 | 25.928 ± 51.872 | 0.090 |
| CA199 | 0.535 | ||
| High | 36 | 12 | |
| Normal | 102 | 46 | |
| Lymph nodes | 2.326 ± 3.560 | 2.172 ± 3.320 | 0.804 |
| Stage | 0.486 | ||
| I | 13 | 3 | |
| II | 42 | 23 | |
| III | 55 | 23 | |
| IV | 28 | 9 | |
| SUVmax | 14.865 ± 6.100 | 13.430 ± 4.772 | 0.229 |
| SUVmean | 8.866 ± 3.539 | 8.091 ± 2.831 | 0.270 |
| TLG | 180.122 ± 163.749 | 168.098 ± 153.9764 | 0.887 |
Continuous data except follow-up time (which was shown with median) were demonstrated with means ± standard deviations while categorical data were demonstrated with the number of each category and percentage. *p-value was calculated by using χ2 test for categorical variable and Wilcoxon test for continuous variable
Fig. 3The performance of prognostic models. Figure A and B showed the comparison of the prognostic performance of different modalities on D-1 ~ 4 and D-3 respectively. Figure C and D were K–M Curves for different modalities on D-1 ~ 4 and D-3 respectively. Log-rank test was used for statistical tests used in K–M curve. P-value < 0.05 indicated survival distributions of high-risk and low-risk groups were significantly different. P-value was marked in each sub-figure of Figure C and D
Fig. 4Feature importance and Pearson correlation of M1 and M2. Figure A was the univariate importance of features in models M1 and M2. Clinical, PET, and CT features were represented by using blue, gray, and red bar, respectively. Figure B showed the overlapping between features selected for models M1 and M2. In Figure C, Pearson test was used for correlational statistical analysis. P-value < 0.05 indicated significant correlation identified between two variables (indicated with colored cells)
Fig. 5Case study for individualized result interpretation. Figure A showed the predicted survival curves of individual patients yielded by the model M2. Figure B showed the values of radiomic features of patients in the case study. Figure C visualized the tumor region in 3D body imaging, in PET/CT imaging and PET/CT radiomics features