| Literature DB >> 34178682 |
Yuntai Cao1,2,3,4, Guojin Zhang3,5, Jing Zhang3,6, Yingjie Yang7, Jialiang Ren8, Xiaohong Yan9, Zhan Wang10, Zhiyong Zhao2,3,4, Xiaoyu Huang2,3,4, Haihua Bao1, Junlin Zhou3,4.
Abstract
BACKGROUND: This study aimed to develop and validate a computed tomography (CT)-based radiomics model to predict microsatellite instability (MSI) status in colorectal cancer patients and to identify the radiomics signature with the most robust and high performance from one of the three phases of triphasic enhanced CT.Entities:
Keywords: CT; colorectal cancer; microsatellite instability; radiomics; triphasic enhanced phase
Year: 2021 PMID: 34178682 PMCID: PMC8222982 DOI: 10.3389/fonc.2021.687771
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Patient inclusion and exclusion details and the patient recruitment pathway.
Figure 2Workflow of microsatellite instability (MSI) prediction building and analysis. The tumors were segmented on arterial phase (A, B), delayed phase (C, D) and venous phase (E, F) CT images to form volumes of interest (VOIs). One thousand and thirty-seven quantitative radiomics features were extracted from each patient. The least absolute shrinkage and selection operator (LASSO) was used to select the features. Multivariate logistic regression was used to build radiomics, clinical, and clinicoradiomics combined models for MSI prediction. Finally, the radiomics signature and clinical factors were incorporated into a nomogram for individual evaluation. Receiver operating characteristic curves were used to evaluate the clinical usefulness of the nomogram.
Characteristics of patients in the training cohort [median (Q1, Q3) or no. (%)].
| Characteristics | Training cohort (n=441) | |||
|---|---|---|---|---|
| MSS | MSI | P value | ||
| Age (years) | 61.00 (51.00, 68.00) | 51.00 (42.50, 63.00) | <0.001 | |
| Gender | Female | 153 (40.9%) | 33 (49.3%) | 0.203 |
| Male | 221 (59.1%) | 34 (50.7%) | ||
| Tumor Location | Left | 267 (71.4%) | 22 (32.8%) | <0.001 |
| Right | 107 (28.6%) | 45 (67.2%) | ||
| CEA level | 4.03 (2.18, 12.82) | 2.81 (1.60, 6.37) | 0.009 | |
| CA125 level | 12.02 (8.73, 17.30) | 16.71 (9.59, 24.64) | 0.004 | |
| CA199 level | 13.45 (7.74, 26.59) | 9.99 (5.94, 25.36) | 0.067 | |
| cT stage | T1 | 12 (3.2%) | 0 (0.0%) | 0.671 |
| T2 | 58 (15.5%) | 10 (14.9%) | ||
| T3 | 236 (63.1%) | 47 (70.1%) | ||
| T4 | 68 (18.2%) | 10 (14.9%) | ||
| cN stage | N0 | 210 (56.1%) | 44 (65.7%) | 0.201 |
| N1 | 81 (21.7%) | 11 (16.4%) | ||
| N2 | 83 (22.2%) | 12 (17.9%) | ||
| Maximum diameter (mm) | 19.80 (15.71, 25.62) | 24.70 (18.31, 30.80) | 0.001 | |
Predictive performance of different models in training and validation cohorts.
| Feature_num | Methods | Training cohort | Validation cohort | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AUC | Accuracy | Sensitivity | Specificity | PPV | NPV | AUC | Accuracy | Sensitivity | Specificity | PPV | NPV | ||
| 6 | APR | 0.775(0.715-0.835) | 0.698(0.653-0.741) | 0.716(0.612-0.806) | 0.695(0.583-0.810) | 0.296(0.265-0.322) | 0.932(0.920-0.941) | 0.786(0.644-0.929) | 0.689(0.557-0.801) | 0.667(0.333-1.000) | 0.692(0.519-0.962) | 0.273(0.158-0.360) | 0.923(0.900-0.943) |
| 10 | VPR | 0.827(0.774-0.880) | 0.744(0.700-0.784) | 0.731(0.612-0.836) | 0.746(0.631-0.869) | 0.340(0.301-0.371) | 0.939(0.929-0.948) | 0.810(0.674-0.946) | 0.754(0.627-0.855) | 0.556(0.222-0.889) | 0.788(0.500-1.000) | 0.312(0.154-0.421) | 0.911(0.867-0.929) |
| 16 | DPR | 0.887(0.847-0.927) | 0.787(0.746-0.824) | 0.791(0.701-0.896) | 0.786(0.722-0.909) | 0.398(0.370-0.429) | 0.955(0.951-0.960) | 0.953(0.903-1.000) | 0.852(0.738-0.930) | 1.000(0.778-1.000) | 0.827(0.808-0.981) | 0.500(0.437-0.500) | 1.000(1.000-1.000) |
| 16 | FR | 0.904(0.870-0.938) | 0.803(0.762-0.839) | 0.836(0.716-0.925) | 0.797(0.684-0.885) | 0.424(0.387-0.449) | 0.964(0.959-0.968) | 0.893(0.804-0.982) | 0.787(0.663-0.881) | 0.778(0.444-1.000) | 0.788(0.635-0.962) | 0.389(0.267-0.450) | 0.953(0.943-0.962) |
| 3 | Clinical | 0.781(0.722-0.840) | 0.721(0.677-0.762) | 0.716(0.567-0.836) | 0.722(0.618-0.799) | 0.316(0.268-0.350) | 0.934(0.924-0.940) | 0.919(0.833-1.000) | 0.869(0.758-0.942) | 0.889(0.442-1.000) | 0.865(0.596-1.000) | 0.533(0.362-0.563) | 0.978(0.969-0.981) |
| 4 | Clinical Radiomics | 0.898(0.860-0.937) | 0.837(0.799-0.870) | 0.821(0.672-0.896) | 0.840(0.663-0.912) | 0.478(0.429-0.500) | 0.963(0.954-0.966) | 0.964(0.919-1.000) | 0.918(0.819-0.973) | 1.000(0.667-1.000) | 0.904(0.846-1.000) | 0.643(0.545-0.643) | 1.000(1.000-1.000) |
FR, fusion of radiomics features of arterial phase, venous phase, and delayed phase; Clinical, fusion of clinical, and radiological characteristics; Clinical Radiomics, fusion of clinicalradiological features and radiomics features. APR, radiomics model of arterial phase; AUC, area under the curve; D, DPR, radiomics model of delayed phase; NPV, negative predictive value; PPV, positive predictive value; VPR, radiomics model of venous phase.
The final signatures selected from 3D radiomics features.
| Arterial phase (n=10) | Venous phase (n=10) | Delayed phase (n=16) | Radiomics (n=16) |
|---|---|---|---|
| A_original_glszm_GrayLevelVariance | V_original_glszm_GrayLevelVariance | D_original_shape_Elongation | V_original_glszm_ZoneEntropy |
| A_log.sigma.5.0.mm.3D_glszm_LargeAreaHighGrayLevelEmphasis | V_original_glszm_ZoneEntropy | D_original_firstorder_Range | V_wavelet.LHL_glszm_LargeAreaHighGrayLevelEmphasis |
| A_wavelet.LHL_firstorder_90Percentile | V_log.sigma.5.0.mm.3D_gldm_DependenceNonUniformityNormalized | D_original_ngtdm_Contrast | V_wavelet.HLH_firstorder_Mean |
| A_wavelet.LHL_firstorder_Skewness | V_wavelet.LHL_glcm_MCC | D_log.sigma.3.0.mm.3D_glszm_LargeAreaLowGrayLevelEmphasis | V_wavelet.HHH_glszm_LargeAreaLowGrayLevelEmphasis |
| A_wavelet.LHL_gldm_SmallDependenceLowGrayLevelEmphasis | V_wavelet.LHL_glszm_LargeAreaHighGrayLevelEmphasis | D_log.sigma.3.0.mm.3D_gldm_DependenceNonUniformityNormalized | V_wavelet.HHH_gldm_SmallDependenceLowGrayLevelEmphasis |
| A_wavelet.LHH_glszm_SmallAreaEmphasis | V_wavelet.LHL_gldm_DependenceVariance | D_log.sigma.3.0.mm.3D_ngtdm_Contrast | D_original_firstorder_Range |
| V_wavelet.HLL_glszm_SizeZoneNonUniformity | D_log.sigma.5.0.mm.3D_glszm_LargeAreaHighGrayLevelEmphasis | D_original_ngtdm_Contrast | |
| V_wavelet.HLH_firstorder_Mean | D_log.sigma.5.0.mm.3D_ngtdm_Busyness | D_log.sigma.3.0.mm.3D_glszm_LargeAreaLowGrayLevelEmphasis | |
| V_wavelet.HHH_glszm_LargeAreaLowGrayLevelEmphasis | D_wavelet.LLH_glcm_InverseVariance | D_log.sigma.3.0.mm.3D_gldm_DependenceNonUniformityNormalized | |
| V_wavelet.HHH_gldm_SmallDependenceLowGrayLevelEmphasis | D_wavelet.LHL_glszm_GrayLevelNonUniformityNormalized | D_log.sigma.5.0.mm.3D_ngtdm_Busyness | |
| D_wavelet.LHL_glszm_LargeAreaHighGrayLevelEmphasis | D_wavelet.LLH_glcm_InverseVariance | ||
| D_wavelet.LHH_glcm_InverseVariance | D_wavelet.LHL_glszm_GrayLevelNonUniformityNormalized | ||
| D_wavelet.LHH_glszm_LargeAreaEmphasis | D_wavelet.LHH_glcm_InverseVariance | ||
| D_wavelet.LHH_gldm_DependenceNonUniformityNormalized | D_wavelet.LHH_gldm_DependenceNonUniformityNormalized | ||
| D_wavelet.HLH_glcm_Imc1 | D_wavelet.HLH_glcm_Imc1 | ||
| D_wavelet.LLL_firstorder_Skewness | D_wavelet.LLL_firstorder_Skewness |
Figure 3The receiver operating characteristic (ROC) curves of the different models in training cohort (A) and validation cohort (B). AUC, area under the curve; APR, radiomics model of arterial phase; DPR, radiomics model of delayed phase; VPR, radiomics model of venous phase; FR, radiomics model of fusion of arterial phase, delayed phase and venous phase features; Clinical Radiomics, fusion of clinical risk factors and radiomics features of delayed phase.
Figure 4A Clinical Radiomics nomogram for preoperative identification of microsatellite instability status in colorectal cancer patients (A). The nomogram was constructed based on multivariate logistic regression and consisted of three clinical factors and 16 radiomics signatures. Calibration curves of the different models in training cohort (B) and validation cohort (C); the y-axis represents the actual microsatellite instability rate and the x-axis represents the predicted microsatellite instability risk. The diagonal dotted line indicates that the predicted outcome perfectly corresponds with the actual outcome. The solid line indicates the bias-corrected accuracy of the different models, with a closer fit to the diagonal dotted line representing a better prediction. Decision curve analysis of the different models in training cohort (D) and validation cohort (E); the y-axis represents the net benefit, which is calculated by subtracting the expected harm (false positives) from the expected benefit (gaining true positives) and subtracting expected harm (deleting false positives). The higher curve at any given threshold probability is the optimal prediction to maximize net benefit. The solid colored lines represent the different models. The solid gray line represents the assumption that all patients had microsatellite instability. The solid black line represents the assumption that no patients had microsatellite instability. APR, radiomics model of arterial phase; DPR, radiomics model of delayed phase; VPR, radiomics model of venous phase; FR, radiomics model of fusion of arterial phase, delayed phase and venous phase features; Clinical Radiomics, fusion of clinical risk factors and radiomics features of delayed phase; HL, Hosmer-Lemeshow test.
Figure 5Precision-recall (PR) curves of the different models in the training cohort (A) and validation cohort (B). PR represents the relationship between precision and recall. The larger the area under the curve value of the PR curve, the better the model performance. Precision = true positive/(true positive + false positive); recall = true positive/(true positive + false negative). APR, radiomics model of arterial phase; DPR, radiomics model of delayed phase; VPR, radiomics model of venous phase; FR, radiomics model of fusion of arterial phase, delayed phase and venous phase features; Clinical Radiomics, fusion of clinical risk factors and radiomics features of delayed phase.
Figure 6Heat map comparison of the different models in the training cohort. The values in the matrix represent the results of Delong test between two models. APR, radiomics model of arterial phase; DPR, radiomics model of delayed phase; VPR, radiomics model of venous phase; FR, radiomics model of fusion of arterial phase, delayed phase and venous phase features; Clinical Radiomics, fusion of clinical risk factors and radiomics features of delayed phase.