Literature DB >> 34178682

Predicting Microsatellite Instability Status in Colorectal Cancer Based on Triphasic Enhanced Computed Tomography Radiomics Signatures: A Multicenter Study.

Yuntai Cao^1,2,3,4, Guojin Zhang^3,5, Jing Zhang^3,6, Yingjie Yang⁷, Jialiang Ren⁸, Xiaohong Yan⁹, Zhan Wang¹⁰, Zhiyong Zhao^2,3,4, Xiaoyu Huang^2,3,4, Haihua Bao¹, Junlin Zhou^3,4.

Abstract

BACKGROUND: This study aimed to develop and validate a computed tomography (CT)-based radiomics model to predict microsatellite instability (MSI) status in colorectal cancer patients and to identify the radiomics signature with the most robust and high performance from one of the three phases of triphasic enhanced CT.
METHODS: In total, 502 colorectal cancer patients with preoperative contrast-enhanced CT images and available MSI status (441 in the training cohort and 61 in the external validation cohort) were enrolled from two centers in our retrospective study. Radiomics features of the entire primary tumor were extracted from arterial-, delayed-, and venous-phase CT images. The least absolute shrinkage and selection operator method was used to retain the features closely associated with MSI status. Radiomics, clinical, and combined Clinical Radiomics models were built to predict MSI status. Model performance was evaluated by receiver operating characteristic curve analysis.
RESULTS: Thirty-two radiomics features showed significant correlation with MSI status. Delayed-phase models showed superior predictive performance compared to arterial- or venous-phase models. Additionally, age, location, and carcinoembryonic antigen were considered useful predictors of MSI status. The Clinical Radiomics nomogram that incorporated both clinical risk factors and radiomics parameters showed excellent performance, with an AUC, accuracy, and sensitivity of 0.898, 0.837, and 0.821 in the training cohort and 0.964, 0.918, and 1.000 in the validation cohort, respectively.
CONCLUSIONS: The proposed CT-based radiomics signature has excellent performance in predicting MSI status and could potentially guide individualized therapy.

Entities: Chemical

Keywords: CT; colorectal cancer; microsatellite instability; radiomics; triphasic enhanced phase

Year: 2021 PMID： 34178682 PMCID： PMC8222982 DOI： 10.3389/fonc.2021.687771

Source DB: PubMed Journal: Front Oncol ISSN： 2234-943X Impact factor: 6.244

Introduction

Colorectal cancer (CRC) is characterized by complex biological features and shows distinct heterogeneity. Even though the clinicopathological characteristics of CRC are similar, there is still significant variability in treatment response and prognosis (1). Two major molecular events are involved in the occurrence and development of CRC (2, 3). The vast majority of CRCs are caused by chromosomal instability events (approximately 85%), including mutations in APC, KRAS, and TP53 genes, etc. However, a small percentage of CRCs are caused by microsatellite instability (MSI) (approximately 15%). Mismatch repair (MMR) genes are highly conserved and are involved in repairing DNA base mismatches. They are beneficial in maintaining genome stability and reducing spontaneous mutations (4). MMR proteins include MLH1, MSH2, MSH6, and PMS2. During DNA replication, minor DNA mismatches occasionally occur, which are recognized by these proteins and then cut and synthesized into new strands for repair (5, 6). When any one of these four proteins are non-functional, they cause accumulation of DNA base mismatches in proliferating cells, a phenomenon known as MSI (6). MSI status is currently a key predictor for evaluating the treatment strategies and prognosis of CRC patients (7, 8). Compared with microsatellite-stable (MSS) CRC patients, CRC patients with MSI status are more likely to benefit from immunotherapy, but they do not benefit from pyrimidine analogs or fluorouracil-based adjuvant chemotherapy (9–11). In addition, CRC patients with MSI status may have a favorable prognosis (12–14). The National Comprehensive Cancer Network (NCCN), European Society for Medical Oncology (ESMO), and Japanese Society for Cancer of the Colon and Rectum (JSCCR) guidelines recommend testing the MSI status of CRC patients (4, 15, 16). At present, MSI status detection is mainly done through immunohistochemistry (IHC) and polymerase chain reaction (PCR) methods on biopsy or surgical tissue, both of which are invasive and costly (8, 17). Furthermore, the small part of the tissue captured by biopsy may not be sufficient to accurately reflect the MSI status of tumors (18, 19). In addition, these advanced biological tests can only be performed in qualified tertiary medical centers, as local medical institutions have not widely adopted these methods because of the lack of suitable medical equipment (20). Therefore, developing a non-invasive, cost-effective, and easily repeatable method to identify MSI status could help clinicians to develop more accurate treatment strategies for CRC patients. Radiomics is a burgeoning field in the era of precision medicine, involving screening, diagnosis, treatment, and prognostic assessment of multiple systemic diseases (21–24). By extracting high-dimensional, mineable data from medical imaging and evaluating its association with clinicopathologic factors or gene expression, radiomics facilitates the formulation of individualized treatment strategies. Radiomics has been widely used in CRC stage assessment (21), tumor differentiation identification (25), post-neoadjuvant chemotherapy efficacy evaluation (26), and KRAS mutation status identification (27). A previous study demonstrated a significant correlation between a CT-based radiomics signature and MSI status in CRC patients (28, 29). These results indicate that pretreatment CT may be associated with MSI status and that radiomics analysis may greatly contribute to MSI status identification. However, previous studies have only included a single group and lack external validation. Moreover, the superiority of the venous phase (VP) compared to arterial and delayed phases (AP and DP, respectively) in the prediction of MSI status in CRC patients remains to be confirmed. Therefore, the aim of this study was to investigate whether a CT-based radiomics signature could identify MSI status in CRC patients and to identify the phase with the most robust and high-performing radiomics signature from triphasic enhanced CT.

Materials and Methods

Patients

Ethical approval was obtained by the medical ethics committee in both participating centers (center I: Lanzhou University Second Hospital; center II: The Second People’s Hospital of Lanzhou city), and the requirement for informed consent was waived due to the retrospective nature of the study. Patient inclusion and exclusion details and the patient recruitment pathway are shown in . The institutional database in center I was searched for eligible patients who underwent curative resection between March 2014 and August 2020, and a total of 441 consecutive patients were enrolled. This study included 255 males (42.2%) and 186 females (57.8%), with an average age of 58.64 ± 12.92 years (range, 20–89 years). Furthermore, 61 patients from center II were also enrolled between July 2018 and August 2020, including 38 males (62.3%) and 23 females (37.7%), with an average age of 56.93 ± 11.94 years (range, 27–84 years). The model for MSI prediction was established in the training cohort and evaluated in the external validation cohort. The baseline clinical data of all CRC patients, including age, sex, tumor location, carcinoembryonic antigen (CEA) level, CA125 level, and CA199 level, were collected. Two radiologists (radiologist A, Y.T.C.; radiologist B, J.Z.) with more than 10 years of experience in abdominal imaging collected radiological features on preoperative CT images and recorded the results, including clinical tumor/lymph node (cT/N) stage and tumor maximum diameter (maximum diameter perpendicular to the long axis of the cross-sectional image). In order to minimize the deviation of the measurement results, the quantitative data was taken as the final result by the average of the measurement values of the two radiologists, while the qualitative data is diagnosed by the two radiologists independently, and the disagreement is resolved through negotiation.

Figure 1

Patient inclusion and exclusion details and the patient recruitment pathway.

Identification of MSI Status

MSI status was evaluated by immunohistochemical staining of MMR proteins (MLH1, MSH2, MSH6, PMS2). The standard streptavidin biotin-peroxidase procedure was performed on postoperative tissues to identify the MSI status. Patients were classified into the MSI or MSS group according to the staining results of MMR proteins. Among the four MMR proteins, negative staining for one or more proteins was defined as MSI. MSS was defined as positive staining for all four MMR proteins (6).

CT Image Acquisition and Segmentation

All patients underwent abdominal and/or pelvic enhanced CT scans in two institutions, which covered the whole tumor. Triphasic enhanced CT images were retrieved from the picture archiving and communication system (PACS, Carestream; Rochester, NY) and stored in corresponding folders in DICOM format for further analysis. The CT scanner and acquisition parameters of the three institutions are listed in . Two gastrointestinal radiologists (radiologist A and radiologist B) performed three-dimensional (3D) radiomics segmentation on AP, VP, and DP using ITK-SNAP software (version 3.6.0; www.itksnap.org). Radiologist A segmented 300 cases and radiologist B segmented the other 202 cases. For radiomics segmentation, an ROI was manually delineated on each slice of the tumor. Air and feces in the intestinal tract and pericolonic fat were carefully excluded from the contours. Finally, three ROIs (AP, DP, and VP) were generated for each patient. To evaluate inter-observer reproducibility and robustness of feature extraction, radiologist A and radiologist B randomly selected 30 patients and performed manual segmentation again. We estimated the reproducibility of feature extraction using inter-class correlation coefficients (ICCs), where ICCs greater than 0.80 indicated good reproducibility (30). Additionally, 30 patients were randomly selected from each CT scanner to build the CT scanner set for calculating intra- and interclass correlation coefficients (ICCs).

Feature Extraction

Before feature extraction, we adopted three steps to preprocess the CT images. First, we resampled images to 1 mm × 1 mm × 1 mm using linear interpolation to try to reduce the influence of different layer thicknesses. Second, we transformed the continuous images into discrete integer values using gray-level discretization processing (bin width = 25). Finally, Laplacian of Gaussian (LoG) and wavelet image filters were used to eliminate mixed noise in the processing of image digitization and to obtain low- or high-frequency features. Radiomics features were extracted using the PyRadiomics package (31). Seven classes of radiomics features were extracted from the original and filtered images (wavelet and LoG). Finally, 1037 3D radiomics features were extracted from each patient. The feature types and their numbers are as follows: (1) first-order (histogram) features (n = 198); (2) shape features (n = 14); (3) gray-level co-occurrence matrix (GLCM) features (n = 264); (4) gray-level run-length matrix (GLRLM) features (n = 176); (5) gray-level size zone matrix (GLSZM) features (n = 176); (6) neighboring gray-tone difference matrix (NGTDM) features (n = 55); (7) gray-level dependence matrix (GLDM) features (n = 154). The specific definitions and descriptions of the features are demonstrated in the .

Features Selection and Prediction Model Building

After radiomics feature extraction, all missing data in the training cohort were replaced by median value, z-score normalization was performed on each feature, and the same preprocessing procedure was applied to the validation cohort. We performed a binary classification task for MSI status prediction: MSS vs. MSI expression. It is worth noting that the sample numbers of the two groups were unbalanced between the training and validation cohorts. The initial bias adjustment method was used to correct the influence of unbalanced sample size. The adjustment bias b 0 was determined using the following equation: The process of radiomics feature selection that is most related to MSI status consists of three steps. First, univariate analysis with the Mann-Whitney U test was performed for feature selection to retain features with p < 0.05 for the subsequent process. Second, the least absolute shrinkage and selection operator (LASSO) method was used to retain features closely associated with MSI status. Finally, multivariable stepwise logistic regression further eliminated irrelevant features and retained the most informative features. A ten times five-fold cross-validation method was applied to avoid overfitting and to identify the model with the best performance. Three radiomics models were established based on the above radiomics signatures in triphasic phase-enhanced CT images (APR, VPR, and DPR models). In order to verify whether the model combining the triphasic enhanced phases can improve the prediction performance of MSI status, the FR model was built based on AP, VP, and DP fusion features from 3D segmentation patterns. The maximum area under the curve (AUC) in the training cohort determined the final regularization parameter. Furthermore, the Radiomics models predicted a radiomics signature demonstrating the likelihood of MSI status for each patient.

Clinical, Combined Model, and Nomogram Construction

For clinical and radiological features, the chi-squared test or Fisher’s exact test was used to compare differences in sex, CEA, CA125, CA199, cT stage and cN stage, while the Student’s t-test or Mann-Whitney U test was used to compare differences in age, and maximum diameter between the MSS and MSI groups in the training and external validation cohorts. Generally, P-values < 0.05 (two-sided) were considered statistically significant. We performed multivariable analyses to identify the most important features. A clinical model was established based on the inclusion of selected features. A combined model (clinical Radiomics) was developed based on correlated clinicalradiological features and radiomics features to verify whether the combination of radiomics signatures and clinicalradiological features could improve the prediction of MSI status, and it was presented as an individualized nomogram. Using multivariate logistic regression coefficients, a nomogram incorporating clinicalradiological characteristics and radiomics signatures was created for the training and external validation cohorts following the selection of clinical characteristics and model comparison. This nomogram provides a more convenient and reliable tool for patients and clinicians. A flowchart of the study is shown in .

Figure 2

Workflow of microsatellite instability (MSI) prediction building and analysis. The tumors were segmented on arterial phase (A, B), delayed phase (C, D) and venous phase (E, F) CT images to form volumes of interest (VOIs). One thousand and thirty-seven quantitative radiomics features were extracted from each patient. The least absolute shrinkage and selection operator (LASSO) was used to select the features. Multivariate logistic regression was used to build radiomics, clinical, and clinicoradiomics combined models for MSI prediction. Finally, the radiomics signature and clinical factors were incorporated into a nomogram for individual evaluation. Receiver operating characteristic curves were used to evaluate the clinical usefulness of the nomogram.

Statistical Analyses

All statistical analyses were conducted using the R statistical software package (version 3.6.3; http://www.Rproject.org). Student’s t-test, the Mann-Whitney U test, and the chi-squared test or Fisher’s exact test were used to compare continuous and categorical variables, as appropriate. A two-sided P-value < 0.05 was considered statistically significant. ICCs were used to calculate the consistency of measurements between the two radiologists and different CT scanners. Receiver operating characteristic (ROC) analysis was used to evaluate the predictive accuracy of the different models. The AUC, 95% confidence interval (CI), accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated for each model. Precision-recall (PR) curves and the DeLong test were used to compare the AUC estimates of the discrimination performance between different predictive models. A calibration curve was constructed to assess the goodness-of-fit of the models. The Hosmer-Lemeshow (HL) test was performed to assess the agreement between the predicted MSI status and the observed outcomes. To verify the clinical usefulness of the models, we quantified the net benefit at different threshold probabilities in the dataset using decision curve analysis (DCA).

Results

Clinical Characteristics

Patient and tumor characteristics in the training cohort are listed in . This study included 502 CRC patients (441 patients in center I, 61 patients in center II) in the final analysis. The prevalence of MSI was 15.19% (67/441) in center I and 14.75% (9/61) in center II.

Table 1

Characteristics of patients in the training cohort [median (Q1, Q3) or no. (%)].

Characteristics		Training cohort (n=441)
Characteristics		MSS	MSI	P value
Age (years)		61.00 (51.00, 68.00)	51.00 (42.50, 63.00)	<0.001
Gender	Female	153 (40.9%)	33 (49.3%)	0.203
	Male	221 (59.1%)	34 (50.7%)
Tumor Location	Left	267 (71.4%)	22 (32.8%)	<0.001
	Right	107 (28.6%)	45 (67.2%)
CEA level		4.03 (2.18, 12.82)	2.81 (1.60, 6.37)	0.009
CA125 level		12.02 (8.73, 17.30)	16.71 (9.59, 24.64)	0.004
CA199 level		13.45 (7.74, 26.59)	9.99 (5.94, 25.36)	0.067
cT stage	T1	12 (3.2%)	0 (0.0%)	0.671
	T2	58 (15.5%)	10 (14.9%)
	T3	236 (63.1%)	47 (70.1%)
	T4	68 (18.2%)	10 (14.9%)
cN stage	N0	210 (56.1%)	44 (65.7%)	0.201
	N1	81 (21.7%)	11 (16.4%)
	N2	83 (22.2%)	12 (17.9%)
Maximum diameter (mm)		19.80 (15.71, 25.62)	24.70 (18.31, 30.80)	0.001

Characteristics of patients in the training cohort [median (Q1, Q3) or no. (%)].

Predictive Performance of the Clinical Model

Age, tumor location, CEA level, CA125 level, and maximum diameter were found to be significantly different (P < 0.05) between the MSI and MSS groups in the training cohort, but other characteristics were not significantly different (P > 0.05). Finally, after multivariate analyses, age, tumor location, and CEA were selected as independent predictors of MSI and were enrolled into the clinical model ( ). The clinical model showed moderate performance in predicting MSI both in the training cohort and the validation cohort, with an AUC of 0.781 (95%CI, 0.722-0.840) in the training cohort and 0.919 (95%CI, 0.833-1.000) in the validation cohort ( ). The accuracy, sensitivity, and specificity were 0.721, 0.716, and 0.722 in the training cohort and 0.869, 0.889, and 0.865 in the validation cohort, respectively.

Table 3

Predictive performance of different models in training and validation cohorts.

Feature_num	Methods	Training cohort						Validation cohort
Feature_num	Methods	AUC	Accuracy	Sensitivity	Specificity	PPV	NPV	AUC	Accuracy	Sensitivity	Specificity	PPV	NPV
6	APR	0.775(0.715-0.835)	0.698(0.653-0.741)	0.716(0.612-0.806)	0.695(0.583-0.810)	0.296(0.265-0.322)	0.932(0.920-0.941)	0.786(0.644-0.929)	0.689(0.557-0.801)	0.667(0.333-1.000)	0.692(0.519-0.962)	0.273(0.158-0.360)	0.923(0.900-0.943)
10	VPR	0.827(0.774-0.880)	0.744(0.700-0.784)	0.731(0.612-0.836)	0.746(0.631-0.869)	0.340(0.301-0.371)	0.939(0.929-0.948)	0.810(0.674-0.946)	0.754(0.627-0.855)	0.556(0.222-0.889)	0.788(0.500-1.000)	0.312(0.154-0.421)	0.911(0.867-0.929)
16	DPR	0.887(0.847-0.927)	0.787(0.746-0.824)	0.791(0.701-0.896)	0.786(0.722-0.909)	0.398(0.370-0.429)	0.955(0.951-0.960)	0.953(0.903-1.000)	0.852(0.738-0.930)	1.000(0.778-1.000)	0.827(0.808-0.981)	0.500(0.437-0.500)	1.000(1.000-1.000)
16	FR	0.904(0.870-0.938)	0.803(0.762-0.839)	0.836(0.716-0.925)	0.797(0.684-0.885)	0.424(0.387-0.449)	0.964(0.959-0.968)	0.893(0.804-0.982)	0.787(0.663-0.881)	0.778(0.444-1.000)	0.788(0.635-0.962)	0.389(0.267-0.450)	0.953(0.943-0.962)
3	Clinical	0.781(0.722-0.840)	0.721(0.677-0.762)	0.716(0.567-0.836)	0.722(0.618-0.799)	0.316(0.268-0.350)	0.934(0.924-0.940)	0.919(0.833-1.000)	0.869(0.758-0.942)	0.889(0.442-1.000)	0.865(0.596-1.000)	0.533(0.362-0.563)	0.978(0.969-0.981)
4	Clinical Radiomics	0.898(0.860-0.937)	0.837(0.799-0.870)	0.821(0.672-0.896)	0.840(0.663-0.912)	0.478(0.429-0.500)	0.963(0.954-0.966)	0.964(0.919-1.000)	0.918(0.819-0.973)	1.000(0.667-1.000)	0.904(0.846-1.000)	0.643(0.545-0.643)	1.000(1.000-1.000)

FR, fusion of radiomics features of arterial phase, venous phase, and delayed phase; Clinical, fusion of clinical, and radiological characteristics; Clinical Radiomics, fusion of clinicalradiological features and radiomics features. APR, radiomics model of arterial phase; AUC, area under the curve; D, DPR, radiomics model of delayed phase; NPV, negative predictive value; PPV, positive predictive value; VPR, radiomics model of venous phase.

Radiomics Signature Building and Discrimination Performance Assessment

ICCs were calculated to evaluate the agreement of features extracted by the two radiologists and different CT scanners, and ICC values > 0.80 indicated good agreement. A total of 1037 3D radiomics features from AP, VP, and DP images were extracted for each patient, and irrelevant features were removed as described earlier. Finally, 6 AP, 10 VP, and 16 DP 3D radiomics features were retained as the final signatures. The feature names and distributions are listed in . The values of these features were significantly different between the MSI and MSS groups. Following stepwise regression analysis, 16 features were selected after fusion of the radiomics features from AP, VP, and DP (FR model). Significant differences were found in these features between the MSI and MSS groups ( ). As shown in , the feature heatmaps show that the correlation between most of the features is below than 0.9, indicating that the final features are less collinear with each other. Four models were built based on the above radiomics signatures for preoperatively predicting MSI (APR, DPR, VPR, and FR models). The AUC, accuracy, sensitivity, specificity, PPV, and NPV for each model are listed in and . The DPR model had optimal predictive performance compared to APR or VPR in the training and validation cohorts ( ). In addition, the FR model had a higher predictive AUC than APR, DPR, or VPR in the training cohort. In the validation cohort, the FR model had a higher predictive AUC than APR or VPR in the training cohort but slightly lower than the AUC of the DPR model.

Table 2

The final signatures selected from 3D radiomics features.

Arterial phase (n=10)	Venous phase (n=10)	Delayed phase (n=16)	Radiomics (n=16)
A_original_glszm_GrayLevelVariance	V_original_glszm_GrayLevelVariance	D_original_shape_Elongation	V_original_glszm_ZoneEntropy
A_log.sigma.5.0.mm.3D_glszm_LargeAreaHighGrayLevelEmphasis	V_original_glszm_ZoneEntropy	D_original_firstorder_Range	V_wavelet.LHL_glszm_LargeAreaHighGrayLevelEmphasis
A_wavelet.LHL_firstorder_90Percentile	V_log.sigma.5.0.mm.3D_gldm_DependenceNonUniformityNormalized	D_original_ngtdm_Contrast	V_wavelet.HLH_firstorder_Mean
A_wavelet.LHL_firstorder_Skewness	V_wavelet.LHL_glcm_MCC	D_log.sigma.3.0.mm.3D_glszm_LargeAreaLowGrayLevelEmphasis	V_wavelet.HHH_glszm_LargeAreaLowGrayLevelEmphasis
A_wavelet.LHL_gldm_SmallDependenceLowGrayLevelEmphasis	V_wavelet.LHL_glszm_LargeAreaHighGrayLevelEmphasis	D_log.sigma.3.0.mm.3D_gldm_DependenceNonUniformityNormalized	V_wavelet.HHH_gldm_SmallDependenceLowGrayLevelEmphasis
A_wavelet.LHH_glszm_SmallAreaEmphasis	V_wavelet.LHL_gldm_DependenceVariance	D_log.sigma.3.0.mm.3D_ngtdm_Contrast	D_original_firstorder_Range
	V_wavelet.HLL_glszm_SizeZoneNonUniformity	D_log.sigma.5.0.mm.3D_glszm_LargeAreaHighGrayLevelEmphasis	D_original_ngtdm_Contrast
	V_wavelet.HLH_firstorder_Mean	D_log.sigma.5.0.mm.3D_ngtdm_Busyness	D_log.sigma.3.0.mm.3D_glszm_LargeAreaLowGrayLevelEmphasis
	V_wavelet.HHH_glszm_LargeAreaLowGrayLevelEmphasis	D_wavelet.LLH_glcm_InverseVariance	D_log.sigma.3.0.mm.3D_gldm_DependenceNonUniformityNormalized
	V_wavelet.HHH_gldm_SmallDependenceLowGrayLevelEmphasis	D_wavelet.LHL_glszm_GrayLevelNonUniformityNormalized	D_log.sigma.5.0.mm.3D_ngtdm_Busyness
		D_wavelet.LHL_glszm_LargeAreaHighGrayLevelEmphasis	D_wavelet.LLH_glcm_InverseVariance
		D_wavelet.LHH_glcm_InverseVariance	D_wavelet.LHL_glszm_GrayLevelNonUniformityNormalized
		D_wavelet.LHH_glszm_LargeAreaEmphasis	D_wavelet.LHH_glcm_InverseVariance
		D_wavelet.LHH_gldm_DependenceNonUniformityNormalized	D_wavelet.LHH_gldm_DependenceNonUniformityNormalized
		D_wavelet.HLH_glcm_Imc1	D_wavelet.HLH_glcm_Imc1
		D_wavelet.LLL_firstorder_Skewness	D_wavelet.LLL_firstorder_Skewness

Figure 3

The receiver operating characteristic (ROC) curves of the different models in training cohort (A) and validation cohort (B). AUC, area under the curve; APR, radiomics model of arterial phase; DPR, radiomics model of delayed phase; VPR, radiomics model of venous phase; FR, radiomics model of fusion of arterial phase, delayed phase and venous phase features; Clinical Radiomics, fusion of clinical risk factors and radiomics features of delayed phase.

The final signatures selected from 3D radiomics features. Predictive performance of different models in training and validation cohorts. FR, fusion of radiomics features of arterial phase, venous phase, and delayed phase; Clinical, fusion of clinical, and radiological characteristics; Clinical Radiomics, fusion of clinicalradiological features and radiomics features. APR, radiomics model of arterial phase; AUC, area under the curve; D, DPR, radiomics model of delayed phase; NPV, negative predictive value; PPV, positive predictive value; VPR, radiomics model of venous phase. The receiver operating characteristic (ROC) curves of the different models in training cohort (A) and validation cohort (B). AUC, area under the curve; APR, radiomics model of arterial phase; DPR, radiomics model of delayed phase; VPR, radiomics model of venous phase; FR, radiomics model of fusion of arterial phase, delayed phase and venous phase features; Clinical Radiomics, fusion of clinical risk factors and radiomics features of delayed phase.

Predictive Performance of the Combined Model

As shown in , a Clinical Radiomics combined model was developed that incorporates clinical risk factors and DP radiomics signatures, which was presented as a quantitative nomogram. The Clinical Radiomics model showed excellent predictive ability for MSI status, with an AUC, accuracy, and sensitivity of 0.898, 0.837, and 0.821 in the training cohort and 0.964, 0.918, and 1.000 in the validation cohort, respectively. As shown in and , the Clinical Radiomics model had a better predictive AUC value than either the clinical model or the radiomics models in the training cohort and validation cohort.

Figure 4

A Clinical Radiomics nomogram for preoperative identification of microsatellite instability status in colorectal cancer patients (A). The nomogram was constructed based on multivariate logistic regression and consisted of three clinical factors and 16 radiomics signatures. Calibration curves of the different models in training cohort (B) and validation cohort (C); the y-axis represents the actual microsatellite instability rate and the x-axis represents the predicted microsatellite instability risk. The diagonal dotted line indicates that the predicted outcome perfectly corresponds with the actual outcome. The solid line indicates the bias-corrected accuracy of the different models, with a closer fit to the diagonal dotted line representing a better prediction. Decision curve analysis of the different models in training cohort (D) and validation cohort (E); the y-axis represents the net benefit, which is calculated by subtracting the expected harm (false positives) from the expected benefit (gaining true positives) and subtracting expected harm (deleting false positives). The higher curve at any given threshold probability is the optimal prediction to maximize net benefit. The solid colored lines represent the different models. The solid gray line represents the assumption that all patients had microsatellite instability. The solid black line represents the assumption that no patients had microsatellite instability. APR, radiomics model of arterial phase; DPR, radiomics model of delayed phase; VPR, radiomics model of venous phase; FR, radiomics model of fusion of arterial phase, delayed phase and venous phase features; Clinical Radiomics, fusion of clinical risk factors and radiomics features of delayed phase; HL, Hosmer-Lemeshow test.

Figure 5

Figure 6

Heat map comparison of the different models in the training cohort. The values in the matrix represent the results of Delong test between two models. APR, radiomics model of arterial phase; DPR, radiomics model of delayed phase; VPR, radiomics model of venous phase; FR, radiomics model of fusion of arterial phase, delayed phase and venous phase features; Clinical Radiomics, fusion of clinical risk factors and radiomics features of delayed phase.

Precision-recall (PR) curves of the different models in the training cohort (A) and validation cohort (B). PR represents the relationship between precision and recall. The larger the area under the curve value of the PR curve, the better the model performance. Precision = true positive/(true positive + false positive); recall = true positive/(true positive + false negative). APR, radiomics model of arterial phase; DPR, radiomics model of delayed phase; VPR, radiomics model of venous phase; FR, radiomics model of fusion of arterial phase, delayed phase and venous phase features; Clinical Radiomics, fusion of clinical risk factors and radiomics features of delayed phase. Heat map comparison of the different models in the training cohort. The values in the matrix represent the results of Delong test between two models. APR, radiomics model of arterial phase; DPR, radiomics model of delayed phase; VPR, radiomics model of venous phase; FR, radiomics model of fusion of arterial phase, delayed phase and venous phase features; Clinical Radiomics, fusion of clinical risk factors and radiomics features of delayed phase. The DCA results for the clinical model, radiomics models, and combined nomogram are presented in . The nomogram achieved more clinical utility in predicting MSI than the clinical model or radiomics model alone. The DCA curve of the nomogram demonstrated that when the threshold probability of a patient or doctor ranged between 5% and 80%, the use of the nomogram added greater benefit for MSI prediction than the treat-all-patients scheme or the treat-none scheme in the training cohort.

Discussion

In this study, we investigated the association between triphasic enhanced CT radiomics features and MSI status. Six, ten, and sixteen radiomics features showed significant correlation with MSI status in AP, DP, and VP, respectively. Four radiomics models (APR, DPR, VPR, and FR) were proposed using the above radiomics features in the training cohort to predict MSI status for patients with colorectal cancer, and we validated its performance in an external validation cohort from another center. Our study showed that the DPR model had a higher outstanding performance than the APR or VPR models in both the training and external validation cohorts. Meanwhile, the nomogram, based on DP radiomics features and clinical risk factors, showed excellent identification ability for MSI status in both training (AUC: 0.898, 95% CI 0.860-0.937) and external validation (AUC: 0.964, 95% CI 0.919–1.000) cohorts. Our nomogram may be useful for predicting the MSI status of CRC patients and, thus, has the potential to aid in the determination of therapeutic strategies. In common studies, the results of external validation cohort are lower than the training cohort due to overfitting. Our results show that the results of external validation cohort are slightly higher than the training cohort, and lack of overlap between the 95% CI of the accuracies between the training and the validation cohorts. Since our validation cohort is external data set, there is often some deviation in distribution between the two data sets due to geographical location and other factors, which may cause the model performance of the validation cohort higher than the training cohort. Validation on additional cohort is required to ensure the model’s reproducible and generalizable. In the present study, the incidence of MSI was 15.19% (67/441) in the training cohort and 14.75% (9/61) in the external validation cohort, which is consistent with previous literature (32, 33). CRC patients with MSI have distinct prognoses and treatment strategies compared to patients with MSS tumors, including better prognosis and benefits from fluorouracil chemotherapy; moreover, MSI may be a negative marker for immunotherapy. Previous studies (28, 29) have investigated the association between MSI and radiomics features. Fan et al. (28) used CT-based radiomics to predict the MSI status in 119 stage II CRC patients. The predictive AUC of the radiomics model (combination of clinical factors and radiomics features) was 0.752. Pernicka et al. (29) proposed a CT-based radiomics model for the prediction of MSI in stage II–III colon cancer. The combined model (combination of clinical factors and radiomics features) had moderate diagnostic efficacy, with AUC values of 0.80 and 0.79 in the training and validation sets, respectively. Both studies contained small samples and lacked effective validation of external data. Our proposed clinicoradiomics combined model performed better than previous models in both training and external validation cohorts. Therefore, it may be a potential quantitative tool for individualized MSI prediction. Due to the low incidence of MSI, the data distribution in this study was significantly unbalanced. The unbalanced distribution of data is a common problem in classification. Therefore, the bias adjustment method was used to overcome the training fit error in our study. A previous study used synthetic minority over-sampling technique (SMOTE) methods (28) to resolve data imbalance. The SMOTE method is based on increasing the “artificial” sample to resolve the imbalance of the data set. However, this strategy is prone to model overfitting and is difficult to demonstrate validity. In our study, 1037 quantitative features were extracted from CT images to build radiomics signatures. During the image preprocessing stage, LoG and wavelet filters (27) were applied to process the original image. Of the 1037 radiomics features, 6, 10, and 16 features were retained in AP, DP, and VP images, respectively, all of which demonstrated high correlations with MSI and were stable across multiple centers. To our surprise, the majority of radiomics features were LoG and wavelet filter features (26/32 in radiomics features) in the present study, which means that LoG and wavelet filters can improve the efficiency of capturing more phenotypic features related to MSI of CRC. In the present study, the texture feature was the most frequent radiomics feature in triphasic enhanced CT signatures (4/6 in AP, 9/10 in VP, 13/16 in DP). Texture features are microscopic features in an image that have been shown to be highly correlated with tumor heterogeneity (34, 35). However, these features are not easily identified by the human eye and cannot be interpreted as having a clear meaning (36). Our results showed that most texture features were associated with MSI status. Compared with the MSS group, the values of these features were significantly higher in the MSI group, which indicated more homogeneity in the ROI. Our finding is in line with those of previous reports (28, 29) that texture features were also the most frequent radiomics features for MSI prediction. We observed that first-order statistic features including A_wavelet.LHL_firstorder_90Percentile, A_wavelet.LHL_firstorder_Skewness, V_wavelet.HLH_firstorder_Mean, D_original_firstorder_Range, and D_wavelet.LLL_firstorder_Skewness were significantly associated with MSI status, which was consistent with the results of the studies by Fan et al. and Pernicka et al. (28, 29) The results of their studies show that the MSI status is associated with kurtosis and intensity histograms. Among the triphasic enhanced CT models for the prediction of the MSI status in the training cohort, the DPR model showed the highest performance, with an AUC value of 0.887, followed by 0.827 in the VPR model and 0.775 in the APR model. A similar trend was found in the validation cohort; the predictive AUCs of the DPR, VPR, and APR models were 0.953, 0.810, and 0.876, respectively. Although the VP is the most commonly used phase in gastrointestinal radiomics research, and previous radiomics features for MSI prediction were extracted from portal VP CT images. However, to date, this is the first study to develop a radiomics based model to predict the risk of MSI status in CRC patients based on triphasic enhanced CT with big data. To our surprise, the DPR model showed the best predictive performance in the training and validation cohorts. The triphasic enhanced phase images reflect the uptake and clearance of iodine over time in AP, VP, and DP (37). In AP, the contrast agent is mainly in the intervascular space, which results in focal mucosa enhancement. During VP and DP, the contrast agent is evenly distributed between the intervascular space and the extravascular space, leading to a well-proportioned enhancement (38). The degree of tumor enhancement in AP is positively correlated with the density of microvessels in the tumor, while in VP and DP, the degree of tumor enhancement is related to the content of contrast agent in the tumor interstitial space and vascular space. In addition, CRCs lack normal lymphatic drainage, and the contrast agent tends to remain in the tumor interstitial space for a longer time (38, 39). Therefore, CRCs are significantly enhanced in AP, while VP and DP show continuous enhancement. Previous literature shows that the increase in structure in the enhanced image is proportional to iodine concentration (38). The high content and uniform distribution of contrast agents in tumors may be one of the reasons for the high diagnostic efficiency of the DPR model. This is exactly the same as the number of key features in our study. The numbers of radiomics signatures in DP, VP, and AP were 16, 10, and 6, respectively. The dynamic changes of CRC from AP, VP to DP showed obvious transmural enhancement from inside the tumor to the outside. For triphasic enhanced CT, AP is mainly used for tumor detection and assessment of the tumor extent along the colorectal wall, VP is used for differentiating CRC from adjacent organs and evaluating lymph nodes, and DP is used to determine the depth of tumor invasion (40). Therefore, the range of tumors detected in DP is larger than that in VP or AP. This means that the ROI delineation range of the DP is the largest during the delineation of the tumor in triphasic enhanced CT images, which is consistent with our observations in the process of delineating tumor ROIs. A positive correlation between increased tumor range and increased diagnostic efficiency has been confirmed by previous studies (41). From the above description, another reason for the high predictive performance of the DPR model could be the large ROI range of tumors in DP images. Age, location, and CEA were independent predictors of MSI status in the multivariate analysis. CRC patients with an MSI status have distinct clinical characteristics compared to those with MSS tumors, such as a predominance of right-sided colonic tumors, and early age. Our finding is consistent with the results of a previous study (28, 29, 42). CEA levels were significantly lower in the MSI group than in the MSS group, while CA125 was significantly higher in the MSI group than in the MSS group in the present study. A significant correlation between MSI status and the above clinical predictors suggests that genetic alterations may have independent influences on CRC development, thus resulting in distinct tumor biological behavior compared with that of MSS tumors. These parameters could be easily obtained and thus considered as novel approaches for predicting MSI status. Further studies are essential to validate our findings. As for radiation dose, the average dose length product of triphasic enhanced scans was 1934.76 ± 147.18 mGy*cm, which is slightly higher than the diagnostic reference level for adults (1490 mGy*cm) published by China’s National Health Industry standard (WS/T 637-2018) (43). Application of new techniques such as multi-model iterative reconstruction technology could effectively reduce the radiation dose in clinical practice (44). Several limitations of our study should be noted. First, 501 patients were excluded because they did not meet the inclusion or exclusion criteria, which inevitably produced selection bias. Second, due to the irregular shape of some tumors, manual segmentation is time-consuming and may have observer variability. In future studies, automated segmentation may be a potential tool to resolve this problem. Third, in this study, we used different imaging instruments and acquisition parameters to complete CT scanning. The influence of different instruments and different parameters on radiomics features is obvious. Therefore, it is important to standardize scanning protocols in different instruments and different institutions.

Conclusion

In conclusion, we proposed and validated a CT-based radiomics model, incorporating clinical risk factors and radiomics parameters, which showed a relatively high diagnostic performance for the risk prediction of MSI in patients with CRC. This model may be a potential tool for preoperatively identifying the MSI status and can be used in individualized therapeutic strategy planning and prognostic prediction.

Data Availability Statement

Data are available from the corresponding author upon reasonable request.

Ethics Statement

The studies involving human participants were reviewed and approved by Lanzhou University Second Hospital medical ethics committee. The ethics committee waived the requirement of written informed consent for participation.

Author Contributions

Conception and design: JZho, HB, and YC. Collection and assembly of the data: YC and YY. Development of the methodology: JR. Data analysis and interpretation: All authors. Manuscript writing: All authors. All authors contributed to the article and approved the submitted version.

Funding

This study received funding from the National Natural Science Foundation of China (82071872), Open Fun project of Key Laboratory of Medical Imaging of Gansu Province (GSYX202009), Science and Technology Project of Qinghai Province (No. 2017-SF-158) and Qinghai Provincial Key Clinical Specialty Construction Project.

Conflict of Interest

Author JR was employed by company GE Healthcare. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

43 in total

1. Perinodular and Intranodular Radiomic Features on Lung CT Images Distinguish Adenocarcinomas from Granulomas.

Authors: Niha Beig; Mohammadhadi Khorrami; Mehdi Alilou; Prateek Prasanna; Nathaniel Braman; Mahdi Orooji; Sagar Rakshit; Kaustav Bera; Prabhakar Rajiah; Jennifer Ginsberg; Christopher Donatelli; Rajat Thawani; Michael Yang; Frank Jacono; Pallavi Tiwari; Vamsidhar Velcheti; Robert Gilkeson; Philip Linden; Anant Madabhushi
Journal: Radiology Date: 2018-12-18 Impact factor: 11.105

2. Association Between Younger Age and Targetable Genomic Alterations and Prognosis in Non-Small-Cell Lung Cancer.

Authors: Adrian G Sacher; Suzanne E Dahlberg; Jennifer Heng; Stacy Mach; Pasi A Jänne; Geoffrey R Oxnard
Journal: JAMA Oncol Date: 2016-03 Impact factor: 31.777

3. The value of single-source dual-energy CT imaging for discriminating microsatellite instability from microsatellite stability human colorectal cancer.

Authors: Jingjun Wu; Yue Lv; Nan Wang; Ying Zhao; Pengxin Zhang; Yijun Liu; Anliang Chen; Jianying Li; Xin Li; Yan Guo; Tingfan Wu; Ailian Liu
Journal: Eur Radiol Date: 2019-03-22 Impact factor: 5.315

4. Defective mismatch repair as a predictive marker for lack of efficacy of fluorouracil-based adjuvant therapy in colon cancer.

Authors: Daniel J Sargent; Silvia Marsoni; Genevieve Monges; Stephen N Thibodeau; Roberto Labianca; Stanley R Hamilton; Amy J French; Brian Kabat; Nathan R Foster; Valter Torri; Christine Ribic; Axel Grothey; Malcolm Moore; Alberto Zaniboni; Jean-Francois Seitz; Frank Sinicrope; Steven Gallinger
Journal: J Clin Oncol Date: 2010-05-24 Impact factor: 44.544

5. Development and Validation of a Radiomics Nomogram for Preoperative Prediction of Lymph Node Metastasis in Colorectal Cancer.

Authors: Yan-Qi Huang; Chang-Hong Liang; Lan He; Jie Tian; Cui-Shan Liang; Xin Chen; Ze-Lan Ma; Zai-Yi Liu
Journal: J Clin Oncol Date: 2016-05-02 Impact factor: 44.544

6. Rectal Cancer: Assessment of Neoadjuvant Chemoradiation Outcome based on Radiomics of Multiparametric MRI.

Authors: Ke Nie; Liming Shi; Qin Chen; Xi Hu; Salma K Jabbour; Ning Yue; Tianye Niu; Xiaonan Sun
Journal: Clin Cancer Res Date: 2016-05-16 Impact factor: 12.531

7. Development of a dual-energy spectral CT based nomogram for the preoperative discrimination of mutated and wild-type KRAS in patients with colorectal cancer.

Authors: Yuntai Cao; Guojin Zhang; Haihua Bao; Shenghui Zhang; Jing Zhang; Zhiyong Zhao; Wenjuan Zhang; Weixia Li; Xiaohong Yan; Junlin Zhou
Journal: Clin Imaging Date: 2020-09-06 Impact factor: 1.605

8. The Application of a New Model-Based Iterative Reconstruction in Low-Dose Upper Abdominal CT.

Authors: Yongjun Jia; Bingying Zhai; Taiping He; Yong Yu; Nan Yu; Haifeng Duan; Chuangbo Yang; Xirong Zhang
Journal: Acad Radiol Date: 2019-01-17 Impact factor: 3.173

9. A radiomics model for preoperative prediction of brain invasion in meningioma non-invasively based on MRI: A multicentre study.

Authors: Jing Zhang; Kuan Yao; Panpan Liu; Zhenyu Liu; Tao Han; Zhiyong Zhao; Yuntai Cao; Guojin Zhang; Junting Zhang; Jie Tian; Junlin Zhou
Journal: EBioMedicine Date: 2020-07-30 Impact factor: 8.143

10. Radiomics Analysis of Iodine-Based Material Decomposition Images With Dual-Energy Computed Tomography Imaging for Preoperatively Predicting Microsatellite Instability Status in Colorectal Cancer.

Authors: Jingjun Wu; Qinhe Zhang; Ying Zhao; Yijun Liu; Anliang Chen; Xin Li; Tingfan Wu; Jianying Li; Yan Guo; Ailian Liu
Journal: Front Oncol Date: 2019-11-22 Impact factor: 6.244

1 in total

Review 1. Multi-Omic Approaches in Colorectal Cancer beyond Genomic Data.

Authors: Emilia Sardo; Stefania Napolitano; Carminia Maria Della Corte; Davide Ciardiello; Antonio Raucci; Gianluca Arrichiello; Teresa Troiani; Fortunato Ciardiello; Erika Martinelli; Giulia Martini
Journal: J Pers Med Date: 2022-01-18

1 in total