Literature DB >> 31182111

Novel nomograms to predict lymph node metastasis and liver metastasis in patients with early colon carcinoma.

Yongcong Yan1,2,3, Haohan Liu1,2,3, Kai Mao2, Mengyu Zhang4, Qianlei Zhou1,2,3, Wei Yu1,2,3, Bingchao Shi1,2,3, Jie Wang5, Zhiyu Xiao6.   

Abstract

BACKGROUND: Lymph node status and liver metastasis (LIM) are important in determining the prognosis of early colon carcinoma. We attempted to develop and validate nomograms to predict lymph node metastasis (LNM) and LIM in patients with early colon carcinoma.
METHODS: A total of 32,819 patients who underwent surgery for pT1 or pT2 colon carcinoma were enrolled in the study based on their records in the SEER database. Risk factors for LNM and LIM were assessed based on univariate and multivariate binary logistic regression. The C-index and calibration plots were used to evaluate LNM and LIM model discrimination. The predictive accuracy and clinical values of the nomograms were measured by decision curve analysis. The predictive nomograms were further validated in the internal testing set.
RESULTS: The LNM nomogram, consisting of seven features, achieved the same favorable prediction efficacy as the five-feature LIM nomogram. The calibration curves showed perfect agreement between nomogram predictions and actual observations. The decision curves indicated the clinical usefulness of the prediction nomograms. Receiver operating characteristic curves indicated good discrimination in the training set (area under the curve [AUC] = 0.667, 95% CI 0.661-0.673) and the testing set (AUC = 0.658, 95% CI 0.649-0.667) for the LNM nomogram and encouraging performance in the training set (AUC = 0.766, 95% CI 0.760-0.771) and the testing set (AUC = 0.825, 95% CI 0.818-0.832) for the LIM nomogram.
CONCLUSION: Novel validated nomograms for patients with early colon carcinoma can effectively predict the individualized risk of LNM and LIM, and this predictive power may help doctors formulate suitable individual treatments.

Entities:  

Keywords:  Colon carcinoma; Decision curve analysis; Liver metastasis; Lymph node metastasis; Nomogram; Surveillance, epidemiology, and end results

Mesh:

Year:  2019        PMID: 31182111      PMCID: PMC6558904          DOI: 10.1186/s12967-019-1940-1

Source DB:  PubMed          Journal:  J Transl Med        ISSN: 1479-5876            Impact factor:   5.531


Background

Colorectal cancer (CRC) is estimated to be the third leading cancer type among new cancer cases and deaths in the United States [1]. In 2018, among the two sexes combined, an estimated 97,220 new cases of colon carcinoma (5.6% of all cancer cases) [2] and an estimated 50,630 (8.3%) deaths from that cause occurred [1]. The poor prognosis and frequent recurrence of colon carcinoma might be related to lymph node metastasis (LNM) and distant metastasis [3]. According to the 7th American Joint Committee on Cancer (AJCC) cancer staging system [4], advanced colon carcinoma (stage III or IV) is diagnosed when LNM or distant metastasis occurs, regardless of the pathologic T (pT) classification. Studies have indicated that 27.3% of patients diagnosed with colon carcinoma develop liver metastasis during the course of their disease, and the proportions of patients with synchronous and metachronous liver metastasis (LIM) were 14.5% and 12.8% [5], respectively. In addition, we found that some advanced colon carcinoma patients remained at pT1 or pT2 due to the migration and invasion capabilities of early colon carcinoma. When colon carcinoma is detected in a localized stage, the 5-year relative survival is 91.1%. However, the 5-year relative survival of colon carcinoma patients with regional metastasis or distant metastasis were 71.7% and 13.3%, respectively [6]. Therefore, early detection of colon carcinoma metastasis is important for modifying therapeutic strategies and improving patient prognosis. Most studies of colon cancer metastasis have used lymph nodes to predict the prognosis and recurrence of colon carcinoma [7-11]; research on LIM is much less common. Additionally, there have been few reports or methods to predict LNM and LIM of colon carcinoma. Because the clinicopathological risk factors of LNM and LIM in patients with early colon carcinoma are poorly understood, we attempted to predict the risk factors based on a statistical predictive model. Nomograms are reliable graphical calculating models that are used to accurately calculate and predict individual risk events by combining all risk factors for tumor development [12, 13]. An increasing number of nomograms are being widely established to provide assistance in formulating individual treatment and follow-up management strategies in several cancers, such as oropharyngeal cancer [14], gastrointestinal stromal tumors [15], adenoid cystic carcinoma [16], bladder cancer [17], and prostate cancer [18]. To the best of our knowledge, no nomograms have been carried out to predict LNM and LIM using data gathered from patients with early colon carcinoma in the Surveillance, Epidemiology, and End Results (SEER) database. Here, we performed nomograms to predict LNM and LIM of early colon carcinoma by combining all relevant risk factors. In addition, decision curve analysis (DCA) and an assessment of clinical impact were conducted to illustrate the clinical utility of the model. This study aims to evaluate patients with early colon carcinoma using nomograms, discover patients with high risk scores and help to modify therapeutic strategies in clinical application.

Materials and methods

Patients and study design

The records of patients who underwent surgery for pT1 or pT2 colon carcinoma from 2004 to 2015 were retrieved from the SEER 18 registry database using SEER*Stat 8.3.5 software. The flow chart used for data selection is shown in Fig. 1. “The International Classification of Diseases for Oncology (ICD-O-3) Hist/behav, malignant” was used to screen colon carcinoma cases. “Year of diagnosis” ranged from 2004 to 2015. “Derived AJCC Stage Group 7th (2010+)”, “RX Summ-Surg Prim Site (1998+)”, and “Grading and differentiation codes in ICD-O-2” were used in the present study. The codes in Collaborative Stage (CS) (2004+), including tumor size, extension, lymph nodes and metastases, were also collected. The inclusion criteria were as follows: diagnostic confirmation was achieved based on microscopic analysis, and patient background characteristics (age, gender, race and marital status), tumor-related factors [i.e., tumor size and invasion, tumor numbers, histological grade, carcinoembryonic antigen (CEA), LIM, lung metastasis] and survival information were known and available. The exclusion criteria were as follows: death certificate or autopsy only and age < 18 years old. A total of 32,819 cases in the SEER cohort were included and analyzed. We further randomly divided the patients in a 2-to-1 ratio, forming a training set (n = 21,880) for nomogram construction and a validation set (n = 10,939) for internal verification. The data obtained in this study were rooted mainly in the public SEER database, which is available as open-access data. The ethics committee board of Sun Yat-sen Memorial Hospital, Sun Yat-sen University, approved the use of patients with early colon carcinoma for this study.
Fig. 1

Study flowchart

Study flowchart

Construction and validation of nomograms

Univariable and multivariable analysis were used to identify independent risk factors predictive of LNM and LIM in early colon carcinoma in the SEER discovery set. All variables were screened using the forward stepwise selection method in a multivariate binary logistic regression model [19, 20]. The SEER internal testing set was used to evaluate the predictive reliability and accuracy of the nomograms developed to predict LNM and LIM. For internal validation of the nomogram, we applied a bootstrapping method with 1000 resamples. The predictive performance of the nomograms was measured by a receiver operating characteristic (ROC) curve. Calibration curves were plotted to validate the accuracy and reliability of the nomograms by the Hosmer–Lemeshow test [21].

Clinical utility

DCA was performed to determine the clinical application value of the nomogram models by calculating the net benefits at each risk threshold probability [22, 23]. The net benefit (NB) was determined by subtracting the proportion of all false-positive patients from the proportion of true positives and weighted by the relative harm caused by forgoing treatment compared with the negative consequences of unnecessary treatment, the NB to the population of using the risk model together with highrisk threshold R is: NB = TPR*P−*(1−R)*FPR*(1−P) (TPR: true-positive rate; FPR: false-positive rate; P: prevalence of the outcome; R: proportion of cases with risk above risk threshold) [24]. Additionally, on the basis of the DCAs, we plotted curves to evaluate the clinical impact of the nomogram to help us more intuitively understand its significant value. These curves display the number of high-risk patients, along with the number of high-risk patients with outcomes of metastasis, at different threshold probabilities in a given population [25].

Statistical analysis

All statistical analyses were performed using the software IBM SPSS Statistics (version 24, SPSS Inc., Chicago, IL, USA) and the programming language R (version 3.3.4, http://www.R-project.org) for Windows. Patient clinical characteristics are summarized as the mean (s.d.) for continuous measures. The Chi squared test and Student’s t-test were used to compare categorical variables and continuous variables. The ROC curve, nomogram, calibration plots, DCA and clinical impact curves were calculated in R 3.3.4 with relevant packages, such as the survival ROC, rms, calibrate and decision curve packages. The cutoff values of the risk scores from the predictive nomograms of LNM and LIM were determined based on the maximum Youden index of the ROC curve in the training set, and the patients were divided into low- and high-risk groups. All statistical tests were two-sided, and a P value < 0.05 was considered statistically significant.

Results

Clinical characteristics of patients

The demographic and clinical characteristics of colon carcinoma patients in both cohorts are summarized in Table 1, and there were no significant differences between the two sets (P>0.05, Table 1). LNM was present in 3111 of 21,880 patients (14.2%) and 30 of 10,939 patients (14.5%) in the training and testing sets, respectively. LIM occurred in 1.5% of patients in the training set and 1.2% of patients in the testing set. There was no statistically significant difference in LNM rate (P = 0.277) or LIM rate (P = 0.06) between the two sets. In the correlation analysis, five variables, namely, histological grade, T classification, tumor size, serum CEA level and overall survival, were significantly correlated (P < 0.001) with LNM (Table 2) and LIM (Table 3) in both the training and testing sets.
Table 1

Demographic and clinical characteristics of colon carcinoma patients

Clinicopathological variablesSEER cohort (n = 32,819)P value
Entire cohortTraining n = 21,880Validation n = 10,939
Age67.08 (13.40)67.02 (13.38)67.19 (13.42)0.826
Gender
 Female16,47910,96755120.659
 Male16,34010,9135427
Marital status
 Married18,09312,04660470.922
 Single12,66884624206
 Unknown20581372686
Race
 American Indian/Alaska Native223154690.735
 Asian or Pacific Islander24671650817
 Black395626581298
 White26,17317,4188755
Histological grade
 Well differentiated6214415820560.172
 Moderately differentiated23,52915,6347895
 Poorly differentiated26151761854
 Undifferentiated461327134
Histological type
 Adenocarcinoma28,35618,91794390.620
 Carcinoid tumor986644342
 Neuroendocrine carcinoma29119497
 Mucinous adenocarcinoma15851075510
 Other16011050551
TNM
 I27,70818,48292260.408
 II915833
 III439829061492
 IV619431188
T classification
 T117,01711,34456730.990
 T215,80210,5365266
N classification
 N028,11418,76993450.277
 N1397126231348
 N2734488246
M classification
 M032,20021,44910,7510.512
 M1619431188
Tumor size
 < 5 cm24,48816,29481940.536
 ≥ 5 cm356524111154
 Unknown476631751591
Liver metastasis
 Negative32,36421,55710,8070.06
 Positive455323132
Lung metastasis
 Negative32,71321,80110,9120.09
 Positive916724
 Unknown14122
Bone metastasis
 Negative32,79321,86110,9320.227
 Positive14122
 Unknown1174
Brain metastasis
 Negative32,79821,86710,9310.440
 Positive541
 Unknown1596
CEA
 Negative12,156811140450.943
 Borderline805426
 Positive338522701115
 Unknown17,19811,4455753
Tumor number
 122,78915,21475750.898
 2749549892506
 319141262652
 > 3621415206
Overall survival
 Alive28,20618,79794090.812
 Dead461330831530

CEA, carcinoembryonic antigen

Table 2

Correlations between clinicopathological characteristics of patients and lymph node metastasis in the training and validation sets

Clinicopathological variablesTraining setValidation set
NegativePositiveP valueNegativePositiveP value
Age67.36 (13.31)64.97 (13.58) 0.023 67.52 (13.36)65.18 (13.59)0.357
Gender
 Female942115460.61947048080.815
 Male934815654641786
Marital status
 Married10,2601786 0.002 5141906 0.017
 Single729711653593613
 Unknown121216061175
Race
 American Indian/Alaska Native132225811 0.001
 Asian or Pacific Islander1363287675142
 Black21904681074224
 White15,084233475381217
Histologic grade
 Well differentiated3790368 < 0.001 1885171 < 0.001
 Moderately differentiated13,445218967471148
 Poorly differentiated1292469620234
 Undifferentiated242859341
Histologic type
 Adenocarcinoma16,2762641 < 0.001 80601379 0.005
 Carcinoid tumor5657931230
 Neuroendocrine carcinoma144508116
 Mucinous adenocarcinoma89617941892
 Other88816247477
T classification
 T110,3131031 < 0.001 5125548< 0.001
 T28456208042201046
Tumor size
 <5 cm13,9492345< 0.00169841210< 0.001
 ≥ 5 cm1927484919235
 Unknown28912821442149
CEA
 Negative68091302< 0.0013405640< 0.001
 Borderline4311224
 Positive1770500861254
 Unknown10,14712985057696
Tumor number
 112,99622180.12564231152 0.035
 243156742170336
 3110016257280
 > 33585718026
Overall survival
 Alive16,2582539< 0.00180871322< 0.001
 Dead25115721258272

Italic values: statistical differences are significant. CEA, carcinoembryonic antigen

Table 3

Correlations between clinicopathological characteristics of patients and liver metastasis in the training and validation sets

Clinicopathological variablesTraining setValidation set
NegativePositiveP valueNegativePositiveP value
Age67.07 (13.38)63.70 (12.88)0.49467.24 (13.42)62.78 (13.15)0.481
Gender
 Female10,8211460.084546448 0.001
 Male10,736177534384
Marriage
 Married11,8811650.3115970770.461
 Single8324138415650
 Unknown1352206815
Race
 American Indian/Alaska Native1522 0.006 6720.063
 Asian or Pacific Islander16341680314
 Black260058127622
 White17,171247866194
Histological grade
 Well differentiated412434< 0.001204313< 0.001
 Moderately differentiated15,3942407795100
 Poorly differentiated17204184113
 Undifferentiated31981286
Histological type
 Adenocarcinoma18,6322850.06793271120.154
 Carcinoid tumor64133411
 Neuroendocrine carcinoma1904943
 Mucinous adenocarcinoma1054215019
 Other1040105447
T classification
 T111,222122< 0.001563142< 0.001
 T210,335201517690
Tumor size
 <5 cm16,310164< 0.001812074< 0.001
 ≥ 5 cm231299111836
 Unknown311560156922
CEA
 Negative805754< 0.001401926< 0.001
 Borderline531251
 Positive2097173105758
 Unknown11,35095570647
Tumor number
 115,0012130.4417482930.687
 2490485247432
 31242206466
 > 341052051
Overall survival
 Alive18,646151< 0.001934267< 0.001
 Dead2911172146565

Italic values: differences are statistically significant

CEA, carcinoembryonic antigen

Demographic and clinical characteristics of colon carcinoma patients CEA, carcinoembryonic antigen Correlations between clinicopathological characteristics of patients and lymph node metastasis in the training and validation sets Italic values: statistical differences are significant. CEA, carcinoembryonic antigen Correlations between clinicopathological characteristics of patients and liver metastasis in the training and validation sets Italic values: differences are statistically significant CEA, carcinoembryonic antigen

Independent significant factors in the training set

To further identify candidate predictors of LNM and LIM, we evaluated all clinicopathological features by binary logistic regression analysis. Risk factors for LNM and LIM were initially identified by univariate logistic regression analysis in the training set (Table 4). Marital status, histological grade, histological type, T classification, tumor size and CEA were associated with LNM. Additionally, there were eight clinicopathological variables related to LIM, namely, age, race, histological grade, histological type, T classification, tumor size, CEA and N classification. A multivariate regression analysis was performed on all factors to verify the risk factors of LNM and LIM (Table 5). Eight variables were actually associated with LNM: age (45–65: odds ratio (OR) 0.83, 95% CI 0.692 to 0.996, P = 0.045; ≥ 65: 0.525, 0.438 to 0.63, P < 0.001), marital status (Single: 0.898, 0.826 to 0.976, P = 0.012; Unknown: 0.806, 0.675 to 0.962, P = 0.017), race (White: 0.732, 0.637 to 0.842, P < 0.001), histological grade (Moderately differentiated: 1.644, 1.442 to 1.875, P < 0.001; Poorly differentiated: 3.641, 3.088 to 4.292, P < 0.001; Undifferentiated: 3.462, 2.609 to 4.593, P < 0.001), histological type (Carcinoid tumor: 1.752, 1.328 to 2.311, P < 0.001; Neuroendocrine carcinoma: 3.74, 2.613 to 5.534, P < 0.001), T classification (T2: 2.221, 2.03 to 2.431, P < 0.001), tumor size (≥ 5 cm: 1.125, 1.003 to 1.262, P = 0.045; Unknown: 0.84, 0.731 to 0.967, P = 0.015) and CEA (Positive: 1.385, 1.228 to 1.561, P < 0.001; Unknown: 0.74, 0.678 to 0.808, P < 0.001). Similarly, LIM was related to five variables: age (≥ 65: 0.532, 0.332 to 0.851, P = 0.008), histologic grade (Moderately differentiated: 1.501, 1.032 to 2.184, P = 0.034; Poorly differentiated: 1.670, 1.028 to 2.714, P = 0.038), tumor size (≥ 5 cm: 2.886, 2.203 to 3.783, P < 0.001; Unknown: 2.463, 1.8 to 3.37, P < 0.001), CEA (positive: 10.436, 7.595 to 14.335, P < 0.001) and N classification (N1: 3.909, 2.999 to 5.095, P < 0.001; N2: 12.131, 8.670 to 16.975, P < 0.001).
Table 4

Risk factors for lymph node metastasis and liver metastasis identified by univariate logistic regression analysis

Clinicopathological variablesLymph node metastasisLiver metastasis
OR95% CIP valueOR95% CIP value
Age
 <4511
 45–650.860.729–1.0160.760.7740.5–1.20.253
 ≥ 650.6090.517–0.7170.5280.342–0.814 0.004
Gender
 Female11
 Male1.020.946–1.1010.6061.2220.98–1.5240.075
Marital status
 Married11
 Single0.9170.847–0.993 0.033 1.1940.951–1.4990.128
 Unknown0.7580.638–0.901 0.002 1.0650.667–1.70.791
Race
 Asian or Pacific Islander11
 American Indian/Alaska Native0.7920.495–1.2650.3281.3440.306–5.8990.695
 White0.7350.642–0.8411.4690.884–2.4420.138
 Black1.0150.863–1.1930.8582.2781.305–3.976 0.004
Histological grade
 Well differentiated11
 Moderately differentiated1.6671.493–1.883< 0.0011.8911.318–2.713< 0.001
 Poorly differentiated3.7393.217–4.345< 0.0012.8911.829–4.571< 0.001
 Undifferentiated3.6172.763–4.735< 0.0013.0431.396–6.626 0.005
Histological type
 Adenocarcinoma11
 Carcinoid tumor0.8620.679–1.0940.2220.3060.098–0.957 0.042
 Neuroendocrine carcinoma2.141.547–2.96< 0.0011.3760.508–3.3710.53
 Mucinous adenocarcinoma1.2311.043–1.453 0.014 1.3030.833–2.3080.247
 Other1.1240.946–1.3360.1830.6290.334–1.1850.151
T classification
 T111
 T22.4612.271–2.665< 0.0011.7891.426–2.244< 0.001
Tumor size
 <5 cm11
 ≥ 5 cm1.4941.34–1.666< 0.0014.2123.269–5.425< 0.001
 Unknown0.580.509–0.66< 0.0011.8941.406–2.553< 0.001
CEA
 Negative11
 Borderline1.3380.688–2.6010.3912.8150.382–20.7260.31
 Positive1.4771.316–1.658< 0.00112.3099.035–16.77< 0.001
 Unknown0.6690.616–0.7271.2490.893–1.7460.194
Tumor number
 111
 20.9150.834–1.0040.0611.2220.947–1.5730.123
 30.8630.727–1.0240.0911.1340.714–1.80.593
 >30.9330.703–1.2380.6310.8590.352–2.0960.738
N classification
 N011
 N1NANANA4.6873.64–6.036< 0.001
 N2NANANA17.3512.761–23.59< 0.001

Italic values: differences are statistically significant. OR: odds ratio; 95% CI, 95% confidence interval; CEA, carcinoembryonic antigen; NA, not available

Table 5

Risk factors for lymph node metastasis and liver metastasis identified by multivariate logistic regression analysis

Clinicopathological variablesLymph node metastasisLiver metastasis
OR95% CIP valueOR95% CIP value
Age
 <4511
 45–650.830.692–0.996 0.045 0.7510.468–1.2060.236
 ≥ 650.5250.438–0.63< 0.0010.5320.332–0.851 0.008
Marriage
 Married1
 Single0.8980.826–0.976 0.012
 Unknown0.8060.675–0.962 0.017
Race
 Asian or Pacific Islander1
 American Indian/Alaska Native0.7590.469–1.2270.261
 White0.7320.637–0.842< 0.001
 Black1.0220.863–1.210.799
Histological grade
 Well differentiated11
 Moderately differentiated1.6441.442–1.875< 0.0011.5011.032–2.184 0.034
 Poorly differentiated3.6413.088–4.292< 0.0011.6701.028–2.714 0.038
 Undifferentiated3.4622.609–4.593< 0.0011.9390.847–4.4370.117
Histological type
 Adenocarcinoma1
 Carcinoid tumor1.7521.328–2.311< 0.001
 Neuroendocrine carcinoma3.742.613–5.534< 0.001
 Mucinous adenocarcinoma1.0460.881–1.2410.607
 Other1.1180.933–1.3390.226
T classification
 T11
 T22.2212.03–2.431< 0.001
Tumor size
 <5 cm11
 ≥ 5 cm1.1251.003–1.262 0.045 2.8862.203–3.783< 0.001
 Unknown0.840.731–0.967 0.015 2.4631.8–3.37< 0.001
CEA
 Negative11
 Borderline1.4680.743–2.90.2692.7630.367–20.8150.324
 Positive1.3851.228–1.561< 0.00110.4367.595–14.335< 0.001
 Unknown0.740.678–0.808< 0.0011.3950.994–1.9580.055
N classification
 N011
 N1NANANA3.9092.999–5.095< 0.001
 N2NANANA12.1318.670–16.975< 0.001

OR, odds ratio; 95% CI, 95% confidence interval; NA, not available

Italic values: differences are statistically significant

Risk factors for lymph node metastasis and liver metastasis identified by univariate logistic regression analysis Italic values: differences are statistically significant. OR: odds ratio; 95% CI, 95% confidence interval; CEA, carcinoembryonic antigen; NA, not available Risk factors for lymph node metastasis and liver metastasis identified by multivariate logistic regression analysis OR, odds ratio; 95% CI, 95% confidence interval; NA, not available Italic values: differences are statistically significant

Development of nomograms for LNM and LIM prediction

Based on the independent risk factors identified in the multivariate regression analysis, two nomograms were developed to predict the possibility of LNM (Fig. 2a) and LIM (Fig. 2b) in patients with early colon carcinoma. Furthermore, point assignments and predictive scores for each variable in the nomogram models were calculated in Table 6. According to the LNM nomogram, histological grade made the largest contribution, followed by T stage, age, marital status, serum CEA level and histological type. N classification made the largest contribution in the LIM nomogram, followed by histological grade, tumor size, serum CEA level and age. The calibration curves for predicting LNM and LIM in the training set (Fig. 2c, e) showed good agreement between predictions and observations.
Fig. 2

Nomogram and calibration curves for predicting lymph node metastasis and liver metastasis in patients with early colon carcinoma. There are seven factors in the lymph node metastasis prediction nomogram (a) and five factors in the liver metastasis prediction nomogram (b). Calibration curves for predicting lymph node metastasis and liver metastasis in the training set (c, e) and in the testing set (d, f) are shown. All the points assigned on the top point scale for each factor are summed together to generate a total point score. The total point score is projected on the bottom scales to determine the probability of cancer metastasis in an individual. The nomogram-predicted frequency of metastasis is plotted on the x-axis, and the actual observed frequency of metastasis is plotted on the y-axis

Table 6

Point assignments and predictive scores for each variable in the nomogram models

VariablesClassificationNomogram score
Lymph node metastasisLiver metastasis
Age< 455816
45–65298
≥ 6500
MarriageMarried11NA
Single6NA
Unknown0NA
Histological gradeWell differentiated00
Moderately differentiated338
Poorly differentiated6716
Undifferentiated10023
Histological typeAdenocarcinoma0NA
Carcinoid tumor3NA
Neuroendocrine carcinoma5NA
Mucinous adenocarcinoma8NA
Other10NA
T classificationT10NA
T254NA
Tumor size< 5 cm60
≥ 5 cm317
Unknown035
CEANegative140
Borderline95
Positive510
Unknown015
N classificationN0NA0
N1NA50
N2NA100

CEA, carcinoembryonic antigen; NA, not available

Nomogram and calibration curves for predicting lymph node metastasis and liver metastasis in patients with early colon carcinoma. There are seven factors in the lymph node metastasis prediction nomogram (a) and five factors in the liver metastasis prediction nomogram (b). Calibration curves for predicting lymph node metastasis and liver metastasis in the training set (c, e) and in the testing set (d, f) are shown. All the points assigned on the top point scale for each factor are summed together to generate a total point score. The total point score is projected on the bottom scales to determine the probability of cancer metastasis in an individual. The nomogram-predicted frequency of metastasis is plotted on the x-axis, and the actual observed frequency of metastasis is plotted on the y-axis Point assignments and predictive scores for each variable in the nomogram models CEA, carcinoembryonic antigen; NA, not available

Performance and validation of nomograms for LNM and LIM prediction

The calibration curves for predicting LNM and LIM demonstrated that the nomograms were generally well calibrated in the testing set (Fig. 2d, f). To compare the predictive values for LNM and LIM of the nomogram models and clinicopathological risk factors, we applied ROC analysis. In the ROC curves of LNM in the training set (Fig. 3a) and the testing set (Fig. 3b), the area-under-the-curve (AUC) values of the nomograms were 0.667 (95% CI 0.661–0.673) and 0.658 (95% CI 0.649–0.667), respectively; these values were significantly larger than the AUCs of grade, tumor size and histological type in both sets (P < 0.0001). Similarly, the AUCs of nomograms of LIM in the training set (Fig. 3c) and the testing set (Fig. 3d), with values of 0.766 (95% CI, 0.760–0.771) and 0.825 (95% CI, 0.818–0.832), respectively, were higher than those for histological grade, histological type, tumor size and N classification. Moreover, we generated bar charts to evaluate the discriminatory power of the nomograms in LNM and LIM after calculating the risk scores from the nomograms. Using the maximum Youden index in the training set, we obtained cutoff values of 79 and 33 for the LNM and LIM nomograms, respectively. All patients were divided into low- and high-risk groups. Patients with predicted high-risk LNM actually had a higher proportion of N1 and N2 classification than the low-risk group in the training set (Fig. 4a). The proportion of N1 and N2 classification in the testing set was near the proportions in the training set (Fig. 4b). Similarly, the high-risk group had a greater possibility of LIM than the low-risk group in both the training and testing sets (Fig. 4c, d).
Fig. 3

Receiver operating characteristic (ROC) curve analysis for lymph node metastasis and liver metastasis. Comparisons of the predictive values of the nomogram models and clinicopathological risk factors for lymph node metastasis and liver metastasis according to ROC analysis. ROC curves of lymph node metastasis in the training set (a) and the testing set (b); ROC curves of liver metastasis in the training set (c) and the testing set (d). The AUC was calculated, and its 95% CI was estimated by bootstrapping. The P values were two-sided. Abbreviations: LN, lymph nodes; ROC, receiver operating characteristic; 95% CI, 95% confidence interval

Fig. 4

Discriminatory power of the nomograms for lymph node metastasis and liver metastasis, illustrated with bar charts. Risk classification for the predictive nomograms was conducted by the maximum Youden index of the ROC curve, and their performance in distinguishing lymph node metastasis and liver metastasis in the training set (a, c) and the testing set (b, d) were plotted

Receiver operating characteristic (ROC) curve analysis for lymph node metastasis and liver metastasis. Comparisons of the predictive values of the nomogram models and clinicopathological risk factors for lymph node metastasis and liver metastasis according to ROC analysis. ROC curves of lymph node metastasis in the training set (a) and the testing set (b); ROC curves of liver metastasis in the training set (c) and the testing set (d). The AUC was calculated, and its 95% CI was estimated by bootstrapping. The P values were two-sided. Abbreviations: LN, lymph nodes; ROC, receiver operating characteristic; 95% CI, 95% confidence interval Discriminatory power of the nomograms for lymph node metastasis and liver metastasis, illustrated with bar charts. Risk classification for the predictive nomograms was conducted by the maximum Youden index of the ROC curve, and their performance in distinguishing lymph node metastasis and liver metastasis in the training set (a, c) and the testing set (b, d) were plotted Kaplan–Meier survival curves of overall survival for patients according to LNM (Fig. 5a) and LIM (Fig. 5b) in the entire SEER cohort verified that patients who were predicted to have LNM or LIM had a significant disadvantage in overall survival (P < 0.0001). DCAs were performed on the nomograms for predicting LNM (Fig. 5c) and LIM (Fig. 5d) in the training set. Threshold probabilities of 0–0.3 for LNM or 0–0.2 for LIM were the most beneficial for predicting LNM and LIM with our nomograms. Based on these DCAs of LNM, we further plotted curves to evaluate the clinical impact of the nomograms to help us more intuitively understand their substantial value. Clinical impact curves of the LNM nomogram in the training set (Fig. 5e) and testing set (Fig. 5f) showed that the model had remarkable predictive power: the predicted number of high-risk patients was always greater than the number of high-risk patients with outcomes of metastasis when the risk threshold was in the range of 0–0.3, and the cost–benefit ratios would be acceptable in the same range.
Fig. 5

Kaplan–Meier survival curves, decision curve analyses, and clinical impact curves of overall survival for patients. Kaplan–Meier survival curves representing the overall survival of patients with lymph node metastasis (a) and liver metastasis (b) in the entire SEER cohort. The decision curves of the nomograms for predicting lymph node metastasis (c) and liver metastasis (d) in the training set were plotted. Clinical impact curves of the nomogram to predict lymph node metastasis in the training set (e) and the testing set (f) are shown. The y-axis represents the net benefit. The x-axis shows the threshold probability. The horizontal solid black line represents the hypothesis that no patients experienced lymph node metastasis or liver metastasis, and the solid gray line represents the hypothesis that all patients met the endpoint (c, d). At different threshold probabilities within a given population, the number of high-risk patients and the number of high-risk patients with the outcome were plotted (e, f)

Kaplan–Meier survival curves, decision curve analyses, and clinical impact curves of overall survival for patients. Kaplan–Meier survival curves representing the overall survival of patients with lymph node metastasis (a) and liver metastasis (b) in the entire SEER cohort. The decision curves of the nomograms for predicting lymph node metastasis (c) and liver metastasis (d) in the training set were plotted. Clinical impact curves of the nomogram to predict lymph node metastasis in the training set (e) and the testing set (f) are shown. The y-axis represents the net benefit. The x-axis shows the threshold probability. The horizontal solid black line represents the hypothesis that no patients experienced lymph node metastasis or liver metastasis, and the solid gray line represents the hypothesis that all patients met the endpoint (c, d). At different threshold probabilities within a given population, the number of high-risk patients and the number of high-risk patients with the outcome were plotted (e, f)

Discussion

Colon carcinoma ranks fourth in terms of incidence but fifth in terms of mortality worldwide in 2018. In 2018, among both genders combined, the incidence of colon carcinoma is approximately 1,096,601 new cases, and the mortality is approximately 551,269 [26]. Death from colon carcinoma typically occurs due to distant metastasis, while lymph node metastases are thought to occur before distant metastasis [3]. A study has reported that an increased number of lymph nodes evaluated is associated with increased survival. Therefore, lymph node evaluation is important for the prognosis and treatment of patients with colon cancer and may be a measure of quality care [9]. For distant metastasis, a population-based cancer registry in Burgundy reported that 27.3% of patients diagnosed with colon carcinoma develop LIM during the course of their disease, and the 5-year cumulative metachronous LIM rate was 14.5% in general, 3.7% for TNM stage I tumors, and 13.3% for stage II [5]. Metachronous LIM also contributed greatly to the poor prognosis and recurrence of colon carcinoma. When metastasis occurs, surgical treatments such as en bloc resections of the affected segments of the bowel and the associated draining lymph nodes [27], as well as adjuvant therapies, should be applied [28]. Partial or total colectomy is performed in the majority of patients with stage I and II colon cancer (84%), while 67% and 40% of patients with stage III and stage IV, respectively, receive chemotherapy in addition to colectomy to lower their risk of recurrence [29]. Several studies have examined the number [9], distribution and size of affected lymph nodes [8] or the ratio of metastatic to examined lymph nodes [7] to evaluate colon cancer survival. Some researchers have focused on mRNA expression of genes related to lymph nodes, such as guanylyl cyclase C (GCC) [11] and metastasis associated in colon cancer 1 (MACC1) [10], to evaluate colon cancer prognosis. It is unknown whether LIM is derived from cancer cells that first colonize intestinal lymph nodes or whether such metastases can form without prior lymph node involvement in colorectal cancer. Enquist et al. found direct hematogenous spread as a dissemination route contributing to CRC liver metastasis in CRC mouse models [30]. Therefore, the correlations between LNM, LIM and tumor recurrence should not be ignored, and in order to modify therapeutic strategies and improve patient prognosis, it is essential to estimate the risks of LNM and LIM in early colon carcinoma. c-MET, a proto-oncogene that initiates a range of signals to regulate various cellular functions, has been suggested to be associated with CRC progression [31] . Hiroya Takeuchi and coworkers reported that c-MET copy numbers in primary CRC of N1/N2-stage patients were significantly higher than the copy numbers in N0 cases (P < 0.03) and that overexpression of c-MET mRNA in primary CRC may be a predictor of tumor invasion and lymph node metastases [32]. Zuo et al. found that serum soluble lectin, which was increased in colon cancer patients with LIM compared to those without metastases, might be a promising new target for intervention in metastasis formation [33]. However, fundamental studies are not a direct way to predict metastasis in daily clinical practice and would be costly even if they could be employed in the clinic. As a result, we focused on clinical studies based on clinicopathological risk factors. Some researchers have estimated the risk of metastasis using clinicopathological variables and nomograms. A study of 160 patients with early colorectal cancer assessed CT and MRI data to establish imaging criteria for LNM and concluded that a short-diameter size criterion of ≥ 4.1 mm for metastatic lymph nodes showed sensitivity of 78.6% and specificity of 75% [34]. In addition, Yan-qi Huang et al. developed and validated a radiomics-based nomogram incorporating the radiomics signature, CT-imaged lymph node status, and clinical risk factors to facilitate the preoperative individualized prediction of LNM in patients with colorectal cancer [24]. Martin R. Weiser and colleagues developed a colon cancer recurrence nomogram to predict relapse based on the number of positive and negative lymph nodes, lymphovascular invasion and other risk factors [35]. Because nomograms are commonly used tools for prognosis in oncology and medicine [22] and straight scales are useful for relatively simple calculations, we decided to build a nomogram for LNM and LIM prediction in early colon carcinoma. The scarcity of studies examining liver metastasis in colon carcinoma supported our decision to develop a nomogram for predicting LIM in early colon carcinoma. Two nomograms were constructed and validated for predicting LNM and LIM in patients with early colon carcinoma. The nomogram for LNM incorporates seven factors, namely, age, marital status, CEA, histological type, T classification, histological grade and tumor size, while the nomogram for LIM includes five factors: age, CEA, tumor size, histological grade and N classification. Both of the nomograms demonstrated good agreement between predictions and observations in the training and testing sets. Furthermore, better diagnostic efficiencies were shown by ROC curves in comparison with histologic grade, histologic type, tumor size and N classification. In particular, the AUCs of the LIM nomograms were calculated with values of 0.766 (0.760–0.771) and 0.825 (0.818–0.832), respectively, in the training set and the testing set. However, the nomograms might not be useful with greater AUCs and good agreement between predictions and observations [13]. Therefore, decision curve analyses were performed in the present study. DCA is a novel method for evaluating diagnostic tests, prediction models and molecular markers. This method can also be easily extended to many of the applications common to performance measures for prediction models [22]. Here, good clinical utility was indicated in the proper range. Moreover, the clinical impact of the LNM nomogram on the basis of DCA, Kaplan–Meier survival curves and bar charts with Chi squared tests was used to improve the discriminatory power of the nomograms. The nomograms for predicting LNM and LIM actually possess good prediction efficiencies as judged by the methods above. In our study, a large number of cases in the SEER dataset were chosen and randomly divided into a training set and an internal testing set. Our purpose was to evaluate the prediction of LNM and LIM in early colon carcinoma from large quantities of patient data, which are convincing and readily available in clinical decision making. For clinical application, it is important to make the assessment of risk factors as convenient as possible. We considered the variables needed in our nomogram to be prevalent in clinical practice and convenient to acquire. The limitations of our study are the lack of external validation for the nomogram and the absence of genetic markers. Because the testing set in this study was derived from the same SEER dataset as the training study, potentially leading to overfitting of the model, external validation at our hospital or another institution should be performed. Multicenter validation with a large sample size is preferable because it yields high-level evidence for clinical application. In addition, our research did not incorporate genetic markers because clinical risk factors are easier to collect. However, a combination of clinical variables and genetic markers may improve the prediction of LNM and LIM in patients with early colon carcinoma.

Conclusions

In conclusion, based on the clinical risk factors identified in a large population-based cohort, we established the first practical nomograms that can objectively and accurately predict individualized risk of LNM and LIM. Moreover, the internal cohort validation results demonstrate that the two nomograms perform well and have high accuracy and reliability. Our nomograms were demonstrated to be clinically useful in DCAs, and they should therefore help clinicians to improve individual treatment, make clinical decisions and guide follow-up management strategies for patients with early colon carcinoma.
  34 in total

Review 1.  Scatter-factor and semaphorin receptors: cell signalling for invasive growth.

Authors:  Livio Trusolino; Paolo M Comoglio
Journal:  Nat Rev Cancer       Date:  2002-04       Impact factor: 60.716

2.  Development and Validation of Nomograms Predictive of Overall and Progression-Free Survival in Patients With Oropharyngeal Cancer.

Authors:  Carole Fakhry; Qiang Zhang; Phuc Felix Nguyen-Tân; David I Rosenthal; Randal S Weber; Louise Lambert; Andy M Trotti; William L Barrett; Wade L Thorstad; Christopher U Jones; Sue S Yom; Stuart J Wong; John A Ridge; Shyam S D Rao; James A Bonner; Eric Vigneault; David Raben; Mahesh R Kudrimoti; Jonathan Harris; Quynh-Thu Le; Maura L Gillison
Journal:  J Clin Oncol       Date:  2017-08-04       Impact factor: 44.544

3.  Cancer statistics, 2018.

Authors:  Rebecca L Siegel; Kimberly D Miller; Ahmedin Jemal
Journal:  CA Cancer J Clin       Date:  2018-01-04       Impact factor: 508.702

Review 4.  Nomograms in oncology: more than meets the eye.

Authors:  Vinod P Balachandran; Mithat Gonen; J Joshua Smith; Ronald P DeMatteo
Journal:  Lancet Oncol       Date:  2015-04       Impact factor: 41.316

5.  Colorectal cancer statistics, 2017.

Authors:  Rebecca L Siegel; Kimberly D Miller; Stacey A Fedewa; Dennis J Ahnen; Reinier G S Meester; Afsaneh Barzi; Ahmedin Jemal
Journal:  CA Cancer J Clin       Date:  2017-03-01       Impact factor: 508.702

6.  Novel roles of liver sinusoidal endothelial cell lectin in colon carcinoma cell adhesion, migration and in-vivo metastasis to the liver.

Authors:  Yunfei Zuo; Shuangyi Ren; Min Wang; Biao Liu; Juntao Yang; Xuezhang Kuai; Changwei Lin; Dianyuan Zhao; Li Tang; Fuchu He
Journal:  Gut       Date:  2012-05-25       Impact factor: 23.059

7.  Colon cancer survival is associated with decreasing ratio of metastatic to examined lymph nodes.

Authors:  Adam C Berger; Elin R Sigurdson; Thomas LeVoyer; Alexandra Hanlon; Robert J Mayer; John S Macdonald; Paul J Catalano; Daniel G Haller
Journal:  J Clin Oncol       Date:  2005-12-01       Impact factor: 44.544

8.  Decision curve analysis: a novel method for evaluating prediction models.

Authors:  Andrew J Vickers; Elena B Elkin
Journal:  Med Decis Making       Date:  2006 Nov-Dec       Impact factor: 2.583

9.  MACC1 mRNA levels predict cancer recurrence after resection of colorectal cancer liver metastases.

Authors:  Claudio Isella; Alfredo Mellano; Francesco Galimi; Consalvo Petti; Lorenzo Capussotti; Michele De Simone; Andrea Bertotti; Enzo Medico; Andrea Muratore
Journal:  Ann Surg       Date:  2013-06       Impact factor: 12.969

10.  Extensions to decision curve analysis, a novel method for evaluating diagnostic tests, prediction models and molecular markers.

Authors:  Andrew J Vickers; Angel M Cronin; Elena B Elkin; Mithat Gonen
Journal:  BMC Med Inform Decis Mak       Date:  2008-11-26       Impact factor: 2.796

View more
  14 in total

1.  Development and external validation of prognostic nomograms for liver disease-free and overall survival in locally advanced rectal cancer with neoadjuvant therapy: a post cohort study based on the FOWARC trial.

Authors:  Jiaming Zhou; Tuoyang Li; Yuanlv Xiao; Jinxin Lin; Xiaoqiong Chen; Shaoyong Peng; Mingzhe Huang; Xuebin Shi; Linbin Cai; Pinzhu Huang; Meijin Huang
Journal:  Ann Transl Med       Date:  2022-06

2.  Integrated nomogram based on five stage-related genes and TNM stage to predict 1-year recurrence in hepatocellular carcinoma.

Authors:  Haohan Liu; Yongcong Yan; Ruibing Chen; Mengdi Zhu; Jianhong Lin; Chuanchao He; Bingchao Shi; Kai Wen; Kai Mao; Zhiyu Xiao
Journal:  Cancer Cell Int       Date:  2020-04-29       Impact factor: 5.722

3.  A novel nomogram for predicting liver metastasis in patients with gastrointestinal stromal tumor: a SEER-based study.

Authors:  Guowei Zhou; Keshuai Xiao; Guanwen Gong; Jiabao Wu; Ya Zhang; Xinxin Liu; Zhiwei Jiang; Chaoqun Ma
Journal:  BMC Surg       Date:  2020-11-25       Impact factor: 2.102

4.  BGN and COL11A1 Regulatory Network Analysis in Colorectal Cancer (CRC) Reveals That BGN Influences CRC Cell Biological Functions and Interacts with miR-6828-5p.

Authors:  Danqi Chen; Ying Qin; Mengmeng Dai; Lulu Li; Hongpeng Liu; Yaoyao Zhou; Cheng Qiu; Yan Chen; Yuyang Jiang
Journal:  Cancer Manag Res       Date:  2020-12-22       Impact factor: 3.989

5.  Construction of Nomograms for Predicting Lung and Bone Metastases in Patients with Intrahepatic Cholangiocarcinoma and Identification of Patients Who Can Benefit from Chemotherapy.

Authors:  Ning Shi; Yiping Zou; Yuanpeng Zhang; Hongwei Han; Zhihong Chen; Shiye Ruan; Liang Jin; Zuyi Ma; Zhenrong Chen; Qi Lou; Haosheng Jin
Journal:  J Oncol       Date:  2020-12-02       Impact factor: 4.375

6.  Establishment and validation of a novel predictive model to quantify the risk of bone metastasis in patients with prostate cancer.

Authors:  Yu-Jie Lu; Wei-Ming Duan
Journal:  Transl Androl Urol       Date:  2021-01

7.  A Web-Based Prediction Model for Overall Survival of Elderly Patients With Malignant Bone Tumors: A Population-Based Study.

Authors:  Jie Tang; JinKui Wang; Xiudan Pan
Journal:  Front Public Health       Date:  2022-01-11

8.  A web-based prediction model for overall survival of elderly patients with early renal cell carcinoma: a population-based study.

Authors:  Jinkui Wang; Jie Tang; Tiaoyao Chen; Song Yue; Wanting Fu; Zulong Xie; Xiaozhu Liu
Journal:  J Transl Med       Date:  2022-02-14       Impact factor: 5.531

9.  Risk factors and predictors of lymph nodes metastasis and distant metastasis in newly diagnosed T1 colorectal cancer.

Authors:  Kaibo Guo; Yuqian Feng; Li Yuan; Harpreet S Wasan; Leitao Sun; Minhe Shen; Shanming Ruan
Journal:  Cancer Med       Date:  2020-05-29       Impact factor: 4.452

10.  A novel prognostic model for adult patients with Hemophagocytic Lymphohistiocytosis.

Authors:  Jun Zhou; Jing Zhou; Zhi-Qi Wu; Hemant Goyal; Hua-Guo Xu
Journal:  Orphanet J Rare Dis       Date:  2020-08-20       Impact factor: 4.123

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.