Zhiyong Guo1, Zilin Wang1, Yige Liu1, Jing Han1, Jiannan Liu1, Chenping Zhang1. 1. Department of Oromaxillofacial-Head & Neck Oncology, Shanghai Ninth People's Hospital, College of Stomatology, Shanghai Jiao Tong University School of Medicine, National Clinical Research Center for Oral Diseases, Shanghai Key Laboratory of Stomatology & Shanghai Research Institute of Stomatology, Shanghai, China.
Abstract
BACKGROUND: Major salivary glands carcinoma (MSGC) is a relatively rare cancer with diverse histological types and biological behavior. The treatment planning and prognosis prediction are challenging for clinicians. The aim of the current study was to establish a reliable and effective nomogram to predict the overall survival (OS) and cancer-specific survival (CSS) for MSGC patients. METHODS: Patients pathologically diagnosed with MSGC were recruited from Surveillance, Epidemiology, and End Results (SEER) database and randomly divided into training and validation groups (7:3 ratio). Univariate, multivariate Cox proportional hazard models, and least absolute shrinkage and selection operator (LASSO) regression were adopted for the selection of risk factors. Nomograms were developed using R software. The model performance was evaluated by drawing receiver operating characteristic (ROC), overtime C-index curves, and calibration curves. Harrell C-index, areas under the curves (AUC), and Brier score were also calculated. The decision curve analysis (DCA) was conducted to measure the net clinical benefit. RESULTS: A total of 11,362 patients were identified and divided into training (n=7,953) and validation (n=3,409) dataset. Sex, age, race, marital status, site, differentiation grade, American Joint Committee on Cancer (AJCC) stage, T/N/M stage, tumor size, surgery, and histological type were incorporated into the Cox hazard model for OS prediction after variable selection, while all predictors, except for marital status and site, were selected for CSS prediction. For 5-year prediction, the AUC of the nomogram for OS and CSS was 83.5 and 82.7 in the training and validation dataset, respectively. The C-index was 0.787 for OS and 0.798 for CSS in the validation group. The Brier score was 0.0153 and 0.0130 for OS and CSS, respectively. The calibration curves showed that the nomogram had well prediction accuracy. From the perspective of DCA, a nomogram was superior to the AJCC stage and TNM stage in net benefit. In general, the performance of the nomogram was consistently better compared to the AJCC stage and TNM stage across all settings. CONCLUSIONS: The performance of the novel nomogram for predicting OS and CSS of MSGC patients was further verified, revealing that it could be used as a valuable tool in assisting clinical decision-making. 2021 Annals of Translational Medicine. All rights reserved.
BACKGROUND: Major salivary glands carcinoma (MSGC) is a relatively rare cancer with diverse histological types and biological behavior. The treatment planning and prognosis prediction are challenging for clinicians. The aim of the current study was to establish a reliable and effective nomogram to predict the overall survival (OS) and cancer-specific survival (CSS) for MSGC patients. METHODS: Patients pathologically diagnosed with MSGC were recruited from Surveillance, Epidemiology, and End Results (SEER) database and randomly divided into training and validation groups (7:3 ratio). Univariate, multivariate Cox proportional hazard models, and least absolute shrinkage and selection operator (LASSO) regression were adopted for the selection of risk factors. Nomograms were developed using R software. The model performance was evaluated by drawing receiver operating characteristic (ROC), overtime C-index curves, and calibration curves. Harrell C-index, areas under the curves (AUC), and Brier score were also calculated. The decision curve analysis (DCA) was conducted to measure the net clinical benefit. RESULTS: A total of 11,362 patients were identified and divided into training (n=7,953) and validation (n=3,409) dataset. Sex, age, race, marital status, site, differentiation grade, American Joint Committee on Cancer (AJCC) stage, T/N/M stage, tumor size, surgery, and histological type were incorporated into the Cox hazard model for OS prediction after variable selection, while all predictors, except for marital status and site, were selected for CSS prediction. For 5-year prediction, the AUC of the nomogram for OS and CSS was 83.5 and 82.7 in the training and validation dataset, respectively. The C-index was 0.787 for OS and 0.798 for CSS in the validation group. The Brier score was 0.0153 and 0.0130 for OS and CSS, respectively. The calibration curves showed that the nomogram had well prediction accuracy. From the perspective of DCA, a nomogram was superior to the AJCC stage and TNM stage in net benefit. In general, the performance of the nomogram was consistently better compared to the AJCC stage and TNM stage across all settings. CONCLUSIONS: The performance of the novel nomogram for predicting OS and CSS of MSGC patients was further verified, revealing that it could be used as a valuable tool in assisting clinical decision-making. 2021 Annals of Translational Medicine. All rights reserved.
Entities:
Keywords:
Major salivary glands carcinoma (MSGC); Surveillance, Epidemiology, and End Results (SEER); nomogram; prognosis; survival
Major salivary glands carcinoma (MSGC) is a rare malignant tumor, which accounts for 11% of oropharyngeal neoplasms in the United States (U.S.) (1,2). MSGC comprises over 20 different histological subtypes with different biological behavior and therapy responses. Due to the low number of patients in each subtype, there are no randomized phase III studies (3). The management of MSGC is still challenging. It there are no randomized phase III studies is reported that the 5-year overall survival (OS) is only about 50% (4). The rarity of MSGC, combined with complex classification schema and non-standard therapy, makes the prediction of the survival of MSGC patients difficult. Therefore, developing an effective prognostic tool to improve clinical treatment strategies for MSGC is of great clinical importance.Although there are many prognosis factors for cancers such as sex, race, age, and other characteristics, tumor staging systems are most commonly applied for the assessment of risk and the decision-making of cancer management in clinical practice, especially American Joint Committee on Cancer (AJCC) classification system and TNM staging system (5,6). Still, these systems have limitations: only tumor size, lymph nodes, and distant metastasis were considered. Moreover, MSGC patients classified as in the same stage may have completely different survival outcomes (7). The nomogram is an effective and convenient tool based on a statistical model for quantitative prediction of clinical events using multiple selected prognostic factors. It has been widely applied for predicting survival in patients with malignancies, such as lung cancer, breast cancer, and colon cancer (8-10). In their recent study, Lukovic et al. developed a predictive model for the prediction of distant metastasis (11), thus revealing the broad application prospect of predictive models in clinical practice for MSGC management. Ali et al. created a nomogram for predicting survival and recurrence of major salivary gland carcinoma based on the characteristics of 301 patients recruited at Memorial Sloan Kettering Cancer Centre between 1985 and 2009; however, the sample size of the development cohort was limited (12,13). Furthermore, histological type, which is a key factor in determining the biological behavior of the tumor, is not considered in the nomogram. Li et al. constructed a nomogram for survival prediction of MSGC (14). Nonetheless, improper study design and non-transparent reporting of results significantly damaged the reliability of the study. Therefore, an updated and reliable nomogram for predicting the survival of MSGC patients is urgently needed.In the present study, we used data of MSGC patients from the Surveillance, Epidemiology, and End Results (SEER) database to develop a predictive model, which was presented as a nomogram, and to test the performance for OS and cancer-specific survival (CSS). We present the following article in accordance with the TRIPOD reporting checklist (available at https://dx.doi.org/10.21037/atm-21-1725) (15).
Methods
Data source and selection criteria
Data for this study were obtained from the SEER database (approval number: 21476-Nov2019), which is an authoritative source for cancer statistics in the United States (https://seer.cancer.gov/). The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). The ethical approval was waived since the study was based on secondary data analysis.The inclusion criteria were the following: (I) patients diagnosed with major salivary gland carcinoma since 2004; (II) the survival time of patients is known. Patients were excluded if they met the following conditions: (I) survival time <1 month; (II) patients with missing data; (III) patients only diagnosed by autopsy or death certificate.Fourteen predictor variables included in the present study were: sex, age at diagnosis, race, marital status, location of the tumor, grade of differentiation, T/N/M stage, AJCC stage, size of the tumor, histological type of the tumor, surgery, and previously having/not having surgery of lymph nodes. Vital status and cancer-specific death, which were primary outcomes in this study, were also extracted. For radiotherapy and chemotherapy, patients were divided into two groups—“Yes” and “No/Unknown” in the SEER. Combining “No” and “Unknown” as a group would probably reduce the effect of the predictor and lead to confusion in the clinical practice. Radiotherapy and chemotherapy were not extracted in this study. All data were extracted using SEER*Stat 8.3.5 software.
Statistical analysis
The whole population extracted from the SEER database was randomly divided into training dataset and validation dataset at ratio 7:3. Categorical variables were analyzed by Chi-square test, and continuous variables (survival months) were analyzed by rank-sum test. Univariate and multivariate Cox proportional hazards regression were performed to assess the relationship between predictor variables and survival outcomes. Hazard ratio (HR) was applied as the effect size. To select predictor variables, the least absolute shrinkage and selection operator (LASSO) regression was adopted (16).The performance of the predictive model was evaluated in both the training dataset and validation data set. The discriminative performance of the predictive model was evaluated by Harrell C-index (C-index) at a given time point and over time C-index (17). The C-index estimates the probability that the model can correctly sort the sequence of events in a randomly selected pair of cases. The value of the C-index ranges from 0.5 to 1. A higher C-index means better predict performance (18). The receiver operating characteristic (ROC) curves were drawn, and the areas under the curves (AUC) were also calculated. AUC is another parameter for measuring the discriminative power of the model. Calibration curves were drawn to further evaluate the concordance of the predicted survival probability and the observed probability. Brier score, which is squared prediction error, was calculated. A lower Brier score means a lower prediction error (19). The decision curve analysis (DCA) was conducted for clinical significance assessment by calculating net benefit (20). All statistical analyses were performed using R version 4.0.3 (https://www.r-project.org/). An α level was set as 0.05.
Results
Characteristics of patients
A total of 14,753 eligible patients recorded in the SEER databases were initially recruited. Patients whose survival time was less than 1 month were excluded (n=718). Since the design of this study implied complete case analysis, patients with missing values were also excluded (n=2,673). Finally, 11,362 patients were included. Patients were randomly assigned into training dataset (n=7,953) and validation dataset (n=3,409) with a ratio 7:3. The flowchart of patient selection is shown in . There was no statistical difference in patients’ distribution between the training and validation dataset (P>0.05). The median survival time for the whole population was 48 months. The majority of the patients were male (n=6,400, 56%), white (n=9,214, 81%), married when diagnosed with MSGC, with an age between 60 and 80 (n=4,592, 40%). Most of lesions were located at parotid gland (n=9,191, 81%) and were classified as AJCC IV stage (n=3,139, 28%). For TNM stage, T1 accounted for 29% (n=3,305), N0 accounted for 67% (n=7,568), and M0 accounted for 89% (n=10,082). The size of most tumors was 0–1 cm (n=9,651, 85%). Mucoepidermoid carcinoma was the most common histological type (n=2,637, 23%). There were 86% patients (n=9,758, 86%) who underwent surgery, while 61% of total population (n=6,896) underwent removal of lymph nodes. A total of 3,616 death cases were reported in this cohort, including 2,338 cancer-related death cases and 1,278 competing death cases. The detailed characteristics of patients are shown in .
Figure 1
Flow chart of patient selection.
Table 1
Baseline demographic and clinical characteristics of the patients
Characteristic
Total (n=11,362), n [%]
Training dataset (n=7,953), n [%]
Validation dataset (n=3,409), n [%]
P value
Sex
–
Female
4,962 [44]
3,497 [44]
1,465 [43]
Male
6,400 [56]
4,456 [56]
1,944 [57]
Age
0.298
<40 years
1,573 [14]
1,084 [14]
489 [14]
40–59 years
3,562 [31]
2,531 [32]
1,031 [30]
60–80 years
4,592 [40]
3,211 [40]
1,381 [41]
>80 years
1,635 [14]
1,127 [14]
508 [15]
Race
0.234
White
9,214 [81.1]
6,468 [81.3]
2,746 [80.6]
Black
1,047 [9.2]
705 [8.9]
342 [10.0]
Others
971 [8.5]
690 [8.7]
281 [8.2]
Unknown
130 [1.1]
90 [1.1]
40 [1.2]
Marital status
0.379
Divorced
862 [7.6]
610 [7.7]
252 [7.4]
Married
6,321 [55.6]
4,416 [55.5]
1,905 [55.9]
Separated
91 [0.8]
73 [0.9]
18 [0.5]
Single
2,110 [18.6]
1,465 [18.4]
645 [18.9]
Widowed
1,190 [10.5]
839 [10.5]
351 [10.3]
Unknown
788 [6.9]
550 [6.9]
238 [7.0]
Site
0.306
Parotid gland
9,191 [80.9]
6,434 [80.9]
2,757 [80.9]
Submandibular gland
1,478 [13.0]
1,028 [12.9]
450 [13.2]
Sublingual gland
124 [1.1]
93 [1.2]
31 [0.9]
Overlapping of major gland
11 [<0.1]
5 [<0.1]
6 [0.2]
Unspecified major gland
558 [4.9]
393 [4.9]
165 [4.8]
Grade
0.575
I
1,509 [13.3]
1,066 [13.4]
443 [13.0]
II
2,673 [23.5]
1,895 [23.8]
778 [22.8]
III
2,342 [20.6]
1,617 [20.3]
725 [21.3]
IV
1,007 [8.9]
694 [8.7]
313 [9.2]
Unknown
3,831 [33.7]
2,681 [33.7]
1,150 [33.7]
AJCC stage
0.630
I
2,885 [25]
1,988 [25]
897 [26]
II
2,048 [18]
1,450 [18]
598 [18]
III
1,946 [17]
1,370 [17]
576 [17]
IV
3,139 [28]
2,207 [28]
932 [27]
Unknown
1,344 [12]
938 [12]
406 [12]
T stage
0.320
0
40 [0.4]
28 [0.4]
12 [0.4]
1
3,305 [29.1]
2,268 [28.5]
1,037 [30.4]
2
2,708 [23.8]
1,920 [24.1]
788 [23.1]
3
2,110 [18.6]
1,481 [18.6]
629 [18.5]
4
1,805 [15.9]
1,288 [16.2]
517 [15.2]
X
1,394 [12.3]
968 [12.2]
426 [12.5]
N stage
0.743
0
7,568 [66.6]
5,317 [66.9]
2,251 [66.0]
1
1,389 [12.2]
978 [12.3]
411 [12.1]
2
1,565 [13.8]
1,084 [13.6]
481 [14.1]
3
69 [0.6]
46 [0.6]
23 [0.7]
X
771 [6.8]
528 [6.6]
243 [7.1]
M stage
0.741
0
10,082 [88.7]
7,069 [88.9]
3,013 [88.4]
1
573 [5.0]
396 [5.0]
177 [5.2]
X
707 [6.2]
488 [6.1]
219 [6.4]
Tumor size
0.547
≤1 cm
9,651 [84.9]
6,762 [85.0]
2,889 [84.7]
>1, ≤2 cm
90 [0.8]
63 [0.8]
27 [0.8]
>2, ≤3 cm
16 [0.1]
13 [0.2]
3 [<0.1]
>3, ≤4 cm
22 [0.2]
19 [0.2]
3 [<0.1]
>4, ≤5 cm
12 [0.1]
8 [0.1]
4 [0.1]
>5 cm
4 [<0.1]
2 [<0.1]
2 [<0.1]
Unknown
1,567 [13.8]
1,086 [13.7]
481 [14.1]
Histological type
0.886
Unclassified carcinoma
172 [1.5]
116 [1.5]
56 [1.6]
Secondary carcinoma
662 [5.8]
468 [5.9]
194 [5.7]
Large cell carcinoma
41 [0.4]
27 [0.3]
14 [0.4]
Undifferentiated carcinoma
45 [0.4]
32 [0.4]
13 [0.4]
Small cell carcinoma
87 [0.8]
58 [0.7]
29 [0.9]
Squamous cell carcinoma
1,706 [15.0]
1,189 [15.0]
517 [15.2]
Lymphoepithelial carcinoma
86 [0.8]
60 [0.8]
26 [0.8]
Adenocarcinoma
885 [7.8]
617 [7.8]
268 [7.9]
Basal cell adenocarcinoma
175 [1.5]
126 [1.6]
49 [1.4]
Adenoid cystic carcinoma
1,210 [10.6]
860 [10.8]
350 [10.3]
Neuroendocrine carcinoma
79 [0.7]
52 [0.7]
27 [0.8]
Oxyphilic adenocarcinoma
88 [0.8]
60 [0.8]
28 [0.8]
Mucoepidermoid carcinoma
2,637 [23.2]
1,817 [22.8]
820 [24.1]
Duct carcinoma
246 [2.2]
161 [2.0]
85 [2.5]
Acinar cell carcinoma
1,380 [12.1]
980 [12.3]
400 [11.7]
Epithelial-myoepithelial carcinoma
243 [2.1]
174 [2.2]
69 [2.0]
Carcinoma in pleomorphic adenoma
495 [4.4]
346 [4.4]
149 [4.4]
Malignant myoepithelioma
174 [1.5]
130 [1.6]
44 [1.3]
Others
951 [8.4]
680 [8.6]
271 [7.9]
Surgery
0.538
Yes
9,758 [85.9]
6,845 [86.1]
2,913 [85.5]
No
1,500 [13.2]
1,033 [13.0]
467 [13.7]
Unknown
104 [0.9]
75 [0.9]
29 [0.9]
Removal of lymph nodes
0.713
Yes
6,896 [60.7]
4,826 [61]
2,070 [60.7]
Biopsy only
155 [1.4]
115 [1.4]
40 [1.2]
No
4,109 [36.2]
2,872 [36]
1,237 [36.3]
Unknown
202 [1.8]
140 [1.8]
62 [1.8]
Survival months, median [inter-quartile range]
48 [21, 89]
47 [21, 88]
49 [21, 90]
0.250
Cancer-specific death
0.561
Alive
9,024 [79]
6,305 [79]
2,719 [80]
Dead
2,338 [21]
1,648 [21]
690 [20]
Vital status
0.480
Alive
7,746 [68]
5,438 [68]
2,308 [68]
Dead
3,616 [32]
2,515 [32]
1,101 [32]
AJCC, American Joint Committee on Cancer; CI, confidence interval.
Flow chart of patient selection.AJCC, American Joint Committee on Cancer; CI, confidence interval.
Establishment of predict model
First, univariate Cox regression was constructed. With reference to OS, all variables were significantly correlated with outcome (P<0.001). The results of multivariate Cox regression showed that sex, age, race, grade, AJCC stage, T/N stage, tumor size, surgery, and histological type were significantly correlated with OS (P<0.001) (). Both M stage and site also showed a significant correlation (P<0.01), except for marital status and removal of lymph nodes (P>0.05). As for CSS, similar results were obtained in the univariate Cox regression, while marital status and M stage were not significantly associated with CSS in the multivariate Cox regression (P>0.05) (). Then the LASSO model was applied for variable selection (Figure S1). For OS, only one variable (removal of lymph nodes) was not incorporated in the model. A total of 12 predictors, except for marital status and site, were selected for CSS prediction. The regression coefficients are provided in Table S1.
Table 2
Cox univariate and multivariate analysis of OS rates in training dataset
OS, overall survival; HR, hazard ratio; CI, confidence interval.CSS, cancer-specific survival; HR, hazard ratio; CI, confidence interval.The nomogram was drawn using the “rms” package (). When using the nomogram to predict the survival of patients, a perpendicular line was drawn towards the point line from each predictor to identify the specific value of each variable. The total points were the sum of values for each variable. Then, the survival likelihood was determined by drawing a line from the total points’ line down along the survival axis.
Figure 2
Nomograms for prediction of 3-year and 5-year OS (A) and CSS (B) rate of MSGC patients. For histological type, the number 1 to 19 represents: (I) unclassified carcinoma; (II) secondary carcinoma; (III) large cell carcinoma; (IV) undifferentiated carcinoma; (V) small cell carcinoma; (VI) squamous cell carcinoma; (VII) lymphoepithelial carcinoma; (VIII) adenocarcinoma; (IX) basal cell adenocarcinoma; (X) adenoid cystic carcinoma; (XI) neuroendocrine carcinoma; (XII) oxyphilic adenocarcinoma; (XIII) mucoepidermoid carcinoma; (XIV) duct carcinoma; (XV) acinar cell carcinoma; (XVI) epithelial-myoepithelial carcinoma; (XVII) carcinoma in pleomorphic adenoma; (XVIII) malignant myoepithelioma; (XIX) others. AJCC, American Joint Committee on Cancer; OS, overall survival; CSS, cancer-specific survival; MSGC, major salivary glands carcinoma.
Nomograms for prediction of 3-year and 5-year OS (A) and CSS (B) rate of MSGC patients. For histological type, the number 1 to 19 represents: (I) unclassified carcinoma; (II) secondary carcinoma; (III) large cell carcinoma; (IV) undifferentiated carcinoma; (V) small cell carcinoma; (VI) squamous cell carcinoma; (VII) lymphoepithelial carcinoma; (VIII) adenocarcinoma; (IX) basal cell adenocarcinoma; (X) adenoid cystic carcinoma; (XI) neuroendocrine carcinoma; (XII) oxyphilic adenocarcinoma; (XIII) mucoepidermoid carcinoma; (XIV) duct carcinoma; (XV) acinar cell carcinoma; (XVI) epithelial-myoepithelial carcinoma; (XVII) carcinoma in pleomorphic adenoma; (XVIII) malignant myoepithelioma; (XIX) others. AJCC, American Joint Committee on Cancer; OS, overall survival; CSS, cancer-specific survival; MSGC, major salivary glands carcinoma.
Evaluation of model performance
The performance of the predictive model was compared with the AJCC stage and TNM stage system. The values of AUC, C-index, and Brier score are presented in and Table S2. For OS, AUC of nomogram in the training dataset was 83.5 (82.4–84.7), while the AUC of TNM stage and AJCC stage was 72.7 (71.3–74.1) and 71.6 (70.2–73.0), respectively. In the validation dataset, the AUC of the nomogram was 82.7 (81.0–84.3), while the AUC of the TNM stage and AJCC stage were 71.0 (68.9–73.2) and 69.8 (67.7–71.9), respectively. Similar results were found for CSS: nomogram yielded the highest value of AUC of 83.9 (82.6–85.2), followed by TNM stage (75.9, 74.4–77.3) and AJCC stage (73.9, 72.4–75.3) in the training cohort. The AUC values of the nomogram, TNM stage, AJCC stage were 82.8 (80.9–84.7), 74.5 (72.2–76.7), 72.6 (70.4–74.8), respectively, in the validation cohort. As shown in and Figure S2, the AUC values of the nomogram were the best across all settings.
Table 4
Comparison of AUC, C-index, and Brier score for 5-year prediction between the nomogram, TNM stage, and AJCC stage in MSGC patients
Survival types
Training dataset
Validation dataset
AUC
95% CI
C-index
95% CI
Brier score
AUC
95% CI
C-index
95% CI
Brier score
OS
Nomogram
83.5
82.4–84.7
0.796
0.788–0.804
0.0148
82.7
81.0–84.3
0.787
0.774–0.799
0.0153
TNM stage
72.7
71.3–74.1
0.724
0.713–0.734
0.0188
71.0
68.9–73.2
0.711
0.695–0.726
0.0194
AJCC stage
71.6
70.2–73.0
0.751
0.739–0.762
0.0187
69.8
67.7–71.9
0.730
0.712–0.748
0.0194
CSS
Nomogram
83.9
82.6–85.2
0.806
0.797–0.815
0.0128
82.8
80.9–84.7
0.798
0.784–0.812
0.0130
TNM stage
75.9
74.4–77.3
0.753
0.742–0.764
0.0153
74.5
72.2–76.7
0.742
0.725–0.759
0.0155
AJCC stage
73.9
72.4–75.3
0.781
0.769–0.793
0.0160
72.6
70.4–74.8
0.769
0.750–0.788
0.0160
MSGC, major salivary gland carcinoma; OS, overall survival; CSS, cancer-specific survival; AJCC, American Joint Committee on Cancer; AUC, area under curve; CI, confidence interval.
Figure 3
The ROC curves of nomograms, AJCC stage, and TNM stage for 5-year prediction in training and validation dataset. (A,B) OS in the training dataset (A) and validation dataset (B). (C,D) CSS in the training dataset (C) and validation dataset (D). ROC, receiver operating characteristic; AJCC, American Joint Committee on Cancer; OS, overall survival; CSS, cancer-specific survival.
MSGC, major salivary gland carcinoma; OS, overall survival; CSS, cancer-specific survival; AJCC, American Joint Committee on Cancer; AUC, area under curve; CI, confidence interval.The ROC curves of nomograms, AJCC stage, and TNM stage for 5-year prediction in training and validation dataset. (A,B) OS in the training dataset (A) and validation dataset (B). (C,D) CSS in the training dataset (C) and validation dataset (D). ROC, receiver operating characteristic; AJCC, American Joint Committee on Cancer; OS, overall survival; CSS, cancer-specific survival.The C-index of the nomogram in predicting OS was 0.796 (0.788–0.804) and 0.787 (0.774–0.799), showing a greater predictive performance compared with TNM stage (0.724, 0.713–0.734; 0.711, 0.695–0.726) and AJCC stage (0.751, 0.739–0.762; 0.730, 0.712–0.748) in the training group and validation group. For CSS, the C-index of nomogram was 0.806 (0.797–0.815) in training group and 0.798 (0.78–0.81) in validation group. The C-index of TNM stage was 0.75 (0.74–0.76) and 0.74 (0.784–0.812), while the C-index of AJCC stage was 0.781 (0.769–0.793) and 0.769 (0.750–0.788) in training set and validation set, respectively. The over-time C-indexes of nomogram were consistently better than TNM stage and AJCC stage throughout the investigated period, as shown in .
Figure 4
The overtime C-index of nomograms, AJCC stage, and TNM stage for predicting OS (A,B) and CSS (C,D) in training and validation dataset. C-index, Harrell C-index; AJCC, American Joint Committee on Cancer; OS, overall survival; CSS, cancer-specific survival.
The overtime C-index of nomograms, AJCC stage, and TNM stage for predicting OS (A,B) and CSS (C,D) in training and validation dataset. C-index, Harrell C-index; AJCC, American Joint Committee on Cancer; OS, overall survival; CSS, cancer-specific survival.The Brier scores of the nomogram in predicting OS, which was 0.0148 and 0.0153 in the training set and validation set, respectively, were better than the TNM stage and AJCC stage. Concerning the TNM stage and AJCC stage, the Brier scores were 0.0188, 0.0187 in the training group, and 0.0194, 0.0194 in the validation group. As for CSS, the Brier score of the nomogram, TNM stage, AJCC stage in the training dataset were 0.0128, 0.0153, 0.0160, and 0.0130, 0.0155, 0.0160 in the validation set, respectively. The calibration curves displayed by the nomogram were closer to the reference line, which indicated that the nomogram exhibited a higher accuracy than other systems (,
Figure S3).
Figure 5
The calibration curves of nomograms, AJCC stage, and TNM stage for 5-year prediction in training and validation dataset. (A,B) OS in the training dataset (A) and validation dataset (B). (C,D) CSS in the training dataset (C) and validation dataset (D). AJCC, American Joint Committee on Cancer; OS, overall survival; CSS, cancer-specific survival.
The calibration curves of nomograms, AJCC stage, and TNM stage for 5-year prediction in training and validation dataset. (A,B) OS in the training dataset (A) and validation dataset (B). (C,D) CSS in the training dataset (C) and validation dataset (D). AJCC, American Joint Committee on Cancer; OS, overall survival; CSS, cancer-specific survival.The DCA was adopted for the clinical significance assessment of the nomogram. The performance of the nomogram was also compared with the TNM stage and AJCC stage. As shown in and Figure S4, the net benefits of nomogram were higher compared to the other two systems, no matter in training dataset or validation dataset, for OS or CSS. The results of DCA indicated that the nomogram established by the present study was applicable for predicting the 3- and 5-year CSS and OS.
Figure 6
The DCA of nomograms, AJCC stage, and TNM stage for 5-year prediction in training and validation dataset. (A,B) OS in the training dataset (A) and validation dataset (B). (C,D) CSS in the training dataset (C) and validation dataset (D). DCA, decision curve analysis; AJCC, American Joint Committee on Cancer; OS, overall survival; CSS, cancer-specific survival.
The DCA of nomograms, AJCC stage, and TNM stage for 5-year prediction in training and validation dataset. (A,B) OS in the training dataset (A) and validation dataset (B). (C,D) CSS in the training dataset (C) and validation dataset (D). DCA, decision curve analysis; AJCC, American Joint Committee on Cancer; OS, overall survival; CSS, cancer-specific survival.
Discussion
The incidence rate of MSGC ranges from 1.1 to 1.3/100,000/year, which is quite low (21,22). The limited sample size greatly impacts the related research, as it increases the research error and may lead to a biased result. SEER database is an authoritative source of information on cancer incidence, diagnosis, therapy, and survival in the United States. This database collects and publishes cancer incidence and survival data from eighteen cancer registries covering approximately 35% of the U.S. population. The SEER database has been widely adopted in cancer-related research (23-25). In this study, a large cohort of patients (n=14,753) diagnosed with MSGC was recruited from SEER. To the best of our knowledge, this is the largest sample size used so far in the research on the prognosis of MSGC.Given the morphologic diversity and rarity of MSGC, there are no high-quality randomized data defining predictive factors. In a relatively large cohort, site, histological type, race, sex, and age at diagnosis were identified as predictors for MSGC OS (26). Li et al. failed to find a prognostic, predictive factor for lymphoepithelial carcinoma in the major salivary gland in a cohort containing 37 patients (27). In a study comprising 95 patients recruited from a single institution, male sex and >10 pack-year smoking history, grade, N stage, squamous cell carcinoma, perineural invasion, and neck dissection significantly reduced OS (28). In the present study, multivariate Cox analysis identified sex, age, race, grade, AJCC stage, T/N/M stage, tumor size, surgery, histological type, and site as significant factors predicting OS. All predictors, except for marital status and M stage, were significant in multivariate Cox analysis for CSS.Sex was proven to be an independent risk factor of MSGC survival. Females have been reported to have better survival outcomes compared with males, which is consistent with our result (26,29). Not surprisingly, the risk increases with the patients’ age. As for the race, black people were at the highest risk, followed by white, and then other races, which was consistent with some previous studies (30-32). It is important to note that this may be due to the socioeconomic differences between black people and other races in some areas (33). Poor differentiation grade and high AJCC stage are two independent indicators for worse survival outcomes. T stage and tumor size reflected the condition of the primary site. Interestingly, the patient with a larger primary lesion might have better survival compared to a smaller lesion, which was contradictory to the results of other studies (29). On the one hand, the larger tumor was often associated with better differentiation grade and histological type, which led to better survival outcomes. Patients who suffered from highly malignant tumors usually died before their tumor grew larger. On the other hand, this may be due to the small sample size with large tumors (>1 cm) in this cohort (n=1,171, 15.1%). Whether in univariate, multivariate Cox regression, or LASSO model, tumor size was significantly related to the OS and CSS; thus, it was incorporated in the final model. M stage, which reflects distant tumor metastasis, was one of the most important predictors for cancer survival (34). In the present study, the M stage showed a significant association with OS but not CSS (P>0.05). The important role of distant metastasis in MGSC has been well-recognized (35,36). Considering the clinical significance, the M stage was finally included. Surgery was the primary approach for MSGC management. Patients who underwent surgery had better survival compared to those who did not. In the 2017 World Health Organization classification, there were over 20 different histological subtypes of MSGC, with specific features and outcomes. Consequently, it is important to consider the histological type for prediction survival of MSGC. In the present study, when the number of patients diagnosed as a specific subtype was less than 10, it was put into the category of “Other”. Epithelial-myoepithelial carcinoma and lymphoepithelial carcinoma had better survival, while patients diagnosed with unclassified carcinoma had worse OS and CSS. Radiotherapy and chemotherapy were not incorporated in the model. For radiotherapy and chemotherapy, patients were divided into two groups—“Yes” and “No/Unknown” in the SEER, while “Unknown” was set as a separated group for other variables, such as race, material status, grade, AJCC stage. After discussion, we thought that combining “No” and “Unknown” as a group would reduce the effect of these two variables and might lead to confusion in the clinical practice.The key strengths in the present study are the following: first, the SEER database provided a total of 11,362 MSGC patients. Compared with a study by Li et al., the sample size in the present study was larger (4,218 vs. 14,753). This cohort, which to the best of our knowledge is the largest one used thus far, is large enough to ensure the reliability and effectiveness of the nomogram. Second, the histological type is considered in the nomogram. The Memorial Sloan Kettering Cancer Center (MSKCC) a salivary carcinoma nomogram, is a common tool used for MSGC survival prediction (12,13,37). Still, unlike the histological type, the only histological grade was considered for the development of the MSKCC nomogram. As an important factor, the survival outcomes in different histological types greatly vary. Third, treatment planning and prognosis prediction are currently dependent on the AJCC stage and TNM stage systems to a great extent. To evaluate the value of using the nomogram, the performance of the nomogram was compared with the AJCC stage and TNM stage. C-index, AUC, Brier score was calculated. Calibration curves, ROC curves, and DCA were also drawn. The result showed that the performance of the nomogram was consistently better compared to the AJCC stage and TNM stage. Finally, even though Li et al. and our nomograms were constructed based on the SEER database, only the present study was conducted according to the TRIPOD statement (30). Moreover, the performance of the nomogram was evaluated all-around to ensure the reliability of the prediction tool.The present study also has some limitations. The variables extracted from the SEER database were limited. Records related to prognosis, such as lymphovascular invasion, perineural invasion, tobacco/alcohol use, were inaccessible. In this cohort, the seventh edition of the AJCC stage was adopted as only a small portion of patients had information about the eighth edition of the AJCC stage. As a result, the extranodal extension of lymph nodes was not considered in the present study, which is a common limitation in research collecting information from SEER. Moreover, target therapy has an increasingly important role in MSGC treatment (38). Once the patient accepts target therapy, the survival will certainly change. Furthermore, different drugs can also have different roles. Due to the limitation of SEER, the information on target therapy remains unknown. Although the nomogram performed well in internal validation, external validation is needed in future studies.
Conclusions
A reliable nomogram for OS and CSS prediction in MSGC patients based on the Cox regression model was developed using the SEER database. The nomogram showed good predictive discrimination and accuracy compared with the AJCC stage and TNM stage. Consequently, the nomogram can become a practical tool to help clinicians to make a treatment plan and prognosis predictions.The article’s supplementary files as
Authors: Davide Lombardi; Michele Tomasoni; Alberto Paderno; Davide Mattavelli; Marco Ferrari; Simonetta Battocchio; Francesco Missale; Francesco Mazzola; Giorgio Peretti; Davide Mocellin; Daniele Borsetto; Jonathan M Fussey; Paul Nankivell; Nikoleta Skalidi; Mario Bussi; Leone Giordano; Andrea Galli; Gianluigi Arrigoni; Elena Raffetti; Paul Pracy; Vincent Vander Poorten; Piero Nicolai Journal: Oral Oncol Date: 2020-10-30 Impact factor: 5.337
Authors: Laura D Locati; Donata Galbiati; Giuseppina Calareso; Salvatore Alfieri; Susanne Singer; Stefano Cavalieri; Cristiana Bergamini; Paolo Bossi; Ester Orlandi; Carlo Resteghini; Francesca Platini; Roberta Granata; Pasquale Quattrone; Moela Mancinelli; Luigi Mariani; Salvatore Lo Vullo; Lisa F Licitra Journal: Cancer Date: 2020-02-07 Impact factor: 6.860
Authors: Eva Katharina Egger; Matthias B Stope; Florian Recker; Dominique Konsgen; Jennifer Landsberg; Anne Frohlich; Alina Abramian; Alexander Mustea Journal: Anticancer Res Date: 2021-02 Impact factor: 2.480
Authors: Safina Ali; Frank L Palmer; Changhong Yu; Monica DiLorenzo; Jatin P Shah; Michael W Kattan; Snehal G Patel; Ian Ganly Journal: Ann Surg Oncol Date: 2013-10-17 Impact factor: 5.344
Authors: C Fushimi; Y Tada; H Takahashi; T Nagao; H Ojiri; T Masubuchi; T Matsuki; K Miura; D Kawakita; H Hirai; E Hoshino; S Kamata; T Saotome Journal: Ann Oncol Date: 2018-04-01 Impact factor: 32.976
Authors: Allison W Kurian; Kevin C Ward; Paul Abrahamse; Irina Bondarenko; Ann S Hamilton; Dennis Deapen; Monica Morrow; Jonathan S Berek; Timothy P Hofer; Steven J Katz Journal: J Clin Oncol Date: 2021-02-09 Impact factor: 44.544