Literature DB >> 31977914

Development and validation of a nomogram for predicting long-term overall survival in nasopharyngeal carcinoma: A population-based study.

Jiayuan Wu1, Quan Zhou2, Zhenyu Pan3, Yufeng Wang4, Liren Hu4, Guanghua Chen5, Shengpeng Wang6, Jun Lyu7.   

Abstract

We aimed to develop a nomogram based on a population-based cohort to estimate the individualized overall survival (OS) for patients with nasopharyngeal carcinoma (NPC) and compare its predictive value with that of the traditional staging system.Data for 3693 patients with NPC were extracted from the Surveillance, Epidemiology, and End Results dataset and randomly divided into two sets: training (n = 2585) and validation (n = 1108). On the basis of multivariate Cox regression analysis, a nomogram was constructed to predict the 3-, 5-, and 10-year survival probability for a patient. The performance of the nomogram was quantified with respect to discrimination, calibration, and clinical utility.In the training set, age, sex, race, marital status, histological type, T stage, N stage, M stage, radiotherapy, and chemotherapy were selected to develop a nomogram for predicting the OS probability based on the multivariate Cox regression model. The nomogram was generally more discriminative compared with the American Joint Committee on Cancer 7th staging system. Calibration plots exhibited an excellent consistency between the observed probability and the nomogram's prediction. Categorical net classification improvement and integrated discrimination improvement suggested that the predictive accuracy of the nomogram exceeded that of the classic staging system. With respect to decision curve analyses, the nomogram exhibited preferable net benefit gains than the staging system across a wide range of threshold probabilities.This proposed nomogram exhibits an excellent performance with regard to its predictive accuracy, discrimination capability, and clinical utility, and thus can be used as a convenient and reliable tool for prognosis prediction in patients with NPC.

Entities:  

Mesh:

Year:  2020        PMID: 31977914      PMCID: PMC7004579          DOI: 10.1097/MD.0000000000018974

Source DB:  PubMed          Journal:  Medicine (Baltimore)        ISSN: 0025-7974            Impact factor:   1.889


Introduction

Nasopharyngeal carcinoma (NPC) is a specific malignance that originates in the epithelial lining of the nasopharynx, accounting for only 0.6% of newly diagnosed cancers per year.[ Compared with other types of head and neck cancer, NPC has an uneven geographical distribution worldwide. NPC is uncommon in most area around the world, including the United States (US), with an incidence of <1 case per 100,000 person-years, but it is endemic in southern China and Southeast Asia, representing a peak incidence of 50 per 100,000 person-years.[ The NPC pathogenesis is thought to be associated with Epstein–Barr virus (EBV) infection, family genetic factors, and environmental factors according to the current evidence.[ Over the past three decades, the survival time of patients with NPC has prolonged because of the improvement of radiotherapy (RT) technology, the broad application of chemotherapy, and the accurate disease diagnosis. However, appropriately 30% of patients with NPC suffer a poor prognosis due to the development of recurrence or metastasis.[ Given that patients with a high risk of poor prognosis may undergo additional treatment, early identification of these patients is necessary.[ To date, the standard treatment strategies for patients with NPC are RT alone for the early stage and concurrent radiochemotherapy (CCRT) for the advanced stage. These treatment are performed based on the current Tumor–Node–Metastasis (TNM) staging system.[ However, an extreme variation of clinical outcomes has been reported among the patients with NPC with the same stage who receive similar treatment regimen.[ This phenomenon suggests that the TNM staging system is far from a perfect tool for prognostic prediction and therapeutic decision, because it only accounts for the tumor size and extension (T stage), lymph node involvement (N stage), and distant metastasis (M stage), without considering other factors with prognostic values, such as demographic features, histological type, and clinical treatment. Moreover, the TNM staging system divides patients into various groups but fails to evaluate the individualized risk of survival outcomes.[ Given that the prognosis of malignant patients is determined multifactorially, to develop an accurate tool that can help for the individualized risk classification and decision-making in clinic practice is urgent. As a simple and convenient mathematical model, nomograms have been widely used in clinical practice through intuitive visual presentation. A nomogram can take the weight of each relevant factor into consideration and integrate the independent factors to predict a numerical probability of a special clinical event for individual patients over time.[ Compared with the TNM staging system, several advantages have been found in nomograms, such as accurate estimation, strong discrimination, and user-friendliness. Several built nomograms are useful in the management of patients with NPC.[ To the best of our knowledge, nomograms for patients with NPC based on the population-based data have not been reported. In the present study, we aimed to develop and validate a nomogram for the prognostic prediction of patients with NPC based on a multi-institution and multi-population data from the Surveillance, Epidemiology, and End Results (SEER) dataset. We also compared the performance of this nomogram with the currently used American Joint Committee on Cancer (AJCC) staging system for NPC.

Materials and methods

Patients

With the help of SEER∗Stat software (version 8.3.5, National Cancer Institute, Bethesda, MD), we collected the detailed data on patients diagnosed with NPC (site recode International Classification of Disease for Oncology, 3rd Edition [ICD-O-3]/World Health Organization [WHO] 2008 of “nasopharynx” and behavior recode ICD-O-3 of “malignant”) from the SEER program of the National Cancer Institute.[ The SEER program provides data on cancer incidence, prevalence, and mortality in the United States, and covers 28% of the US population from 18 cancer registries. We only included cases from 2004 to 2015 because the SEER database comprises detailed stage information (2004 AJCC 6th and 2010 AJCC 7th) collected since 2004. Patients with insufficient or unknown data were excluded. Given that the SEER database was publicly accessible, Institutional Review Board approval was not required for the present study. We have received permission from the SEER registry to access the data (authorization number: 15709-Nov2017).

Variables

Demographic, clinicopathologic, and therapeutic variables were retrieved from the SEER database, including age, sex, race, marital status, grade, histological type, tumor stage (T stage), nodal stage (N stage), metastatic stage (M stage), clinical stage (TNM stage), RT, chemotherapy, follow-up time, and survival outcome. The 7th edition of AJCC staging system was applied to the patients in this study, whereas patients reported on the 6th edition were also converted to the 7th edition. Histological types of the patients with NPC were divided into four groups according to the WHO classification scheme by using the ICD-O-3 codes: keratinizing squamous cell carcinoma (KSCC, ICD-O codes 8070 and 8071), differentiated non-keratinizing carcinoma (DNKC, ICD-O codes 8072 and 8073), undifferentiated non-keratinizing carcinoma (UNKC, ICD-O codes 8020, 8021, and 8082), and others (carcinoma not otherwise specified [NOS]; ICD-O code 8010).[ The primary endpoint was overall survival (OS), defined as the interval from diagnosis of NPC to death from any cause or the last follow-up.

Construction of the nomogram

Eligible patients were randomly divided into either the training or validation set with the split ratio of 7:3. The data of the training set were used to perform the survival analysis and establish the nomogram, whereas those from the validation set were applied to validate the prediction model. We included age (continuous variable), sex (male or female), race (White, Black, Asian/Pacific Islander, or American Indian/Alaskan Native), marital status (married, single, or others), grade (well differentiated, moderately differentiated, poorly differentiated, or undifferentiated), histological type (KSCC, DNKC, UNKC, or others), T stage (T1, T2, T3, or T4), N stage (N0, N1, N2, or N3), M stage (M0 or M1), RT (yes or no), and chemotherapy (yes or no/unknown) as covariates in the regression model. Univariate and multivariate Cox regression analyses were conducted to identify the significant prognostic factors of OS by the backward stepwise method. Clinical stage (I, II, III, or IV) overlapping with the variables of T, N, and M stages were not included in the multivariate analysis. A nomogram for predicting 3-, 5-, and 10-year survival probability of patients with NPC was constructed based on the outcomes of the multivariate analysis.

Validation of the nomogram

The performance of the nomogram was first quantified in the training cohort and then in the validation cohort with respect to discrimination, calibration, and clinical utility. The discrimination performances of the nomogram were quantitatively assessed by the concordance index (C-index) and the area under curve (AUC) of the receiver operating characteristic (ROC) curve.[ Both C-index and AUC ranged from 0.5 to 1, with 1 indicating perfect discrimination and 0.5 indicating no discrimination. The net reclassification improvement (NRI) and integrated discrimination improvement (IDI) were calculated to determine the overall improvement in the predictive accuracy of patients by using the nomogram in place of the traditional TNM staging system. NRI refers to the difference in proportions of patients with events correctly assigned a higher probability and patients without events correctly assigned a low probability by an updated model compared with the initial model.[ IDI represents the improvement in average sensitivity (i.e., the true positive rate) without reducing the average specificity (i.e., the true negative rate) of a new model compared with that of a baseline model.[ The nomogram calibration was investigated from the graphical representations of the consistency of the predicted probabilities and the observed outcomes based on 1000 bootstrap resamples.[ Finally, we conducted the decision curve analysis (DCA) to validate the clinical values of the nomogram and the traditional TNM staging system, which is a method for assessing whether the clinical usefulness of prediction models increased the net benefits when realistic threshold probabilities were considered.[ These analyses were conducted by using R version 3.5.1 (http://www.r-project.org/), and a two-sided P value < .05 was considered statistically significant.

Results

Patients’ characteristics

A total of 13,620 patients diagnosed with NPC as the primary cancer were retrieved from the SEER database. After a strict filtration, 9927 cases were excluded because of unknown clinicopathological information or inaccurate staging according to the AJCC TNM classification. Finally, 3693 patients were included in the present study, of which 2585 and 1108 cases comprised the training cohort and the validation cohort, respectively. Of the whole cohort, 1497 (40.5%) patients had died at the end of the follow-up with a median follow-up period of 34 months. The 3-, 5-, and 10-year survival rates of the entire cohort were 66.7%, 57.9%, and 44.2%, respectively. The age ranged from 7 years to 92 years with a median of 55 years. The majority of the patients were male (70.7%), married (61.4%), underwent RT (87.1%), and received chemotherapy (81.1%). With regard to grade, 115 (3.1%) patients were well differentiated, 519 (14.1%) patients were moderately differentiated, 1591 (43.1%) patients were poorly differentiated, and 1468 (39.7%) patients were undifferentiated. The most common histological type was KSCC (35.5%), followed by UNKC (22.7%), DNKC (21.5%), and others (20.3%). The proportion of Stage I, Stage II, Stage III, and Stage IV was 9.5% (351/3594), 23.6% (870/3693), 29.9% (1104/3693), and 37.0% (1368/3693), respectively. The characteristics of the patients in the training set and validation set are shown in Table 1.
Table 1

Patient characteristics of the training cohort and the validation cohort (n = 3693).

Patient characteristics of the training cohort and the validation cohort (n = 3693).

Survival analysis and nomogram development

The outcomes of univariate and multivariate analyses in the training set were shown in Table 2. Based on the data of the training set, the multivariate analyses showed that age, sex, race, marital status, histological type, T stage, N stage, M stage, RT, and chemotherapy were the key predictive factors for OS, indicating statistical significance (Table 2).
Table 2

Univariate and multivariate Cox regression analyses of the training cohort.

Univariate and multivariate Cox regression analyses of the training cohort. A nomogram for predicting 3-, 5-, and 10-year OS was constructed based on these significant variables in the training set (Fig. 1). By adding up the scores associated with each variable, and projecting total scores to the bottom scale, OS probabilities could be estimated at 3-, 5-, and 10-year time points.
Figure 1

Nomogram for predicting 3-, 5-, and 10-year overall survival of patients with nasopharyngeal carcinoma. A or PI = Asian/Pacific Islander, DNKC = differentiated nonkeratinizing carcinoma, KSCC = keratinizing squamous cell carcinoma, UNKC = undifferentiated nonkeratinizing carcinoma.

Nomogram for predicting 3-, 5-, and 10-year overall survival of patients with nasopharyngeal carcinoma. A or PI = Asian/Pacific Islander, DNKC = differentiated nonkeratinizing carcinoma, KSCC = keratinizing squamous cell carcinoma, UNKC = undifferentiated nonkeratinizing carcinoma.

Nomogram validation

With respect to the training cohort, the nomogram provided a higher C-index for OS prediction (0.754, 95% CI: 0.738–0.770) than the AJCC 7th staging system (0.61, 95% CI: 0.59–0.63; P < .001). Similarly, the C-index of the validation set (0.763, 95% CI: 0.741–0.785) was also superior to that of the AJCC 7th staging system (0.61; 95% CI: 0.58–0.64; P < .001). These results implied that the nomogram was more robust than the existing AJCC staging system. Moreover, we compared the predictive ability of these two models by calculating the AUC of the ROC curves. The outcomes demonstrated that the nomogram had a better predictive ability than the AJCC 7th staging system concerning the 3-, 5-, and 10-year OS in either the training set (Fig. 2A–2C) or the validation set (Fig. 2D–F).
Figure 2

Time-dependent receiver operating characteristics curves of nomogram and AJCC staging system. (A) 3-year, (B) 5-year, (C) 10-year overall survival in training set; (D) 3-year, (E) 5-year, (F) 10-year overall survival in validation set.

Time-dependent receiver operating characteristics curves of nomogram and AJCC staging system. (A) 3-year, (B) 5-year, (C) 10-year overall survival in training set; (D) 3-year, (E) 5-year, (F) 10-year overall survival in validation set. The results of NRI and IDI calculations were depicted in Table 3. The usage of multiple variables to construct a comprehensive nomogram significantly improved the risk reclassification for 3-, 5-, and 10-year overall mortality prediction compared with the AJCC 7th staging system in both sets.
Table 3

The net reclassification improvement and integrated discrimination improvement of the nomogram compared to the AJCC 7th staging system.

The net reclassification improvement and integrated discrimination improvement of the nomogram compared to the AJCC 7th staging system. The calibration plots exhibited an excellent consistency between the observed probability and the nomogram's prediction with respect to the 3-, 5-, and 10-year OS in the training (Fig. 3A–3C) and validation sets (Fig. 3D–F). This finding suggested the appreciable reliability of the nomogram.
Figure 3

Calibration plots of nomogram. (A) 3-year, (B) 5-year, (C) 10-year overall survival in training set; (D) 3-year, (E) 5-year, (F) 10-year overall survival in validation set.

Calibration plots of nomogram. (A) 3-year, (B) 5-year, (C) 10-year overall survival in training set; (D) 3-year, (E) 5-year, (F) 10-year overall survival in validation set. In addition, DCA exhibited preferable net benefit along with the threshold probability in the formulated nomogram compared with the AJCC 7th staging system at different time points (Fig. 4), indicating the favorable clinical utility of the nomogram.
Figure 4

Decision curve analyses of nomogram and AJCC staging system. (A) 3-year, (B) 5-year, (C) 10-year overall survival in training set; (D) 3-year, (E) 5-year, (F) 10-year overall survival in validation set.

Decision curve analyses of nomogram and AJCC staging system. (A) 3-year, (B) 5-year, (C) 10-year overall survival in training set; (D) 3-year, (E) 5-year, (F) 10-year overall survival in validation set.

Discussion

In this era of personalized cancer therapy, to determine whether an individual patient is at the risk of adverse clinical outcomes is important so that he/she can receive the most appropriate treatment. The practical application of the TNM staging system is restricted because it only provides risk stratification at a population level without offering an association between the individual patient and his/her corresponding prognosis. In contrast, the nomogram can serve as a scoring system and a visualized predicting tool to help physicians in easily matching patients with their expected OS through a simple calculation in clinical practice. In the present study, we establish a comprehensive and convenient nomogram to predict the OS probability at different time points for patients with NPC by using a large patient cohort from the SEER database. Validating the nomogram is essential to ensure that it can be applied generally and to avoid overfitting. Compared with the commonly used AJCC 7th staging system, the current nomogram showed excellent predictive accuracy and discriminative ability. NRI and IDI also quantitatively proved that this comprehensive nomogram has a significantly increased likelihood of unfavorable OS identification, which is more effective than the traditional TNM classification. However, perfect predictive accuracy is not the same as usefulness in clinical practice. When the threshold probabilities of the net benefit are impractical, a model with good performance may also have limited applicability.[ Therefore, we applied the DCA curves to investigate the clinic validity of our nomogram. In the DCA curve, the horizontal and vertical axes represent the probability threshold and the net benefit, respectively. The 3-, 5-, and 10-year DCA curves showed that the nomogram produces clinical net benefits in the training and validation cohorts, indicating that in can guide clinical interventions for patients.[ The classic TNM staging system depends purely on the anatomical extent of the malignancy and has ignored the effect of other variables on patients’ survival. These issues can impair the predictive efficiency of NPC prognosis. The comprehensive model can completely reflect the intratumor heterogeneity of patients and quantify noninvasively and quantitatively the risk on a macroscopic scale by combining and illustrating the relative importance of various prognostic factors.[ Thus, the nomogram represents a superior performance than the traditional staging system. Moreover, our nomogram can predict the survival probability at 3-, 5-, and 10-year time points that can be useful for a reasonable follow-up schedule. Improved accuracy of a model comes at the expense of additional complexity. To balance the tradeoffs between comprehensiveness and comprehensibility during the modeling for nomograms is hard. Thus, we only collect practical variables with clinical importance, high reproducibility, and low time-varying effects. After controlling for confounding factors, we have identified 10 independent prognostic factors, including age, sex, race, marital status, histological type, T stage, N stage, M stage, RT, and chemotherapy, to build the predictive model. Here, we declared several novel findings. First, the old patients with NPC signify the high risk of overall mortality, which is in line with previous studies.[ However, these previous studies treated patients’ age as a categorical variable, resulting in the loss of some valuable information to some degree. Second, Asian and Pacific Islander patients have a survival advantage compared with the other races, including the White and Black populations. The epidermal growth factor receptor gene polymorphisms may help explain these racial differences.[ Third, a married status induces a survival benefit that is often explained by psychosocial factors. Married individuals are likely to have a positive mode to overcome the mental pressure caused by an illness and exhibit high compliance and tolerance to anticancer therapy because their married status provides additional emotional and financial support.[ Moreover, spousal supervision and family relations may help married patients in maintaining a positive lifestyle. Fourth, up to 35.5% of the patients with NPC in the US presented a histological type of keratinizing disease. This proportion was higher than that in the Chinese patients.[ Finally, over 60% of the patients in our study were at advanced stages. This phenomenon may be attributed to the difficulty of early diagnosis in NPC. As for NPC, several nomograms had been established based on different research objectives. Tang[ enrolled 4630 patients with nondisseminated NPC to construct nomograms with or without plasma EBV DNA for predicting 3- and 5-year disease-free survival, and the proposed nomograms provided statistically significantly better discrimination than the TNM classification. Cho[ designed nomograms in response to RT and OS based on the clinical data of 270 NPC patients, and the nomograms showed perfect matches to the independent data set through an external validation process. Huang[ included 7251 NPC patients undergoing intensity-modulated RT to create predictive models for the probabilities of NPC-related and other cause-specific mortality, respectively. These nomograms offered effective tools for the management of patients with NPC, providing patient counseling and clinical assessment. Compared with the abovementioned predictive models, the formulated nomogram in the present study showed the following strengths. First, this nomogram is the first established for NPC prognosis on the basis of population-based data from multiple institutes rather than on the information of endemic patients from a single center of Eastern Asia. Second, the variables identified by our nomogram are easily obtained and can reflect the common status of patients and disease activity, providing clinically relevant information in NPC. Third, predicting the survival probability over 10 years is important for a nomogram because a large proportion of patients with NPC can achieve a relatively long survival time due to the improvement of medical technology. Fourth, this nomogram has a good ethnical representation because it was created based on the racially diverse population. Finally, our study is the first to consider marital status in nomogram construction. The limitations of this study should be acknowledged. First, a selection bias might be observed due to the retrospective nature and the inclusion of patients with complete information. Second, several valuable variables that influence the survival outcome of patients with NPC, such as EBV infection, genetic predisposition, and tumor biomarkers, were unavailable from the SEER database, leading to a potential impairment of the nomogram effectiveness. Third, although we included the RT and chemotherapy as predictive factors into the nomogram, other important pieces of information, including therapeutic regimen, dosage, and duration time, were not provided by the SEER dataset. Thus, we were unable to evaluate their impact on the survival outcome. Fourth, local recurrence is the main cause of treatment failure for NPC, but the SEER database did not contain this information. Finally, the nomogram of our study was built based on the US population because of the lack of external validation. Its generalization to the global population, especially patients with NPC in epidemic areas, was still unclear.

Conclusion

We developed a nomogram to predict the probability of OS in patients with NPC based on a large, population-based cohort with long-term follow-up. Compared with the 7th edition of AJCC staging system, this nomogram had a perfect performance in predictive accuracy, discrimination capability, and clinical utility in inferring individuals’ risk of all-cause mortality and guiding personalized treatment for those patients. Therefore, the nomogram could be used as a practical tool to predict prognosis at different time points. Owing to the limitations of this study, subsequent works must provide well-designed, multi-institutional, and prospective validation process to remedy the abovementioned limitations and achieve a high level of evidence.

Author contributions

Conceptualization: Liren Hu, Jun Lyu. Data curation: Jiayuan Wu, Quan Zhou, Yufeng Wang. Formal analysis: Jiayuan Wu, Zhenyu Pan. Methodology: Guanghua Chen, Jun Lyu, Shengpeng Wang. Validation: Jiayuan Wu, Zhenyu Pan, Liren Hu, Guanghua Chen, Jun Lyu, Shengpeng Wang. Visualization: Quan Zhou, Jun Lyu. Writing – original draft: Jiayuan Wu, Quan Zhou. Writing – review & editing: Zhenyu Pan, Yufeng Wang, Liren Hu, Jun Lyu. Jun Lyu orcid: 0000-0002-2237-8771. Shengpeng Wang orcid: 0000-0002-4818-4895. Guanghua Chen orcid: 0000-0002-8877-1322.
  6 in total

1.  Pattern and prognosis of distant metastases in nasopharyngeal carcinoma: A large-population retrospective analysis.

Authors:  Weiling Qu; Sihan Li; Miao Zhang; Qiao Qiao
Journal:  Cancer Med       Date:  2020-07-10       Impact factor: 4.452

2.  Development and Validation of a Nomogram for Predicting Overall Survival in Patients with Second Primary Small Cell Lung Cancer After Non-Small Cell Lung Cancer: A SEER-Based Study.

Authors:  Ju Zhu; Haoming Shi; Haoyu Ran; Qiancheng Lai; Yue Shao; Qingchen Wu
Journal:  Int J Gen Med       Date:  2022-04-02

3.  Development and Validation of a Nomogram for Predicting the Unresolved Risk of Parents of Adolescents With Psychiatric Diagnoses.

Authors:  Qingqing Sheng; Chunfeng Cai; Pingdong Li; Lihua Chen; Xi Zhang; Xinyu Wang; Yucui Gong
Journal:  Front Psychiatry       Date:  2022-04-01       Impact factor: 5.435

4.  Prognostic factors in metastatic nasopharyngeal carcinoma.

Authors:  Nabil Toumi; Sana Ennouri; Ilhem Charfeddine; Jamel Daoud; Afef Khanfir
Journal:  Braz J Otorhinolaryngol       Date:  2020-07-04

5.  Construction of nomograms for nasopharyngeal carcinoma containing primary tumor size and SEER stage.

Authors:  Guangrong Yang; Bangyu Luo; Qiao Yang; Mingjing Chen; Xiu Yang; Jianguo Sun
Journal:  Transl Cancer Res       Date:  2020-11       Impact factor: 1.241

6.  Development of a Radiotherapy Localisation Computed Tomography-Based Radiomic Model for Predicting Survival in Patients With Nasopharyngeal Carcinoma Treated With Intensity-Modulated Radiotherapy Following Induction Chemotherapy.

Authors:  Xiaoyue Li; Han Chen; Feipeng Zhao; Yun Zheng; Haowen Pang; Li Xiang
Journal:  Cancer Control       Date:  2022 Jan-Dec       Impact factor: 3.302

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.