Literature DB >> 31516573

Nomogram for predicting the survival of patients with malignant melanoma: A population analysis.

Jin Yang1,2, Zhenyu Pan1,2,3, Quan Zhou4, Qingqing Liu1,2, Fanfan Zhao1,2, Xiaojie Feng1,2, Jun Lyu1,2,5.   

Abstract

The aim of the current study was to develop and validate a nomogram based on a large population to estimate the 3- and 5-year survival rates of patients with malignant melanoma (MM). Patients were selected from the Surveillance, Epidemiology and End Results database and randomly divided into the training and validation cohorts. A nomogram was developed, and was used to assess the accuracy of the model. Independent prognostic factors associated with overall survival (OS) rate were identified through multivariate analysis, and were included in the internal validation of the nomogram. The nomogram provided high C-indexes for the training cohort [area under the time-dependent receiver operating characteristic curve (AUC) of 0.877 for 3-year OS rate and 0.872 for 5-year OS rate] and the validation cohort (AUC of 0.880 for 3-year OS rate and 0.874 for 5-year OS rate), indicating that the model had good discrimination ability. Calibration plots showed that the predicted 3- and 5-year OS rates probabilities for the training and validation groups were almost identical to the actual observations. The 3- and 5-year decision curves indicated net benefits for both the training and validation cohorts. The nomogram may aid clinicians to provide more accurate prognosis prediction in patient consultations and more personalized postoperative management plans.

Entities:  

Keywords:  Epidemiology and End Results program; Surveillance; malignant melanoma; nomogram; risk factors; survival rate

Year:  2019        PMID: 31516573      PMCID: PMC6732986          DOI: 10.3892/ol.2019.10720

Source DB:  PubMed          Journal:  Oncol Lett        ISSN: 1792-1074            Impact factor:   2.967


Introduction

The incidence of melanoma is increasing worldwide (1–3). A statistical fact sheet from the Surveillance, Epidemiology, and End Results (SEER) program of the National Cancer Institute (NCI) stated that the estimated number of new melanoma cases in 2019 in the USA was 96,480, accounting for 5.5% of all new cancer cases, and that the estimated death toll was 7,230 (https://seer.cancer.gov/statfacts/html/melan.html). Malignant melanoma (MM) is the most aggressive and deadly type of skin cancer (4). MM has a rapid development, high metastasis rate and poor prognosis; therefore, its early diagnosis and treatment are particularly important (5,6). The SEER program of the NCI is a coordinated system of population-based state cancer registries collecting demographic, clinical and outcome information on all cancer cases diagnosed in representative geographic regions and subpopulations (7,8). It includes 18 registries that cover 30% of the population in the USA (9). Nomograms incorporating and demonstrating important prognostic factors have been widely used in cancer prognosis predictions, and have become reliable and convenient tools for quantifying risk associated with diseases (10,11). They can reduce statistical predictive models to a single numerical estimate of the probability (PI) of an event (12,13). Several authors have reported that nomograms are more accurate in prognosis prediction than the traditional staging systems for patients with prostate, colon, breast and stomach cancer (14–18). Previous studies have reported a nomogram of melanoma at a site, or a nomogram of lymph node status in patients with melanoma (19,20). Our previous study also only reported a nomogram of nodular melanoma (21). However, there is no nomogram for predicting survival in patients with MM the from SEER database. The aim of the current study was to develop and validate a nomogram based on a large population that could be used to estimate the 3- and 5-year survival rates of patients with MM. In addition to the American Joint Committee on Cancer (AJCC) staging, predictors such as age, sex and insurance status were included to provide a personalized and accurate assessment of patient survival.

Materials and methods

Patient selection

A retrospective search of the SEER (seer.cancer.gov) database was performed for cases of MM diagnosed between January 2007 and December 2015. The SEER database was accessed using the SEER*Stat software (version 8.3.4, http://seer.cancer.gov/seerstat/). Patients were identified using the International Classification of Diseases for Oncology-O-3 (22) histological type codes of 8720/3-8723/3, 8726/3, 8727/3, 8730/3, 8740/3-8746/3, 8761/3, 8770-8773/3, 8780/3 and 8790/3, and primary site codes of C44, C51.0, C60.9 and C63.2, and were used as the inclusion criteria. Cases that were not confirmed by microscopy or only by autopsy were excluded, as were those with unknown or incomplete variables. Additionally, patients <18 years were excluded. The selection criteria applied resulted in a total of 110,727 eligible patients. As data on cancer is reported by the SEER database each year, informed patient consent was not required to utilize the data released by the SEER database. The present study was exempted from the requirement for ethical approval and patient consent by the Institutional Research Committee of The First Affiliated Hospital of Xi'an Jiaotong University.

Nomogram construction

For the construction and validation of the nomogram, 70% (n=77,508) of the patients were randomly assigned to the training cohort and 30% (n=33,219) were assigned to the validation cohort. The following variables were assessed: Age at diagnosis, ethnicity, sex, marital status, tumor primary site, AJCC and SEER stage, insurance status and median family income.

Validation of the nomogram

The nomogram was validated using discrimination and calibration with the validation cohort. The predictive accuracy of the nomogram was evaluated by the area under the time-dependent receiver operating characteristic (ROC) curve. Concordance index (C-index), proposed by Frank E. Harrell Jr, Professor of biostatistics at Vanderbilt University in 1996 (23), is used to calculate the discrimination between the predicted and real values of Cox models in survival analysis. In general, a C-index >0.75 is considered to represent relatively good discrimination (24). The agreement between the predicted PI and the actual outcome was evaluated by calibration plotting. Calibration represents the ability of a model to provide unbiased estimates of outcome, and a perfectly accurate nomogram would result in a plot on which predictions fall along a 45° diagonal line. Discrimination and calibration were evaluated using bootstrapping with 500 resamples. Bootstrapping is a random sampling method and the sampling was repeated 500 times using R software (https://www.r-project.org/; version 3.6.1), increasing the accuracy. The specificity and sensitivity may be determined from the ROC curve when evaluating a diagnostic method. Vickers and Elkin (25) developed a method of evaluation termed decision curve analysis (DCA), which was used to test the clinical value of the predictive models in the present study, in terms of the net benefit. DCA is a simple way to evaluate clinical predictive models. Traditional models, such as ROC curves, only measure the accuracy of predictive model diagnosis, in order to consider the clinical utility of the model, while the DCA curve considers clinical utility (26).

Statistical analysis

Cox regression, nomogram, C-index, AUC, calibration plotting and DCA were performed using SPSS (version 24.0; IBM Corp.) and R software. P<0.05 was considered to indicate a statistically significant difference. The continuous variable, age, was assessed for normal distribution using Shapiro-Wilk test. Kaplan-Meier curves were generated using R software. Clinical pathological characteristics of the training and validation cohorts were selected using the backward stepwise selection method in the Cox regression model. Only variables that were statistically significant in the univariate Cox regression models were analyzed in multivariate Coxregression models.

Results

Patient characteristics

The present study included 110,727 patients with MM. The median age (25th-75th percentile: 52–74) at the time of diagnosis of was 62 years in the two cohorts. The majority of the patients were male (59.7% for the training cohort; 60.1% for the validation cohort), white (98.5%) and married (68.5%). The most common primary tumor site was the trunk (31.1%). The majority of the patients were in AJCC stage I (69.7%). In both cohorts, 82.3% patients had a localized tumor, 11.7% patients had regional metastasis and 6.0% patients had distant metastasis. The majority of the patients were insured (93.8%) and had a median family income of USD 50,060-98,030 (82.2%; Table I).
Table I.

Clinical characteristics of patients with malignant melanoma in the present study.

CharacteristicTraining cohortValidation cohort
Median age at diagnosis, (25th-75th percentile)62 (52–74)62 (52–74)
Ethnicity, n (%)
  White76,377 (98.5)32,735 (98.5)
  Black  421 (0.5)  163 (0.5)
  Other  710 (1.0)  321 (1.0)
Sex, (%)
  Male46,286 (59.7)19,949 (60.1)
  Female31,222 (40.3)13,270 (39.9)
Marital status, n (%)
  Married52,970 (68.5)22,748 (68.5)
  Single11,739 (15.0)  4,982 (15.0)
  Divorced, separated or widowed12,799 (16.5)5,489 (16.5)
Site, n (%)
  Head and neck16,870 (21.8)  7,202 (21.8)
  Trunk24,075 (31.1)10,407 (31.1)
  Upper Limbs19,472 (25.1)  8,326 (25.1)
  Lower Limbs13,775 (17.8)  5,905 (17.8)
  Other  3,316 (4.2)  1,379 (4.2)
AJCC stage, n (%)
  I53,935 (69.7)23,143 (69.7)
  II12,163 (15.6)  5,177 (15.6)
  III  7,222 (9.3)  3,070 (9.2)
  IV  4,188 (5.4)  1,829 (5.5)
SEER stage, n (%)
  Localized63,803 (82.3)27,355 (82.3)
  Regional  9,076 (11.7)  3,872 (11.7)
  Distant  4,629 (6.0)  1,992 (6.0)
Insurance status, n (%)
  Any medicaid  3,209 (4.1)  1,413 (4.2)
  Insured72,713 (93.8)31,132 (93.8)
  Uninsured  1,586 (2.1)  674 (2.0)
Median family income, USD, n (%)
  24,880-50,000  5,461 (7.2)  2,385 (7.2)
  50,060-98,03063,817 (82.2)27,311 (82.2)
  100,190-125,990  8,230 (10.6)  3,523 (10.6)

SEER, Surveillance, Epidemiology, and End Results; AJCC, American Joint Committee on Cancer; USD, United states dollar.

Cox regression analysis of the training cohort

Following univariate Cox regression analysis, data on age at diagnosis, ethnicity, sex, marital status, tumor primary site, AJCC and SEER stage, insurance status and median family income were entered into the multivariate Cox regression analysis. The multivariate Cox regression analysis revealed that age [hazard ratio (HR)=1.015; P<0.001], being single (HR=1.095 vs. married; P=0.011), being divorced, separated or widowed (DSW) (HR=1.184 vs. married; P<0.001), AJCC stage II (HR=5.759 vs. AJCC stage I; P<0.001), AJCC stage III (HR=9.320 vs. AJCC stage I; P<0.001), AJCC stage IV (HR=19.148 vs. AJCC stage I; P<0.001), having regional metastasis (HR=1.592 vs. localized; P<0.001), having distant metastasis (HR=1.858 vs. localized; P<0.001), and not being insured (HR=1.201 vs. any medicaid insurance; P=0.013) were significant risk factors for the overall survival (OS) rate. Locations in the trunk (HR=0.893 vs. head and neck; P=0.001), upper limbs (HR=0.753 vs. head and neck; P<0.001) and lower limbs (HR=0.866 vs. head and neck; P<0.001) were protective factors. Ethnicity was not statistically significant in the multivariate Cox regression (P>0.05). (Table II).
Table II.

Univariate and multivariate Cox regression analysis (training cohort).

Univariate analysisMultivariate analysis


VariableHR95% CIP-valueHR95% CIP-value
Age at diagnosis1.0201.019–1.022<0.0011.0151.013–1.016<0.001
Ethnicity1.4771.357–1.609<0.001
  White
  Black
  Other
Sex0.6120.582–0.645<0.001
  MaleReference
  Female0.8060.763–0.852<0.001
Marital status1.2601.225–1.297<0.001
  MarriedReference
  Single1.0951.021–1.1740.011
  DSW1.1841.114–1.259<0.001
Site1.3931.365–1.423<0.001
  Head and neckReference
  Trunk0.8930.832–0.9580.001
  Upper limbs0.7530.695–0.816<0.001
  Lower limbs0.8660.797–0.941<0.001
  Other1.0110.930–1.0990.800
AJCC stage3.2493.183–3.316<0.001
  IReference
  II5.7595.300–6.258<0.001
  III9.3208.132–10.682<0.001
  IV19.14815.089–24.299<0.001
SEER stage4.7194.594–4.847<0.001
  LocalizedReference
  Regional1.5921.420–1.785<0.001
  Distant1.8581.487–2.321<0.001
Insurance status0.6120.560–0.670<0.001
  Any medicaidReference
  Insured0.7620.698–0.832<0.001
  Uninsured1.2011.039–1.3880.013
Median family income, USD0.7460.705–0.790<0.001
  24,880-50,000Reference
  50,060-98,0300.9160.843–0.9940.035
  100,190-125,9900.8000.713–0.896<0.001

HR, hazard ratio; CI, confidence interval; SEER, Surveillance, Epidemiology, and End Results; AJCC, American Joint Committee on Cancer; USD, United states dollar.

Nomogram validation

Independent prognostic factors associated with the OS rate identified in the multivariate analysis were incorporated into the nomogram internal validation. The nomogram was used by first drawing a vertical line up to the points row to obtain the points for each variable, adding up the points for all the variables to obtain the total points, and drawing a vertical line down from the total points row to obtain the 3- and 5-year OS rates (Fig. 1).
Figure 1.

Nomogram predicting 3- and 5-year survival. F, female; M, male; M, married; S, single; DSW, divorced, separated or widowed; UL, upper limbs; LL, lower limbs; T, trunk; HN, head and neck; O, other; SEER, Surveillance, Epidemiology, and End Results; L, localized; R, regional; D, distant. status: In, insured; AM, any medicaid (Indian/Public Health Service; Medicaid, Medicaid-Administered through a managed care plan; Medicare with Medicaid eligibility); Unin, uninsured; AJCC, American Joint Committee on Cancer. The nomogram is used by first giving each variable a score on its points scale. The scores for all variables are then added to obtain the total score and a vertical line is drawn from the total-points row to estimate the 3-year and 5-year survival rates.

Performance of the nomogram

Based on the C-index analysis of the SEER training cohort, the nomogram provided high C-indexes for the 3- and 5-year OS rates, with AUCs of 0.877 and 0.872, respectively (Fig. 2A). Similarly, the corresponding values for the validation cohort were high, 0.880 and 0.874, respectively, indicating that the model had good discrimination ability (Fig. 2B). As presented in Fig. 3, the predicted 3- and 5-year OS rate probabilities for the SEER training and validation groups were almost identical to the actual observations.
Figure 2.

AUCs of the nomogram. (A) Training and (B) validation cohorts. AUC, area under the curve; ROC, receiver operating characteristic; TP, true positive rate; FP, false positive rate.

Figure 3.

Calibration curves for the nomogram. The x-axis is the predicted survival calculated by the nomogram, and the y-axis is the actual survival estimated by the Kaplan-Meier method. The 95% confidence intervals of the Kaplan-Meier estimates are indicated by black vertical lines at each point. The red dashed line presents the reference line and is a 45° diagonal. The closer the black line drawn by the model is to the red dashed line, the better the model. (A) Calibration plot of the 3-year OS for the training cohort. (B) Calibration plot of the 5-year OS for the training cohort. (C) Calibration plot of the 3-year OS for the validation cohort. (D) Calibration plot of the 5-year OS for the validation cohort. OS, overall survival.

DCA

The abscissa of a decision curve is the threshold PI and the ordinate is the net benefit rate. A horizontal line indicates that all samples are negative and not treated (with a net benefit of zero), i.e. these patients are disease-free and do not require treatment. An oblique line, with a positive slope, indicates that all samples are positive and the patients had received treatment, and the net benefit is indicated by an oblique line with a negative slope. In our model, the 3- and 5-year DCA curves yielded net benefits for both the training and validation cohorts (Fig. 4).
Figure 4.

DCA of the 3- and 5-year OS for the training and validation cohorts. The abscissa represents the threshold probability and the ordinate represents the net benefit rate. The horizontal line indicates that all samples are negative and all are not treated, with a net benefit of zero. The oblique line indicates that all samples are positive. The net benefit is represented by a negative slope. The dashed line does not coincide with the other two lines, and when it is in the upper right corner, it means that the model is valuable. (A) DCA of the 3-year OS for the training cohort. (B) DCA of the 5-year OS for training cohort, (C) DCA of 3-year OS for the validation cohort. (D) DCA of the 5-year OS for the validation cohort. DCA, decision curve analysis; OS, overall survival.

Discussion

The AJCC staging manual has become the benchmark for classifying patients with cancer, predicting their prognosis and selecting appropriate treatment approaches (27). However, the prognosis may differ in patients at the same pathological stage due to other influencing factors such as age, sex, histological type and adjuvant chemotherapy, all of which may affect the OS rate (16). MM is one of the most aggressive forms of cutaneous neoplasm, and its incidence is increasing (28). Therefore, a more individualized approach for predicting the survival of patients with MM is required. Nomograms may provide an accurate and informative prognosis for individual patients (12,29,30), and have already been applied for prognosis prediction of various tumor types. For example, Cai et al (31) developed a nomogram for hepatic functional reserve and tumor characteristics of hepatocellular carcinoma following therapeutic hepatectomy. Zhang et al (32) established a gallbladder cancer-specific survival model to predict the prognosis of patients with non-metastatic gallbladder cancer following non-surgical treatment. Pietrantonio et al (33) established a nomogram for predicting the survival PI of patients with advanced gastric cancer. The SEER database provides a large number of samples for exploring risk factors and developing accurate predictive models. The analysis of data collected by the SEER database renders the results obtained in the current study more generalizable than those from single-center studies. Our group has recently published an article on nodular melanoma, which is limited to nodular tumors (21). However, the present study investigated the prognosis of all malignant melanomas and includes more cases and is more comprehensive/accurate. To the best of our knowledge, the present study is the first to use a nomogram to develop a predictive model for predicting 3- and 5-year survival rates in patients with MM. The present study included 110,727 cases from the SEER database, and 70 and 30% of the patients were randomly assigned to the training and validation cohorts, respectively. Factors that affect patient survival were assessed using the Cox proportional-hazards model. Variables that were significant in the univariate analysis were included in the final multivariate model. The multivariate Cox regression analysis revealed that the age at diagnosis was a risk factor for survival in patients with MM. As for all types of cancer, the incidence of melanoma increases with age (34), which is consistent with the results obtained in the current study. Compared with being married, being single and DSW were risk factors for survival, which has also been observed in previous studies (35,36). Being female was a protective factor, as was having localized disease. McLaughlin et al (37) found that among married, single, divorced or separated patients, the risk of a late diagnosis was >50% higher in males than in females. A previous study investigating head and neck MM found that the prognosis was better in female patients compared with male patients (38). Ethnicity was not statistically significant in the multivariate Cox regression. Compared with the head and neck, locations in the trunk, upper and lower limbs were protective factors. The results of ethnicity and site remain to be interpreted, but may be due to the use of different data and the specific factors selected in the predictive model. The present study included DCA, which is a recently described analytical technique. Although novel, a previous study has recommended the use of DCA (26). The present study revealed that the 3- and 5-year DCA curves yielded net benefits for both the training and validation cohorts, indicating that the model provides certain clinical benefits. For example, in a validation set of three years of survival, the PI of a patient diagnosed with MM is recorded as PI, and when the threshold of PI reaches 20%, it is defined as positive and treatment is received. At this point, there will be patients benefiting, and there will be non-patients receiving treatment, causing damage, and patients not receiving treatment, which is a loss. The ordinate is the net benefit, and two of the 100 people would have a net benefit. The present study was subject to some limitations. Firstly, only patients for whom complete information was available were included, which may have introduced selection bias. Secondly, common variables associated with prognosis, such as smoking history and genetic mutations (39), were not included in the model as no relevant data was available in the SEER database. Thirdly, an internal validation approach was used to evaluate the model performance. Although the model exhibited good performance, external verification is required to estimate its accuracy and verify its utility for decision-making. Fourthly, the values predicted by the nomogram should be considered as reference values, rather than representing certain prognoses. Finally, the current study was based on retrospective data, and so inherent bias was inevitable. The present study developed and internally validated a nomogram for predicting the 3- and 5-year survival rates in patients with MM. This nomogram predicted the survival rate of individual patients with a high C-index, and was found to be well calibrated and the DCA curves yielded net benefits. The results obtained in the current study may aid clinicians to provide more accurate prognosis predictions in patients with MM.
  4 in total

1.  Novel Prognostic Models Predicting the Cancer-Specific Survival in Patients with Cutaneous Melanoma Based on Metastatic Lymph Node Status.

Authors:  Wei Li; Xuewen Xu; Yange Zhang
Journal:  Ann Surg Oncol       Date:  2021-01-11       Impact factor: 5.344

2.  A prognostic nomogram for stage II/III rectal cancer patients treated with neoadjuvant chemoradiotherapy followed by surgical resection.

Authors:  Yanfei Lin
Journal:  BMC Surg       Date:  2022-07-04       Impact factor: 2.030

3.  A Novel Nomogram for Prediction and Evaluation of Lymphatic Metastasis in Patients With Renal Cell Carcinoma.

Authors:  Wenle Li; Bing Wang; Shengtao Dong; Chan Xu; Yang Song; Ximin Qiao; Xiaofeng Xu; Meijin Huang; Chengliang Yin
Journal:  Front Oncol       Date:  2022-04-11       Impact factor: 5.738

4.  Development of a Risk Nomogram Model for Identifying Interstitial Lung Disease in Patients With Rheumatoid Arthritis.

Authors:  Jing Xue; Wenfeng Hu; Shuang Wu; Jing Wang; Shuhong Chi; Xiaoming Liu
Journal:  Front Immunol       Date:  2022-06-16       Impact factor: 8.786

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.