Literature DB >> 35935563

A nomogram for survival prediction in 275,812 U.S. patients with breast cancer: a population-based cohort study based on the SEER database.

Zhe Wang1, Lei Xing2, Xinrong Luo2, Guosheng Ren2,3.   

Abstract

Background: Nomograms can assess the risk of clinicopathological features by quantifying the biological and clinical variables of cancer patients. However, the nomogram based on significant factors that influence the survival of breast cancer in a large population has been rarely explored. This study was to investigate the predictive effectiveness of a nomogram for the survival of patients with breast cancer.
Methods: Demographic and clinical data of 275,812 breast cancer patients were extracted from the Surveillance, Epidemiology, and End Results (SEER) database. All patients aged ≥20 years in this retrospective cohort study were classified as two groups in a random manner, namely the training set (n=193,069) and validation set (n=82,743). The outcomes of our study were the 3- and 5-year survival of breast cancer. The potential predictors of cancer mortality were screened by univariate and multivariable Cox regression analyses. The nomogram was conducted based on the predictors. Harrell's concordance index (C-index), receiver operating characteristic (ROC) curves and calibration curve was utilized to evaluate the performance of the nomogram.
Results: The age at diagnosis, race, marital status, tumor size, first malignant primary indicator, American Joint Committee on Cancer (AJCC) T stage, M stage, tumor grade, and number of malignant tumors were independent predictors for the death of patients with breast cancer. The C-indexes of the training set and the validation set were 0.782 and 0.778, respectively. The area under the curve (AUC) values of the nomogram for predicting the 3- and 5-year survival of breast cancer were 0.770 and 0.756, respectively. Furthermore, the C-index values of our nomogram were 0.816, 0.775, 0.773, 0.734, and 0.750 for predicting survival in Asian, White, Hispanic, American Indian, and Black populations, respectively. Conclusions: The nomogram may have predictive performance for predicting the 3- and 5-year survival of breast cancer patients, and future studies need to validate our findings. 2022 Gland Surgery. All rights reserved.

Entities:  

Keywords:  Breast cancer; nomogram; prediction; survival time

Year:  2022        PMID: 35935563      PMCID: PMC9346213          DOI: 10.21037/gs-22-321

Source DB:  PubMed          Journal:  Gland Surg        ISSN: 2227-684X


Introduction

Breast cancer is one of the most lethal malignant tumors and has become the second leading cause of female death (1). It is estimated that about 287,850 females have suffered from breast cancer and approximately 43,250 patients have died from the cancer in 2022 so far (1). With advancements in surgery, chemotherapy, radiation, and hormone therapy, the survival of breast cancer has been improved in the past few decades (2-4). The improvement of multimodal therapies is significant for the survival of cancer patients, with a 5-year relative survival over 90% in developed countries (5). However, these women still have a poor prognosis worldwide. Nomograms, a statistic-based tool, can assess the risk of clinicopathological features by quantifying the biological and clinical variables of patients with cancer (6). To date, nomograms have been widely applied in the personalized prediction of cancer, such lung, cervical, prostate, and hepatocellular carcinomas (7-11). To the best of our knowledge, several studies have reported risk factors associated with breast cancer (12-17). However, the nomogram based on significant factors that influence the survival of breast cancer in a large population has been rarely explored. Huang et al. (18) reported a nomogram to assess the overall mortality risk in young breast cancer patients. Johnson et al. (19) mentioned that the incidence of breast cancer in adolescents and young adults is relatively low. It is of clinical significance to establish a model to predict the mortality risk of all breast cancer patients. Herein, we attempted to investigate the predictive factors for the death of breast cancer patients, and develop and validate a nomogram based on the Surveillance, Epidemiology, and End Results (SEER) database, with the aim of reliably predicting the 3- and 5-year survival of breast cancer patients. In addition, we further explored the predictive performance of our nomogram in different ethnicities. We present the following article in accordance with the TRIPOD reporting checklist (available at https://gs.amegroups.com/article/view/10.21037/gs-22-321/rc).

Methods

Study design and population

In this retrospective cohort study, data on patients with breast cancer were extracted from the SEER database, a public registry from the National Cancer Institute (NCI), which covers approximately 27.8% of the U.S. cancer population (20). A total of 275,812 breast cancer patients aged 18 years and over were included. These patients were diagnosed as breast cancer via histopathological examination between 1975 and 2015. The study was carried out in accordance with the Declaration of Helsinki (as revised in 2013).

Study variables

Basic information on patients was collected, including the age at diagnosis (20–39, 40–59, 60–79, or ≥80 years), year of diagnosis, race (American Indian, Asian, Black, Hispanic, and White), marital status (single, married, divorced/separated, or widowed), vital status (alive or dead), American Joint Committee on Cancer (AJCC) T stage (Tis & T1, T2, T3, or T4), N stage (N0, N1, N2, or N3), grade (I, II, III, or IV), type of reporting source item (hospital inpatients or others), regional nodes positive (no or yes), tumor size (<4 or ≥4 cm), laterality (right or left), first malignant primary indicator (no or yes), and number of malignant tumors (1, 2, or ≥3). The outcomes of our study were the 3- and 5-year survival rates of breast cancer patients. The follow-up duration was 5 years. The follow-up was terminated when the death occurred during the follow-up period.

Establishment and validation of the nomogram

All eligible patients were randomly classified as two groups in a random manner, namely the training set (n=193,069) and validation set (n=82,743). Univariate and multivariate Cox regression analyses were used to assess the predictors of cancer mortality using the training set. The regression model was recalculated using significant variables from the multivariate Cox regression analysis. The nomogram was drawn to predict the 3- or 5-year survival of patients. Harrell’s concordance index (C-index) was utilized to assess the performance of the nomogram (no predictive power: C=0.50, low predictive power: 0.51–0.70, moderate predictive power: 0.71–0.90, and high predictive power: >0.90) (21). The discriminative ability of the nomogram was evaluated using the receiver operating characteristic (ROC) and calibration curves.

Statistical analysis

Normal distributed measurement data were compared through one-way analysis of variance and shown as the mean ± standard deviation (), and those with skewed distribution were compared via the rank-sum test and presented as the median and quartile [M (Q1, Q3)]. The χ2 test or Fisher’s exact test was utilized for comparisons between enumeration data that were expressed as number and proportion [n (%)]. R software (version 4.0.2, The R Foundation for Statistical Computing, Vienna, Austria) and SAS software (version 9.4, SAS Institute Inc., USA) were used for statistical analysis. Two-sided P<0.05 was deemed statistically significant.

Results

The characteristics of patients with breast cancer

Totally 275,812 women with breast cancer who were registered in the SEER database were enrolled in this study. Among them, 193,069 cases were in the training set, with a mean survival time of 42.22±21.36 months. The ages at diagnosis were 20–39 (n=9,329, 4.83%), 40–59 (n=79,750, 41.31%), 60–79 (n=89,615, 46.42%), and ≥80 (n=14,375, 7.45%) years. In terms of marital status, 58.87% (n=113,668) of cases were married, 15.42% (n=29,768) were single, 13.13% (n=25,347) were widowed, and 12.58% (n=24,286) were divorced or separated. The comparison of the baseline characteristics of women with breast cancer between the training and validation sets is shown in . There were no differences between the two sets regarding age at diagnosis (20–39, 40–59, 60–79, or ≥80 years), year of diagnosis (2010, 2011, 2012, 2013, 2014, or 2015), race (American Indian, Asian, Black, Hispanic, and White), marital status (single, married, divorced/separated, or widowed), AJCC T stage (Tis & T1, T2, T3, or T4), N stage (N0, N1, N2, or N3), grade (I, II, III, or IV), type of reporting source item (hospital inpatients or others), regional nodes positive (no or yes), tumor size (<4 or ≥4 cm), laterality (right or left), first malignant primary indicator (no or yes), number of malignant tumors (1, 2, or ≥3), and vital status (alive or dead), all with P>0.05.
Table 1

The baseline characteristics of patients with breast cancer between the two groups

VariablesTotal (n=275,812)GroupsStatisticsP
Training set (n=193,069)Validation set (n=82,743)
Age at diagnosis (years), n (%)Z=1.0700.285
   20–3913,381 (4.85)9,329 (4.83)4,052 (4.90)
   40–59113,673 (41.21)79,750 (41.31)33,923 (41.00)
   60–79128,145 (46.46)89,615 (46.42)38,530 (46.57)
   ≥8020,613 (7.47)14,375 (7.45)6,238 (7.54)
Year of diagnosis, n (%)Z=−1.6840.092
   201042,881 (15.55)29,905 (15.49)12,976 (15.68)
   201144,062 (15.98)30,759 (15.93)13,303 (16.08)
   201245,105 (16.35)31,568 (16.35)13,537 (16.36)
   201346,833 (16.98)32,832 (17.01)14,001 (16.92)
   201447,397 (17.18)33,296 (17.25)14,101 (17.04)
   201549,534 (17.96)34,709 (17.98)14,825 (17.92)
Race, n (%)χ2=3.2440.518
   American Indian1,467 (0.53)1,042 (0.54)425 (0.51)
   Asian23,857 (8.65)16,777 (8.69)7,080 (8.56)
   Black28,900 (10.48)20,267 (10.50)8,633 (10.43)
   Hispanic30,243 (10.97)21,089 (10.92)9,154 (11.06)
   White191,345 (69.38)133,894 (69.35)57,451 (69.43)
Marital status, n (%)χ2=6.3730.095
   Single42,591 (15.44)29,768 (15.42)12,823 (15.50)
   Married162,606 (58.96)113,668 (58.87)48,938 (59.14)
   Separated/divorced34,694 (12.58)24,286 (12.58)10,408 (12.58)
   Widowed35,921 (13.02)25,347 (13.13)10,574 (12.78)
T stage, n (%)Z=0.2670.790
   Tis & T1168,672 (61.15)118,114 (61.18)50,558 (61.10)
   T283,921 (30.43)58,670 (30.39)25,251 (30.52)
   T316,301 (5.91)11,488 (5.95)4,813 (5.82)
   T46,918 (2.51)4,797 (2.48)2,121 (2.56)
N stage, n (%)Z=−0.6920.489
   N0186,438 (67.60)130,400 (67.54)56,038 (67.73)
   N165,622 (23.79)46,094 (23.87)19,528 (23.60)
   N214,851 (5.38)10,386 (5.38)4,465 (5.40)
   N38,901 (3.23)6,189 (3.21)2,712 (3.28)
Tumor size (cm), n (%)χ2=0.4850.486
   <49,388 (3.40)6,602 (3.42)2,786 (3.37)
   ≥426,6424 (96.60)186,467 (96.58)79,957 (96.63)
Grade, n (%)Z=−1.0550.292
   I63,996 (23.20)44,701 (23.15)19,295 (23.32)
   II122,430 (44.39)85,699 (44.39)36,731 (44.39)
   III88,632 (32.13)62,154 (32.19)26,478 (32.00)
   IV754 (0.27)515 (0.27)239 (0.29)
Type of reporting source item, n (%)χ2=0.0010.986
   Hospital inpatients263,759 (95.63)184,631 (95.63)79,128 (95.63)
   Others12,053 (4.37)8,438 (4.37)3,615 (4.37)
Regional nodes positive, n (%)χ2=0.9880.320
   No189,198 (68.60)132,328 (68.54)56,870 (68.73)
   Yes86,614 (31.40)60,741 (31.46)25,873 (31.27)
Laterality, n (%)χ2=1.0760.300
   Left139,797 (50.69)97,983 (50.75)41,814 (50.53)
   Right136,015 (49.31)95,086 (49.25)40,929 (49.47)
First malignant primary indicator, n (%)χ2=0.7840.376
   No39,445 (14.30)27,537 (14.26)11,908 (14.39)
   Yes236,367 (85.70)165,532 (85.74)70,835 (85.61)
Malignant tumors, n (%)Z=0.4370.662
   1210,697 (76.39)147,513 (76.40)63,184 (76.36)
   253,762 (19.49)37,699 (19.53)16,063 (19.41)
   ≥311,353 (4.12)7,857 (4.07)3,496 (4.23)
Vital status, n (%)χ2=0.4150.519
   Alive249,773 (90.56)174,887 (90.58)74,886 (90.50)
   Dead26,039 (9.44)18,182 (9.42)7,857 (9.50)
Survival time (months), x¯±s42.26±21.3742.22±21.3642.35±21.39t=−1.4500.148

Predictors of the mortality of breast cancer patients in the training set

The results of univariate Cox regression analysis are listed in . There were significant differences in age at diagnosis, race, marital status, T stage, N stage, tumor size, grade, type of reporting source item, regional nodes positive, first malignant primary indicator, and the number of malignant tumors (all P<0.001).
Table 2

Univariate Cox regression analysis for the mortality of breast cancer patients

VariablesβS.Eχ2PHR95% CI
LowerUpper
Age at diagnosis (years)
   20–39Ref
   40–59−0.4390.037139.328<0.0010.6450.6000.694
   60–790.0170.0360.2150.6431.0170.9481.091
   ≥801.2020.0381,015.921<0.0013.3253.0883.580
Year of diagnosis
   2010Ref
   2011−0.0130.0210.3750.5400.9870.9471.029
   2012−0.0460.0233.9180.0480.9550.9120.999
   2013−0.0370.0262.0420.1530.9640.9171.014
   2014−0.0310.0301.0870.2970.9700.9151.028
   2015−0.0440.0381.3300.2490.9570.8881.031
Race
   AsianRef
   White0.4650.041130.719<0.0011.5911.4701.723
   Hispanic0.4880.034204.199<0.0011.6281.5231.741
   American Indian0.7290.09657.237<0.0012.0731.7162.503
   Black0.9280.038597.119<0.0012.5312.3492.726
Marital status
   MarriedRef
   Separated/divorced0.3960.023296.106<0.0011.4861.4201.555
   Single0.4430.021433.584<0.0011.5571.4931.623
   Widowed1.0290.0183,130.847<0.0012.7972.6982.900
T stage
   Tis & T1Ref
   T20.8330.0172,424.379<0.0012.3002.2252.378
   T31.3910.0243,333.377<0.0014.0193.8344.213
   T42.2200.0267,186.235<0.0019.2048.7449.689
N stage
   N0Ref
   N10.7750.0172,028.604<0.0012.1712.0992.245
   N21.3320.0243,126.498<0.0013.7873.6153.968
   N31.8580.0255,660.915<0.0016.4106.1076.728
Tumor size (cm)
   <4Ref
   ≥41.0150.065246.547<0.0012.7602.4323.133
Grade
   IRef
   II0.4110.024293.627<0.0011.5081.4391.581
   III1.0790.0232,175.440<0.0012.9422.8123.079
   IV1.2780.099165.585<0.0013.5882.9544.359
Type of reporting source item
   Hospital inpatientsRef
   Other−0.1900.03923.731<0.0010.8270.7660.893
Regional nodes positive
   NoRef
   Yes1.0240.0154,726.128<0.0012.7852.7052.867
Laterality
   LeftRef
   Right−0.0230.0152.4550.1170.9770.9491.006
First malignant primary indicator
   NoRef
   Yes−0.4910.018730.276<0.0010.6120.5900.634
Malignant tumors
   1Ref
   20.4810.017807.050<0.0011.6181.5651.672
   ≥30.8540.027970.342<0.0012.3482.2252.478

S.E, standard error; HR, hazard ratio; CI, confidence interval.

S.E, standard error; HR, hazard ratio; CI, confidence interval. Multivariate Cox regression analysis was used to assess the predictive factors of the mortality of patients with breast cancer, and the details are listed in . The findings showed that compared with women aged 20–39 years, the risk of cancer mortality in women aged 60–79 years [hazard ratio (HR) =1.484, 95% confidence interval (CI): 1.380 to 1.596] and ≥80 years (HR =3.872, 95% CI: 3.575 to 4.195) was higher, while the risk of cancer mortality in those aged 40–59 years was lower (HR =0.856, 95% CI: 0.796 to 0.922). The mortality risks of White (HR =1.456, 95% CI: 1.362 to 1.558), Hispanic (HR =1.462, 95% CI: 1.349 to 1.583), American Indian (HR =1.843, 95% CI: 1.526 to 2.227), and Black (HR =1.924, 95% CI: 1.785 to 2.074) patients were higher than Asian patients, respectively. Compared with married cases, the mortality risks of single, divorced/separated, or widowed patients were higher, with HR values of 1.285 (95% CI: 1.228 to 1.344), 1.352 (95% CI: 1.295 to 1.411), and 1.530 (95% CI: 1.469 to 1.593), respectively. Breast cancer patients with tumor size ≥4 cm had a high mortality risk compared to those with tumor size <4 cm (HR =1.322, 95% CI: 1.163 to 1.503). The first malignant primary indicator (HR =1.064, 95% CI: 1.011 to 1.120), T stage (T2, HR =1.608, 95% CI: 1.551 to 1.667; T3, HR =2.328, 95% CI: 2.210 to 2.453; T4, HR =4.110, 95% CI: 3.879 to 4.354), N stage (N2, HR =1.512, 95% CI: 1.315 to 1.738; N3, HR =2.189, 95% CI: 1.902 to 2.519), and tumor grade (II, HR =1.167, 95% CI: 1.112 to 1.224; III, HR =2.037, 95% CI: 1.942 to 2.136; IV, HR =2.411, 95% CI: 1.983 to 2.931) were associated with the mortality of women with breast cancer. Patients with positive regional nodes had a higher risk of death than in patients without (HR =1.596, 95% CI: 1.401 to 1.819). When the number of malignant tumors was 2 (HR =1.577, 95% CI: 1.510 to 1.646) or ≥3 (HR =2.223, 95% CI: 2.077 to 2.380), patients had a higher risk of death.
Table 3

Multivariate Cox regression analysis for the mortality of breast cancer patients

VariablesβS.Eχ2PHR95% CI
LowerUpper
Age at diagnosis (years)
   20–39Ref
   40–59−0.1550.03717.099<0.0010.8560.7960.922
   60–790.3950.037112.671<0.0011.4841.3801.596
   ≥801.3540.0411,103.175<0.0013.8723.5754.195
Race
   AsianRef
   White0.3760.034120.320<0.0011.4561.3621.558
   Hispanic0.3790.04186.893<0.0011.4621.3491.583
   American Indian0.6120.09640.232<0.0011.8431.5262.227
   Black0.6540.038291.673<0.0011.9241.7852.074
Marital status
   MarriedRef
   Separated/divorced0.2500.023116.634<0.0011.2851.2281.344
   Single0.3010.022191.052<0.0011.3521.2951.411
   Widowed0.4250.021418.100<0.0011.5301.4691.593
Tumor size (cm)
   <4Ref
   ≥40.2790.06618.150<0.0011.3221.1631.503
First malignant primary indicator
   NoRef
   Yes0.0620.0265.6360.0181.0641.0111.120
Regional nodes positive
   NoRef
   Yes0.4680.06749.407<0.0011.5961.4011.819
T stage
   Tis & T1Ref
   T20.4750.018665.191<0.0011.6081.5511.667
   T30.8450.0271,010.133<0.0012.3282.2102.453
   T41.4130.0292,299.684<0.0014.1103.8794.354
N stage
   N0Ref
   N10.0780.0681.3320.2491.0820.9471.236
   N20.4130.07133.834<0.0011.5121.3151.738
   N30.7840.072119.696<0.0012.1891.9022.519
Grade
   IRef
   II0.1540.02440.042<0.0011.1671.1121.224
   III0.7110.024851.166<0.0012.0371.9422.136
   IV0.8800.10077.918<0.0012.4111.9832.931
Malignant tumors
   1Ref
   20.4550.022431.758<0.0011.5771.5101.646
   ≥30.7990.035529.101<0.0012.2232.0772.380

S.E, standard error; HR, hazard ratio; CI, confidence interval.

S.E, standard error; HR, hazard ratio; CI, confidence interval.

The predictive performance of the nomogram

The variables with statistical significance obtained from the multivariate Cox regression analysis were utilized to construct a nomogram to predict the 3- and 5-year survival rates of breast cancer patients, including age at diagnosis, marital status, race, T stage, N stage, grade, regional nodes positive, first malignant primary indicator, tumor size, and number of malignant tumors (). The C-index of our nomogram using the training set was 0.782, with a standard error of 0.002. The C-index in the validation set was 0.778, with a standard error of 0.003. The ROC curves on the 3- and 5-year survival prediction were shown in . The area under the curve (AUC) values of the nomogram for predicting the 3- and 5-year survival of breast cancer were 0.770 and 0.756, respectively. There were no differences between the training and validation sets (Z=0.383, P=0.998), indicating that the nomogram had good predictive power. The calibration curves of the nomogram for predicting the 3- and 5-year survival rates of breast cancer patients are shown in (the training set) and (the validation set).
Figure 1

The nomogram for the 3- and 5-year survival prediction among breast cancer patients.

Figure 2

The ROC curves on the 3- and 5-year survival prediction of breast cancer. (A) Three-year survival; (B) 5-year survival. ROC, receiver operating characteristic.

Figure 3

The calibration curves of the nomogram for predicting the survival rates of breast cancer patients in the training set. (A) Three-year survival; (B) 5-year survival.

Figure 4

The calibration curves of the nomogram for predicting the survival rates of breast cancer patients in the validation set. (A) Three-year survival; (B) 5-year survival.

The nomogram for the 3- and 5-year survival prediction among breast cancer patients. The ROC curves on the 3- and 5-year survival prediction of breast cancer. (A) Three-year survival; (B) 5-year survival. ROC, receiver operating characteristic. The calibration curves of the nomogram for predicting the survival rates of breast cancer patients in the training set. (A) Three-year survival; (B) 5-year survival. The calibration curves of the nomogram for predicting the survival rates of breast cancer patients in the validation set. (A) Three-year survival; (B) 5-year survival.

The predictive effect of the nomogram in different races

The patients from the validation set were classified into 5 risk subgroups based on race. As shown in , the C-index values of our nomogram in the Asian, White, Hispanic, American Indian, and Black groups were 0.816, 0.775, 0.773, 0.734, and 0.750, respectively. The predictive effect of the nomogram in Asian cases was superior to that in White (Z=3.927, P<0.001), Hispanic (Z=3.196, P=0.001), American Indian (Z=2.253, P=0.024), and Black (Z=5.407, P<0.001) cases with breast cancer.
Table 4

The predictive performance of the nomogram in different races

RaceC-indexS.EZP
Asian0.8160.010
White0.7750.0033.927<0.001
Hispanic0.7730.0093.1960.001
American Indian0.7340.0352.2530.024
Black0.7500.0075.407<0.001

S.E, standard error.

S.E, standard error.

Discussion

In the present study, 275,812 patients with breast cancer were screened from the SEER database. We found that the age at diagnosis, marital status, race, T stage, N stage, grade, regional nodes positive, first malignant primary indicator, tumor size, and number of malignant tumors were independent predictors for the death of patients with breast cancer. A nomogram that contained the predictive factors associated with breast cancer death was established. The C-indexes of the nomogram for the training and validation sets were 0.782 and 0.778, respectively. Our findings indicated that our nomogram had predictive ability for predicting the 3- and 5-year survival rates of breast cancer patients. Previous studies have reported that nomograms play an important role in the personalized prediction of cancer (22-25), which helps clinicians optimize the therapeutic protocols of patients on the basis of individual information. The nomogram is a visual tool to effectively reflect the results of Cox regression analysis. The most valuable aspect of a nomogram is in predicting the outcomes, and the length of the lines can be used to indicate the impact of different variables on the outcomes, as well as the effects of various values of these variables on the outcomes. The larger the C-index, the more accurate the predictive effectiveness of the nomogram (26). However, the predictive power of the nomogram is limited in predicting the mortality risk of breast cancer patients, especially for different ethnicities. In this study, we developed a nomogram according to the related predictive factors from the general information and tumor characteristics of patients in the SEER database to estimate the 3- and 5-year survival rates of patients with breast cancer, and further assessed the predictive performance of the nomogram for the survival of breast cancer patients with different ethnicities. The predictive factors were considered in the nomogram on the basis of our findings obtained from the univariate and multivariate analyses. We discovered that the risk of cancer death in patients aged 40–59, 60–79, and >80 years was higher than that in the 20–39 years age group. It was suggested that the risk of dying from breast cancer gradually increased with the increase of age. This may be related to less aggressive treatments, poor endurance of stress, impaired compensatory mechanisms, or other chronic diseases in the elderly, whereas younger patients may often exhibit more aggressive biological behaviors of malignant tumors (27). Cases with malignant tumors had a higher risk of death than cases with in situ tumors. The occurrence of lymph node metastases in patients resulted in a higher risk of death than in patients without lymph node metastases. Early evidence showed that lymph node metastases is a common clinical feature in the progression of breast cancer (28-30). Compared with patients with grade I tumors, the mortality risks of those with grade II, III, and IV tumors were higher, with HR values of 1.167, 2.037, and 2.411, respectively. Our results found that tumor size ≥4 cm could increase the mortality risk of breast cancer. When the number of malignant tumors was over 2, patients had a higher risk of death. It was indicated that high tumor grade, large tumor size, and a large number of tumors were associated with cancer death. Validation of the nomogram was carried out through randomly selecting 275,812 patients with breast cancer. The nomogram showed greater discrimination for the prediction of the survival time of patients, and the C-index for the validation set was 0.778, indicating that our nomogram had good effectiveness in predicting the prognosis of breast cancer patients. Furthermore, patients in the validation set were further classified into different ethnicities. Ethnicity is an independent risk factor of the occurrence of breast cancer and affects the prognosis of breast cancer. In the current study, we discovered that the predictive performance of the nomogram in Asian patients was superior to that in other ethnicities, which may suggest that the prognosis of breast cancer is racially heterogeneous. Hence, the specific race-based risk and prognostic prediction model was more precise and personalized for outcome prediction. This was a population-based study that covered U.S. cancer registry data of high quality. The SEER database incorporates relatively complete follow-up information for breast cancer, indicating that the death-related information is reliable. Nonetheless, some limitations should be noted. More information on systemic therapy, such as targeted therapy records and chemotherapy protocols, was not available in the SEER database, and the predictive factors were only limited to this database. Additionally, this nomogram was developed using retrospective data, and further studies with external validation in large and prospective cohorts are needed.

Conclusions

In summary, we established and validated a nomogram based on the SEER database to predict the 3- and 5-year survival of breast cancer patients. Our nomogram had the effectiveness for predicting survival, especially in the Asian population, which may provide clinicians with useful guidelines for the individualized treatment of patients. The article’s supplementary files as
  30 in total

Review 1.  Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors.

Authors:  F E Harrell; K L Lee; D B Mark
Journal:  Stat Med       Date:  1996-02-28       Impact factor: 2.373

Review 2.  Nomograms in oncology: more than meets the eye.

Authors:  Vinod P Balachandran; Mithat Gonen; J Joshua Smith; Ronald P DeMatteo
Journal:  Lancet Oncol       Date:  2015-04       Impact factor: 41.316

3.  Development and validation of a gene expression-based signature to predict distant metastasis in locoregionally advanced nasopharyngeal carcinoma: a retrospective, multicentre, cohort study.

Authors:  Xin-Ran Tang; Ying-Qin Li; Shao-Bo Liang; Wei Jiang; Fang Liu; Wen-Xiu Ge; Ling-Long Tang; Yan-Ping Mao; Qing-Mei He; Xiao-Jing Yang; Yuan Zhang; Xin Wen; Jian Zhang; Ya-Qin Wang; Pan-Pan Zhang; Ying Sun; Jing-Ping Yun; Jing Zeng; Li Li; Li-Zhi Liu; Na Liu; Jun Ma
Journal:  Lancet Oncol       Date:  2018-02-07       Impact factor: 41.316

Review 4.  Breast cancer in adolescents and young adults.

Authors:  Rebecca H Johnson; Carey K Anders; Jennifer K Litton; Kathryn J Ruddy; Archie Bleyer
Journal:  Pediatr Blood Cancer       Date:  2018-08-28       Impact factor: 3.167

Review 5.  Breast cancer metastasis: markers and models.

Authors:  Britta Weigelt; Johannes L Peterse; Laura J van 't Veer
Journal:  Nat Rev Cancer       Date:  2005-08       Impact factor: 60.716

6.  Cancer statistics, 2022.

Authors:  Rebecca L Siegel; Kimberly D Miller; Hannah E Fuchs; Ahmedin Jemal
Journal:  CA Cancer J Clin       Date:  2022-01-12       Impact factor: 508.702

7.  Differences in Breast Cancer Screening Practices by Diabetes Status and Race/Ethnicity in the United States.

Authors:  Sydney E Kim; Alexandra E Bachorik; Kimberly A Bertrand; Christine M Gunn
Journal:  J Womens Health (Larchmt)       Date:  2021-12-17       Impact factor: 3.017

8.  Development and Validation of Novel Nomograms to Predict the Overall Survival and Cancer-Specific Survival of Cervical Cancer Patients With Lymph Node Metastasis.

Authors:  Jianying Yi; Zhili Liu; Lu Wang; Xingxin Zhang; Lili Pi; Chunlei Zhou; Hong Mu
Journal:  Front Oncol       Date:  2022-03-17       Impact factor: 5.738

9.  EORTC Nomograms and Risk Groups for Predicting Recurrence, Progression, and Disease-specific and Overall Survival in Non-Muscle-invasive Stage Ta-T1 Urothelial Bladder Cancer Patients Treated with 1-3 Years of Maintenance Bacillus Calmette-Guérin.

Authors:  Samantha Cambier; Richard J Sylvester; Laurence Collette; Paolo Gontero; Maurizio A Brausi; George van Andel; Wim J Kirkels; Fernando Calais Da Silva; Willem Oosterlinck; Stephen Prescott; Ziya Kirkali; Philip H Powell; Theo M de Reijke; Levent Turkeri; Sandra Collette; Jorg Oddens
Journal:  Eur Urol       Date:  2015-07-23       Impact factor: 20.096

10.  Survival Analysis and a Novel Nomogram Model for Progression-Free Survival in Patients with Prostate Cancer.

Authors:  Yuefu Han; Xingqiao Wen; Dong Chen; Xiaojuan Li; Qu Leng; Yuehui Wen; Jun Li; Weian Zhu
Journal:  J Oncol       Date:  2022-03-22       Impact factor: 4.375

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.