| Literature DB >> 36248962 |
Chaofan Li1, Mengjie Liu1, Jia Li1, Xixi Zhao2, Yusheng Wang3, Xi Chen1, Weiwei Wang1, Shiyu Sun1, Cong Feng1, Yifan Cai1, Fei Wu1, Chong Du1, Yinbin Zhang1, Shuqun Zhang1, Jingkun Qu1.
Abstract
Background: Breast cancer (BC) survivors have an increased risk of developing second primary cancers (SPCs); however, it is still unclear if metastasis is a risk factor for developing SPCs. Usually, long-term cancer survivors face an increased risk of developing SPCs; however, less attention has been paid to SPCs in patients with metastatic cancer as the survival outcomes of the patients are greatly reduced.Entities:
Keywords: XGBoost algorithm; breast cancer; metastasis; second primary cancers; standardized incidence ratio; standardized mortality ratio
Year: 2022 PMID: 36248962 PMCID: PMC9556865 DOI: 10.3389/fonc.2022.942320
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Figure 1The flowchart shows the process of conducting the study and statistical analysis.
Standardized incidence ratios of SPCs in BC patients by characteristic.
| characteristic | NMBC (N = 11975) | MBC (N = 5102) | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| regional node metastasis (N = 4537) | distant metastasis (N = 565) | |||||||||||||
| Observe | Expected | SIR (O/E) | 95%Cl | Observe | Expected | SIR (O/E) | 95%Cl | Observe | Expected | SIR(O/E) | 95%Cl | |||
| Age | ||||||||||||||
| <40 | 256 | 98.38 | 2.60* | 2.29-2.94 | 176 | 72.26 | 2.44* | 2.09-2.82 | 36 | 7.82 | 4.60* | 3.24-6.38 | ||
| 40-49 | 1123 | 846.93 | 1.33* | 1.25-1.41 | 675 | 443.28 | 1.52* | 1.41-1.64 | 77 | 32.38 | 2.38* | 1.79-2.91 | ||
| 50-59 | 2335 | 2223.68 | 1.05* | 1.01-1.09 | 1131 | 964.7 | 1.17* | 1.11-1.24 | 132 | 89.08 | 1.48* | 1.24-1.76 | ||
| 60-69 | 3844 | 4195.48 | 0.92* | 0.89-0.95 | 1322 | 1387.52 | 0.95 | 0.90-1.01 | 157 | 145.14 | 1.08 | 0.92-1.27 | ||
| >=70 | 4417 | 4459.13 | 0.99 | 0.96-1.02 | 1233 | 1334.39 | 0.92* | 0.87-0.98 | 163 | 172.73 | 0.94 | 0.81-1.10 | ||
| Subtype | ||||||||||||||
| luminalA | 8726 | 8735.56 | 1.00 | 0.98-1.02 | 3216 | 3012.49 | 1.07* | 1.03-1.11 | 338 | 270.57 | 1.25* | 1.12-1.39 | ||
| luminalB | 867 | 934.82 | 0.93* | 0.87-0.99 | 421 | 449.18 | 0.94 | 0.85-1.03 | 89 | 72.23 | 1.23 | 0.99-1.52 | ||
| Her2 | 351 | 369.95 | 0.95 | 0.85-1.05 | 200 | 181.38 | 1.10 | 0.96-1.27 | 39 | 30.05 | 1.3 | 0.92-1.77 | ||
| triple-negative | 1310 | 1086.7 | 1.21* | 1.14-1.27 | 468 | 354.66 | 1.32* | 1.20-1.44 | 36 | 24.32 | 1.48* | 1.04-2.05 | ||
| unknown | 721 | 696.57 | 1.04 | 0.96-1.11 | 232 | 204.43 | 1.13 | 0.99-1.29 | 63 | 49.98 | 1.22 | 0.94-1.57 | ||
| Race | ||||||||||||||
| white | 9718 | 9984.03 | 0.97* | 0.95-0.99 | 3562 | 3455.56 | 1.03 | 1.00-1.07 | 429 | 367.23 | 1.17* | 1.06-1.28 | ||
| black | 1259 | 1059.11 | 1.19* | 1.12-1.26 | 587 | 479.06 | 1.23* | 1.13-1.33 | 97 | 55.81 | 1.74* | 1.41-2.12 | ||
| others | 998 | 728.47 | 1.37* | 1.29-1.46 | 388 | 253.59 | 1.53* | 1.38-1.69 | 39 | 22.53 | 1.73* | 1.23-2.37 | ||
| Grade | ||||||||||||||
| well differentiated | 3364 | 3392.16 | 0.99 | 0.96-1.03 | 603 | 621.15 | 0.97 | 0.89-1.05 | 40 | 36.47 | 1.1 | 0.78-1.49 | ||
| moderately differentiated | 5022 | 5114.02 | 0.98 | 0.96-1.01 | 2056 | 1946.67 | 1.06* | 1.01-1.10 | 233 | 169.48 | 1.37* | 1.20-1.56 | ||
| poorly differentiated | 3057 | 2809.97 | 1.09* | 1.05-1.13 | 1654 | 1457.70 | 1.13* | 1.08-1.19 | 178 | 136.29 | 1.31* | 1.12-1.52 | ||
| unknown | 532 | 507.45 | 1.05 | 0.96-1.14 | 224 | 176.62 | 1.27* | 1.11-1.45 | 114 | 104.92 | 1.09 | 0.88-1.30 | ||
| Histologic type | ||||||||||||||
| IDC | 8826 | 8735.88 | 1.01 | 0.99-1.03 | 3288 | 3081.97 | 1.07* | 1.03-1.10 | 359 | 294.55 | 1.22* | 1.10-1.35 | ||
| ILC | 1060 | 1117.29 | 0.95 | 0.89-1.01 | 547 | 492.46 | 1.11* | 1.02-1.21 | 81 | 54.07 | 1.50* | 1.19-1.86 | ||
| IDC and ILC(mixed) | 645 | 647.87 | 1.00 | 0.92-1.08 | 332 | 309.36 | 1.07 | 0.96-1.20 | 28 | 21.57 | 1.30 | 0.86-1.87 | ||
| others | 1444 | 1322.56 | 1.09* | 1.04-1.15 | 370 | 318.36 | 1.16* | 1.05-1.29 | 97 | 76.71 | 1.26* | 1.01-1.52 | ||
| T stage | ||||||||||||||
| T1 | 7731 | 7894.42 | 0.98 | 0.96-1.00 | 1479 | 1486.85 | 0.99 | 0.94-1.05 | 71 | 44.25 | 1.60* | 1.22-1.98 | ||
| T2 | 2462 | 2263.48 | 1.09* | 1.05-1.13 | 1744 | 1698.44 | 1.03 | 0.98-1.08 | 135 | 112.47 | 1.20* | 1.01-1.42 | ||
| T3 | 301 | 247.10 | 1.22* | 1.08-1.36 | 475 | 388.06 | 1.22* | 1.12-1.34 | 79 | 57.25 | 1.38* | 1.09-1.72 | ||
| T4 | 88 | 73.16 | 1.2 | 0.96-1.48 | 237 | 159.24 | 1.49* | 1.30-1.69 | 115 | 100.59 | 1.14 | 0.95-1.37 | ||
| Tx | 1393 | 1332.80 | 1.03 | 0.98-1.09 | 590 | 452.50 | 1.30* | 1.20-1.41 | 165 | 132.53 | 1.25* | 1.07-1.45 | ||
| N stage | ||||||||||||||
| N0 | / | / | / | / | / | / | / | / | 110 | 90.12 | 1.22 | 0.99-1.45 | ||
| N1 | / | / | / | / | 2854 | 2853.02 | 1.00 | 0.96-1.04 | 203 | 159.87 | 1.27* | 1.10-1.44 | ||
| N2 | / | / | / | / | 703 | 620.86 | 1.13* | 1.05-1.22 | 44 | 41.71 | 1.05 | 0.76-1.42 | ||
| N3 | / | / | / | / | 428 | 312.98 | 1.37* | 1.24-1.50 | 66 | 46.89 | 1.41* | 1.09-1.76 | ||
| Nx | / | / | / | / | 552 | 415.29 | 1.33* | 1.22-1.44 | 142 | 108.48 | 1.31* | 1.11-1.54 | ||
| Therapy | ||||||||||||||
| without radiotherapy or chemotherapy | 4017 | 3852.37 | 1.04* | 1.01-1.08 | 839 | 795.31 | 1.05 | 0.98-1.13 | 159 | 140.98 | 1.13 | 0.96-1.33 | ||
| radiotherapy | 5132 | 5193.57 | 0.99 | 0.96-1.02 | 752 | 719.09 | 1.05 | 0.97-1.12 | 79 | 65.73 | 1.20 | 0.95-1.48 | ||
| chemotherapy | 1250 | 1207.95 | 1.03 | 0.98-1.09 | 980 | 931.43 | 1.05 | 0.99-1.12 | 184 | 155.66 | 1.18* | 1.02-1.37 | ||
| radiotherapy and chemotherapy | 1576 | 1569.70 | 1.00 | 0.96-1.05 | 1966 | 1756.32 | 1.12* | 1.07-1.17 | 143 | 84.1 | 1.70* | 1.44-2.00 | ||
*p<0.05 compared to the general population.
Standardized incidence ratios of SPCs in distant MBC patients by characteristics.
| Bone metastasis (N = 408) | Lung metastasis (N = 173) | Liver metastasis (N = 112) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Observe | Expected | SIR (O/E) | 95%Cl | Observe | Expected | SIR (O/E) | 95%Cl | Observe | Expected | SIR (O/E) | 95%Cl | |
| Age | ||||||||||||
| <40 | 26 | 5.74 | 4.53* | 2.96-6.64 | 9 | 1.57 | 5.74* | 2.63-10.90 | 7 | 2.61 | 2.69* | 1.08-5.54 |
| 40-49 | 56 | 24.48 | 2.29* | 1.73-2.97 | 29 | 7.75 | 3.74* | 2.51-5.38 | 16 | 8.78 | 1.82* | 1.04-2.96 |
| 50-59 | 99 | 68.34 | 1.45* | 1.18-1.76 | 36 | 24.70 | 1.46* | 1.02-2.02 | 37 | 23.15 | 1.60* | 1.13-2.20 |
| 60-69 | 112 | 111.16 | 1.01 | 0.83-1.21 | 41 | 45.78 | 0.90 | 0.64-1.21 | 28 | 26.89 | 1.04 | 0.69-1.50 |
| >=70 | 115 | 128.37 | 0.90 | 0.74-1.08 | 58 | 57.20 | 1.01 | 0.77-1.31 | 24 | 24.52 | 0.98 | 0.63-1.46 |
| Subtype | ||||||||||||
| Luminal A | 256 | 221.26 | 1.20* | 1.06-1.35 | 101 | 76.90 | 1.31* | 1.07-1.60 | 50 | 36.38 | 1.37* | 1.02-1.81 |
| luminal B | 68 | 50.21 | 1.35* | 1.05-1.72 | 19 | 24.09 | 0.79 | 0.47-1.23 | 27 | 22.46 | 1.20 | 0.79-2.75 |
| Her2 | 19 | 15.94 | 1.19 | 0.72-1.86 | 14 | 10.35 | 1.35 | 0.74-2.27 | 18 | 13.61 | 1.32 | 0.78-2.09 |
| triple-negative | 23 | 12.58 | 1.83* | 1.16-2.74 | 12 | 11.22 | 1.07 | 0.55-1.87 | 7 | 5.74 | 1.22 | 0.49-2.51 |
| unknown | 42 | 48.36 | 0.87 | 0.60-1.22 | 27 | 14.42 | 1.87* | 1.23-2.72 | 10 | 7.75 | 1.29 | 0.62-2.37 |
| Race | ||||||||||||
| white | 318 | 281.08 | 1.13* | 1.01-1.26 | 124 | 108.77 | 1.14 | 0.95-1.36 | 81 | 68.94 | 1.17 | 0.93-1.46 |
| black | 59 | 39.60 | 1.49* | 1.13-1.92 | 40 | 19.17 | 2.09* | 1.49-2.84 | 17 | 11.50 | 1.48 | 0.86-2.37 |
| others | 31 | 16.53 | 1.88* | 1.27-2.66 | 9 | 8.29 | 1.09 | 0.50-2.06 | 14 | 5.22 | 2.68* | 1.47-4.50 |
| Grade | ||||||||||||
| well differentiated | 28 | 30.92 | 0.91 | 0.60-1.31 | 11 | 8.41 | 1.31 | 0.65-2.34 | 7 | 4.65 | 1.50 | 0.60-3.10 |
| moderately differentiated | 177 | 135.40 | 1.31* | 1.12-1.51 | 70 | 49.40 | 1.42* | 1.10-1.79 | 40 | 27.81 | 1.44* | 1.03-1.96 |
| poorly differentiated | 121 | 88.45 | 1.37* | 1.14-1.63 | 52 | 49.25 | 1.06 | 0.79-1.38 | 48 | 35.25 | 1.36* | 1.00-1.81 |
| unknown | 82 | 83.32 | 0.98 | 0.78-1.22 | 40 | 29.92 | 1.34 | 0.96-1.82 | 17 | 18.22 | 0.93 | 0.54-1.49 |
| Histologic type | ||||||||||||
| IDC | 255 | 212.71 | 1.20* | 1.06-1.36 | 122 | 102.00 | 1.20 | 0.99-1.43 | 69 | 61.98 | 1.11 | 0.87-1.41 |
| ILC | 72 | 49.52 | 1.45* | 1.14-1.83 | 11 | 5.02 | 2.19* | 1.09-3.92 | 13 | 6.94 | 1.87 | 1.00-3.20 |
| IDC and ILC(mixed) | 22 | 18.71 | 1.18 | 0.74-1.78 | 3 | 4.12 | 0.73 | 0.15-2.13 | 5 | 3.05 | 1.64 | 0.53-3.83 |
| others | 59 | 57.16 | 1.03 | 0.79-1.33 | 37 | 25.85 | 1.43* | 1.01-1.97 | 25 | 13.97 | 1.79* | 1.16-2.64 |
| T stage | ||||||||||||
| T1 | 47 | 34.63 | 1.36 | 1.00-1.80 | 18 | 9.73 | 1.85* | 1.10-2.92 | 15 | 8.40 | 1.79 | 1.00-2.95 |
| T2 | 94 | 84.94 | 1.11 | 0.89-1.35 | 37 | 29.86 | 1.24 | 0.87-1.71 | 29 | 21.06 | 1.38 | 0.92-1.98 |
| T3 | 63 | 42.44 | 1.48* | 1.14-1.90 | 20 | 16.54 | 1.21 | 0.74-1.87 | 10 | 10.67 | 0.94 | 0.45-1.72 |
| T4 | 79 | 74.07 | 1.07 | 0.84-1.33 | 45 | 41.89 | 1.07 | 0.78-1.44 | 30 | 20.26 | 1.48 | 1.00-2.11 |
| Tx | 125 | 102.02 | 1.23* | 1.02-1.46 | 53 | 38.97 | 1.36* | 1.02-1.78 | 28 | 25.55 | 1.10 | 0.73-1.58 |
| N stage | ||||||||||||
| N0 | 71 | 69.28 | 1.02 | 0.80-1.29 | 41 | 22.68 | 1.81* | 1.30-2.45 | 19 | 14.98 | 1.27 | 0.76-1.98 |
| N1 | 141 | 119.27 | 1.18 | 1.00-1.39 | 57 | 52.10 | 1.10 | 0.83-1.42 | 45 | 32.85 | 1.37 | 1.00-1.83 |
| N2 | 31 | 29.99 | 1.03 | 0.70-1.47 | 14 | 13.75 | 1.02 | 0.56-1.71 | 12 | 7.80 | 1.54 | 0.79-2.69 |
| N3 | 56 | 36.05 | 1.55* | 1.17-2.02 | 16 | 13.86 | 1.15 | 0.66-1.87 | 12 | 8.77 | 1.37 | 0.71-2.39 |
| Nx | 109 | 83.49 | 1.31* | 1.07-1.57 | 45 | 34.69 | 1.30 | 0.95-1.74 | 24 | 21.54 | 1.11 | 0.71-1.66 |
| Therapy | ||||||||||||
| without radiation or chemotherapy | 112 | 105.11 | 1.07 | 0.88-1.28 | 54 | 44.71 | 1.21 | 0.91-1.58 | 21 | 18.81 | 1.12 | 0.69-1.71 |
| radiation | 68 | 60.22 | 1.13 | 0.88-1.43 | 20 | 13.78 | 1.45 | 0.89-2.24 | 9 | 4.74 | 1.90 | 0.87-3.61 |
| chemotherapy | 112 | 107.5 | 1.04 | 0.86-1.25 | 58 | 57.14 | 1.02 | 0.77-1.31 | 57 | 44.83 | 1.27 | 0.96-1.65 |
| radiation and chemotherapy | 116 | 65.27 | 1.78* | 1.47-2.13 | 41 | 21.37 | 1.92* | 1.38-2.60 | 25 | 17.57 | 1.42 | 0.92-2.10 |
*p<0.05 compared to the general population.
Figure 2The patterns of second primary cancers (SPCs) in Breast cancer (BC) patients with different metastasis status (A) Calculated by dividing the number of SPC cases observed per unit by the total number of SPCs observed. The grey cells indicate tumor types that are less than 1% of the total number of SPCs. Percentage values are shown in . (B) Standard incidence rates (SIRs) are calculated as the ratio of the number of SPCs observed to the number of SPCs expected in the general population. Cells marked with an * symbol show a statistically significant association between first primary breast cancer (BC) and SPCs. Blue cells indicate SIRs that are statistically significantly lower than expected according to a qualifying statistical test (number of SPCs observed ≥ 5). Red cells indicate SIRs that are statistically significantly higher than expected according to a qualifying statistical test (number of SPCs observed ≥ 5). Gray cells indicate SIRs, which are not statistically significant or associations not tested due to the small number of SPCs observed. Point estimates and 95% CIs are shown in .
Figure 3The SIR of SPCs varies with the latency period *p < 0.05 compared to the general population. Point estimates and 95% CIs are shown in .
Figure 4The standard mortality rate (SMR) of SPCs in BC patients with different metastasis status. The SMR is calculated as the ratio of observed SPC deaths to the expected number of cancer deaths in the general population. The cells marked with an * symbol indicate statistically significant association between first primary breast cancer (BC) and SPCs. Blue cells indicate SMRs that are statistically significantly lower than expected according to a qualifying statistical test (number of SPCs observed ≥ 5). Red cells indicate SMRs that are statistically significantly higher than expected according to a qualifying statistical test (number of SPCs observed ≥ 5). Grey cells indicate SMRs that are not statistically significant, or no association was tested due to the small number of SPCs observed. Point estimates and 95% CIs are shown in .
Figure 5The overall survival of SPCs classified by the metastatic status of breast cancer. Site of SPCs: (A) breast, (B) ovary, (C) urinary, (D) colon and rectum, (E) non-Hodgkin lymphoma, (F) leukemia, (G) melanoma of skin, (H) corpus uteri, (I) thyroid, (J) lung and bronchus, (K) stomach, (L). pancreas. HR and 95% CIs are shown in .*p < 0.05; **p < 0.01; ***p < 0.001. N.S, not significant.
Figure 6XGBoost model evaluationROC: receiver operating characteristic curve; AUC: area under the curve (A) ROC curve for the 3-year prognostic model of XGBoost algorithm (test data), (B) ROC curve for the 5-year prognostic model of XGBoost algorithm (test data), (C) The ranking of clinical characteristics in terms of importance in the 3-year prognostic model. The x-axis of the graph is the feature importance score. The closer the score is to 1, the more important the feature in predicting the prognosis of the patient in the model. (D) The ranking of clinical characteristics in terms of importance in the 5-year prognostic model. The x-axis of the graph is the feature importance score. The closer the score is to 1, the more important the feature in predicting the prognosis of the patient in the model.
Performance of prognostic models built by machine learning algorithms on test sets.
| AUC | ||
|---|---|---|
| 3-year survival | 5-year survival | |
| XGBoost | 0.873 | 0.918 |
| ID3 | 0.763 | 0.782 |
| SVM | 0.689 | 0.729 |
| KNN | 0.712 | 0.798 |