Kaiyan Huang1, Jie Zhang1,2,3, Yushuai Yu1, Yuxiang Lin1,2,3, Chuangui Song4,5,6.
Abstract
PURPOSE: We aimed to analyze the impact of chemotherapy and to establish prognostic prediction models for early elderly triple-negative breast cancer (eTNBC) using machine learning.
Keywords: Breast cancer-specific survival; Elderly triple negative breast cancer; Machine learning; Overall survival; SEER database
Year: 2022 PMID: 35361134 PMCID: PMC8973884 DOI: 10.1186/s12877-022-02936-5
Source DB: PubMed Journal: BMC Geriatr ISSN: 1471-2318 Impact factor: 3.921
Baseline characteristics of patients with chemotherapy and no-chemotherapy
| 27(12–49) | 27(12–47) | 27(12–48) | ||||||
| 1257 | 48.8 | 1836 | 86.5 | 3093 | 65.9 | |||
| 1317 | 51.2 | 286 | 13.5 | 1603 | 34.1 | |||
| 2003 | 77.8 | 1636 | 77.1 | 3639 | 77.5 | 0.238 | ||
| 383 | 14.9 | 348 | 16.4 | 731 | 15.6 | |||
| 188 | 7.3 | 138 | 6.5 | 326 | 6.9 | |||
| 959 | 37.3 | 1085 | 51.1 | 2044 | 43.5 | |||
| 1615 | 62.7 | 1037 | 48.9 | 2652 | 56.5 | |||
| 707 | 27.5 | 388 | 18.3 | 1095 | 23.3 | |||
| 1867 | 72.5 | 1734 | 81.7 | 3601 | 76.7 | |||
| 1317 | 51.2 | 706 | 33.3 | 2023 | 43.1 | |||
| 960 | 37.3 | 999 | 47.1 | 1959 | 41.7 | |||
| 297 | 11.5 | 417 | 19.7 | 714 | 15.2 | |||
| 1435 | 55.7 | 910 | 42.9 | 2345 | 49.9 | |||
| 876 | 34.0 | 919 | 43.3 | 1795 | 38.2 | |||
| 150 | 5.8 | 148 | 7.0 | 298 | 6.3 | |||
| 113 | 4.4 | 145 | 6.8 | 258 | 5.5 | |||
| 2039 | 79.2 | 1290 | 60.8 | 3329 | 70.9 | |||
| 357 | 13.9 | 556 | 26.2 | 913 | 19.4 | |||
| 104 | 4.0 | 172 | 8.1 | 276 | 5.9 | |||
| 74 | 2.9 | 104 | 4.9 | 178 | 3.8 | |||
| 150 | 5.8 | 107 | 5.0 | 257 | 5.5 | 0.239 | ||
| 2424 | 94.2 | 2015 | 95.0 | 4439 | 94.5 | |||
| 1013 | 39.4 | 1179 | 55.6 | 2192 | 46.7 | |||
| 1561 | 60.6 | 943 | 44.4 | 2504 | 53.3 | |||
Abbreviation: AJCC American Joint Committee on Cancer, BCS Breast-conserving surgery, IQR Interquartile range
a Other includes American Indian/Alaskan Native, Asian/Pacific Islander, and unknown
b Not married includes divorced, separated, single (never married), unmarried or domestic partner, and widowed
c The P value of the chi-square test was calculated between the chemotherapy and no-chemotherapy groups; bold type indicates significance
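The chi-square comparison described in the footnote can be sketched for one row group of the table. The example below is a minimal illustration, not the authors' code: it takes the three-category row group reported with P = 0.238 (counts 2003/1636, 383/348, 188/138 in the two treatment-group columns, excluding the total column) and computes Pearson's chi-square statistic by hand. The function names are hypothetical, and the closed-form tail probability used here holds only for 2 degrees of freedom.

```python
import math


def chi_square_independence(table):
    """Return Pearson's chi-square statistic and degrees of freedom
    for an r x c contingency table given as a list of rows of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of row and column variables
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof


def p_value_two_dof(stat):
    """Upper-tail probability of a chi-square variable with exactly
    2 degrees of freedom, where the survival function is exp(-x / 2)."""
    return math.exp(-stat / 2.0)


# Counts copied from the baseline table (row group with P = 0.238).
counts = [
    [2003, 1636],
    [383, 348],
    [188, 138],
]

stat, dof = chi_square_independence(counts)
p = p_value_two_dof(stat)
print(f"chi-square = {stat:.3f}, dof = {dof}, P = {p:.3f}")
```

Under these assumptions the computed P value should land close to the tabulated 0.238. For tables with other degrees of freedom, a general chi-square survival function would be needed instead of the 2-dof shortcut.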
Multivariate Cox proportional hazard model of breast cancer-specific survival (BCSS) and overall survival (OS) in all patients
| Variables | BCSS | OS | |||
|---|---|---|---|---|---|
| Reference | Reference | ||||
| 1.315(1.111–1.557) | 1.629(1.429–1.856) | ||||
| Reference | Reference | ||||
| 1.062(0.902–1.250) | 0.472 | 1.120(0.985–1.272) | 0.084 | ||
| Reference | Reference | ||||
| 1.068(0.875–1.305) | 0.516 | 1.070(0.914–1.252) | 0.399 | ||
| 0.706(0.507–0.983) | 0.709(0.549–0.916) | ||||
| Reference | Reference | ||||
| 1.510(1.222–1.865) | 1.344(1.152–1.568) | ||||
| Reference | Reference | ||||
| 3.982(3.137–5.055) | 2.602(2.221–3.048) | ||||
| 11.609(9.015–14.949) | 6.528(5.468–7.793) | ||||
| Reference | Reference | ||||
| 0.246(0.198–0.304) | 0.293(0.245–0.351) | ||||
| Reference | Reference | ||||
| 0.626(0.529–0.741) | 0.565(0.495–0.645) | ||||
| Reference | Reference | ||||
| 0.656(0.553–0.779) | 0.561(0.488–0.644) | ||||
Abbreviation: 70–79 70–79 years old, 80+ More than 80 years old, BCS Breast-conserving surgery, HR Hazard ratio
a Not married includes divorced, separated, single (never married), unmarried or domestic partner, and widowed
b Other includes American Indian/Alaskan Native, Asian/Pacific Islander, and unknown. Bold type indicates significance
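The hazard ratios, 95% confidence intervals, and P values in the Cox table are linked through the log hazard ratio. The sketch below is an illustration under stated assumptions, not the authors' procedure: it assumes each interval is a Wald interval of the form exp(beta ± 1.96·SE), recovers SE from the interval width, and derives a two-sided Wald P value. The function name `wald_p_from_hr` is hypothetical. The inputs are the two rows of the table that report explicit P values, 1.062(0.902–1.250) with P = 0.472 and 1.120(0.985–1.272) with P = 0.084.

```python
import math


def wald_p_from_hr(hr, lower, upper, z_crit=1.96):
    """Recover the standard error, z statistic, and two-sided Wald P value
    from a hazard ratio and its 95% confidence interval.

    Assumes the interval was built as exp(beta +/- z_crit * SE)."""
    log_hr = math.log(hr)
    # Interval width on the log scale equals 2 * z_crit * SE
    se = (math.log(upper) - math.log(lower)) / (2.0 * z_crit)
    z = log_hr / se
    # Two-sided normal tail probability: 2 * (1 - Phi(|z|)) = erfc(|z| / sqrt(2))
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return se, z, p


# Values copied from the multivariate Cox table.
_, _, p_bcss = wald_p_from_hr(1.062, 0.902, 1.250)
_, _, p_os = wald_p_from_hr(1.120, 0.985, 1.272)
print(f"BCSS: P = {p_bcss:.3f}; OS: P = {p_os:.3f}")
```

Because the tabulated hazard ratios and limits are rounded to three decimals, the recovered P values should agree with the reported ones only approximately.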
Baseline characteristics of patients with chemotherapy and no-chemotherapy in PSM group
| 25(9–49) | 26(11–48) | 26(10–48.75) | ||||||
| 1046 | 78.6 | 1055 | 79.3 | 2101 | 79.0 | 0.703 | ||
| 284 | 21.4 | 275 | 20.7 | 559 | 21.0 | |||
| 982 | 73.8 | 982 | 73.8 | 1964 | 73.8 | 1.000 | ||
| 248 | 18.7 | 248 | 18.7 | 496 | 18.7 | |||
| 100 | 7.5 | 100 | 7.5 | 200 | 7.5 | |||
| 608 | 45.7 | 632 | 47.5 | 1240 | 46.6 | 0.371 | ||
| 722 | 54.3 | 698 | 52.5 | 1420 | 53.4 | |||
| 259 | 19.5 | 287 | 21.6 | 546 | 20.5 | 0.195 | ||
| 1071 | 80.5 | 1043 | 78.4 | 2114 | 79.5 | |||
| 647 | 48.6 | 665 | 50.0 | 1312 | 49.3 | 0.645 | ||
| 494 | 37.1 | 491 | 36.9 | 985 | 37.0 | |||
| 189 | 14.2 | 174 | 13.1 | 363 | 13.6 | |||
| 727 | 54.7 | 723 | 54.4 | 1450 | 54.5 | 0.258 | ||
| 453 | 34.1 | 436 | 32.8 | 889 | 33.4 | |||
| 83 | 6.2 | 80 | 6.0 | 163 | 6.1 | |||
| 67 | 5.0 | 91 | 6.8 | 158 | 5.9 | |||
| 990 | 74.4 | 1030 | 77.4 | 2020 | 75.9 | 0.146 | ||
| 218 | 16.4 | 197 | 14.8 | 415 | 15.6 | |||
| 67 | 5.0 | 66 | 4.9 | 133 | 5.0 | |||
| 55 | 4.1 | 37 | 2.9 | 92 | 3.5 | |||
| 70 | 5.3 | 67 | 5.0 | 137 | 5.2 | 0.861 | ||
| 1260 | 94.7 | 1263 | 95.0 | 2523 | 94.8 | |||
| 610 | 45.9 | 638 | 48.0 | 1248 | 46.9 | 0.294 | ||
| 720 | 54.1 | 692 | 52.0 | 1412 | 53.1 | |||
Abbreviation: AJCC American Joint Committee on Cancer, BCS Breast-conserving surgery, IQR Interquartile range
a Other includes American Indian/Alaskan Native, Asian/Pacific Islander, and unknown
b Not married includes divorced, separated, single (never married), unmarried or domestic partner, and widowed
c The P value of the chi-square test was calculated between the chemotherapy and no-chemotherapy groups; bold type indicates significance
Comparison of breast cancer-specific survival (BCSS) and overall survival (OS) between matched patients with chemotherapy and no-chemotherapy in specific stage
| Stage | BCSS | OS | ||||
|---|---|---|---|---|---|---|
| 53 | 0.932 | 103 | 0.111 | |||
| 1.024(0.595–1.764) | 0.723(0.485–1.078)* | |||||
| Reference | Reference | |||||
| 164 | 247 | |||||
| 0.564(0.408–0.779) | 0.522(0.400–0.682)* | |||||
| Reference | Reference | |||||
| 142 | 192 | |||||
| 0.549(0.386–0.781) | 0.537(0.395–0.728)* | |||||
| Reference | Reference | |||||
| 359 | 542 | |||||
| 0.612(0.493–0.759)* | 0.549(0.459–0.655)* | |||||
| Reference | Reference | |||||
Abbreviation: HR Hazard ratio, CI Confidence interval, BCSS Breast cancer-specific survival, OS Overall survival, Events No Number of events
a P value was adjusted by a multivariate Cox proportional hazard regression model or a time-dependent covariate analysis. Bold type indicates significance
b The groups using time-dependent covariate analysis are marked with asterisks (*)
Fig. 1 Kaplan–Meier survival curves of the effect of chemotherapy on BCSS (A–D) and OS (E–H) stratified by stage
Fig. 2 Kaplan–Meier survival curves of the effect of chemotherapy on BCSS (A–E) and OS (F–J) stratified by T stage, N stage and tumor grade
Comparison of breast cancer-specific survival (BCSS) and overall survival (OS) between matched patients with chemotherapy and no-chemotherapy in specific clinical variables
| Variables | BCSS | OS | ||||
|---|---|---|---|---|---|---|
| 21 | 0.666 | 34 | 0.872 | |||
| 0.778(0.249–2.432) | 1.072(0.458–2.508) | |||||
| Reference | Reference | |||||
| 80 | 123 | |||||
| 0.420(0.261–0.675) | 0.361(0.243–0.536) | |||||
| Reference | Reference | |||||
| 63 | 0.321 | 90 | ||||
| 0.767(0.454–1.296)* | 0.640(0.419–0.978) | |||||
| Reference | Reference | |||||
| 53 | 0.387 | 81 | 0.306 | |||
| 0.781(0.445–1.368) | 0.790(0.503–1.240)* | |||||
| Reference | Reference | |||||
| 306 | 461 | |||||
| 0.559(0.441–0.708)* | 0.505(0.415–0.615)* | |||||
| Reference | Reference | |||||
Abbreviation: HR Hazard ratio, CI Confidence interval, BCSS Breast cancer-specific survival, OS Overall survival, Events No Number of events
a P value was adjusted by a multivariate Cox proportional hazard regression model or a time-dependent covariate analysis. Bold type indicates significance
b The groups using time-dependent covariate analysis are marked with asterisks (*)
Model performance for 5-year BCSS and 5-year OS
| Algorithms | Accuracy | Precision | Sensitivity | F1 score | AUC |
|---|---|---|---|---|---|
| 5-year BCSS | |||||
| K-nearest neighbor | 0.879 | 0.882 | 0.98 | 0.928 | 0.70 |
| Catboost | 0.905 | 0.892 | 0.974 | 0.932 | 0.69 |
| Decision tree | 0.908 | 0.901 | 0.949 | 0.924 | 0.61 |
| Random forest | 0.869 | 0.889 | 0.971 | 0.929 | 0.70 |
| Gradient booster | 0.882 | 0.887 | 0.991 | 0.936 | 0.75 |
| LightGBM | 0.882 | 0.887 | 0.991 | 0.936 | 0.75 |
| Neural network model | 0.886 | 0.877 | 1.0 | 0.934 | 0.75 |
| Support vector machine | 0.882 | 0.887 | 0.991 | 0.936 | 0.51 |
| XGBoost | 0.879 | 0.892 | 0.98 | 0.934 | 0.70 |
| 5-year OS | |||||
| K-nearest neighbor | 0.844 | 0.857 | 0.952 | 0.902 | 0.73 |
| Catboost | 0.877 | 0.86 | 0.977 | 0.915 | 0.76 |
| Decision tree | 0.882 | 0.869 | 0.940 | 0.903 | 0.69 |
| Random forest | 0.837 | 0.864 | 0.954 | 0.907 | 0.72 |
| Gradient booster | 0.849 | 0.855 | 0.985 | 0.916 | 0.80 |
| LightGBM | 0.851 | 0.859 | 0.983 | 0.916 | 0.81 |
| Neural network model | 0.86 | 0.877 | 0.949 | 0.911 | 0.79 |
| Support vector machine | 0.854 | 0.854 | 0.994 | 0.919 | 0.70 |
| XGBoost | 0.865 | 0.868 | 0.988 | 0.924 | 0.79 |
Abbreviation: AUC Area Under Curve
Confusion matrix of nine algorithms for 5-year BCSS and 5-year OS
| Algorithms | Predictions | Algorithms | Predictions | ||||
|---|---|---|---|---|---|---|---|
| Dead | Alive | Dead | Alive | ||||
| Dead | 3 | 47 | Dead | 15 | 56 | ||
| Alive | 7 | 350 | Alive | 17 | 337 | ||
| Dead | 8 | 42 | Dead | 15 | 56 | ||
| Alive | 9 | 348 | Alive | 8 | 346 | ||
| Dead | 13 | 37 | Dead | 21 | 50 | ||
| Alive | 18 | 339 | Alive | 21 | 333 | ||
| Dead | 7 | 43 | Dead | 18 | 53 | ||
| Alive | 10 | 347 | Alive | 16 | 338 | ||
| Dead | 5 | 45 | Dead | 12 | 59 | ||
| Alive | 3 | 354 | Alive | 5 | 349 | ||
| Dead | 5 | 45 | Dead | 14 | 57 | ||
| Alive | 3 | 354 | Alive | 6 | 348 | ||
| Dead | 0 | 50 | Dead | 24 | 47 | ||
| Alive | 0 | 357 | Alive | 18 | 336 | ||
| Dead | 5 | 45 | Dead | 11 | 60 | ||
| Alive | 3 | 354 | Alive | 2 | 352 | ||
| Dead | 8 | 42 | Dead | 18 | 53 | ||
| Alive | 7 | 350 | Alive | 4 | 350 | ||
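The precision, sensitivity, and F1 columns of the performance table can be related to the confusion matrices with a short calculation. The sketch below is an illustration resting on two assumptions that the extracted tables do not state outright: matrix rows are actual outcomes and columns are predictions, and "Alive" is treated as the positive class. It uses the first 5-year BCSS matrix (Dead: 3, 47; Alive: 7, 350). The function name is hypothetical.

```python
def positive_class_metrics(tp, fn, fp):
    """Precision, sensitivity (recall), and F1 score for the positive class.

    tp: actual positive, predicted positive
    fn: actual positive, predicted negative
    fp: actual negative, predicted positive
    """
    precision = tp / (tp + fp)
    sensitivity = tp / (tp + fn)
    # F1 is the harmonic mean of precision and sensitivity
    f1 = 2.0 * precision * sensitivity / (precision + sensitivity)
    return precision, sensitivity, f1


# First 5-year BCSS confusion matrix, with "Alive" assumed to be the positive class.
actual_dead = {"pred_dead": 3, "pred_alive": 47}
actual_alive = {"pred_dead": 7, "pred_alive": 350}

precision, sensitivity, f1 = positive_class_metrics(
    tp=actual_alive["pred_alive"],
    fn=actual_alive["pred_dead"],
    fp=actual_dead["pred_alive"],
)
print(f"precision = {precision:.3f}, sensitivity = {sensitivity:.3f}, F1 = {f1:.3f}")
# prints "precision = 0.882, sensitivity = 0.980, F1 = 0.928"
```

These values agree with the precision, sensitivity, and F1 score in the first 5-year BCSS row of the performance table, which is consistent with, though does not prove, the assumed orientation and positive class.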
Fig. 3 The importance score of predictor variables in predicting 5-year BCSS (A) and 5-year OS (B)