| Literature DB >> 31802913 |
Yucan Xu1, Lingsha Ju1, Jianhua Tong1, Chengmao Zhou1, Jianjun Yang1.
Abstract
OBJECTIVE: To use machine learning algorithms to predict the death outcomes of patients with triple-negative breast cancer, 5 years after discharge.Entities:
Keywords: 5-year prognosis; machine learning; triple-negative breast cancer
Year: 2019 PMID: 31802913 PMCID: PMC6830358 DOI: 10.2147/OTT.S223603
Source DB: PubMed Journal: Onco Targets Ther ISSN: 1178-6930 Impact factor: 4.147
Baseline Data
| Five-Year Mortality | Survival | Death | P-value |
|---|---|---|---|
| Number | 1143 | 425 | |
| Age (years) | 48.9 ± 11.3 | 49.7 ± 11.8 | 0.218 |
| White blood cell (109 cells/L) | 6.6 ± 1.8 | 6.9 ± 1.9 | 0.088 |
| Hemoglobin | 125.7 ± 13.2 | 126.1 ± 12.3 | 0.587 |
| Platelet (109 cells/L) | 240.3 ± 60.9 | 247.6 ± 62.3 | 0.038 |
| Lymphocyte (109 cells/L) | 2.0 ± 0.6 | 2.0 ± 0.7 | 0.272 |
| Neutrophile (109 cells/L) | 4.1 ± 1.5 | 4.3 ± 1.6 | 0.043 |
| Monocyte (109 cells/L) | 0.4 ± 0.2 | 0.4 ± 0.2 | 0.551 |
| NLR | 2.3 ± 1.2 | 2.4 ± 1.5 | 0.573 |
| PLR | 134.5 ± 61.0 | 136.3 ± 63.9 | 0.956 |
| LMR | 5.6 ± 8.0 | 5.1 ± 2.9 | 0.541 |
| ER | 1.6 ± 1.3 | 1.7 ± 1.4 | 0.838 |
| PR | 1.6 ± 1.2 | 1.4 ± 1.3 | <0.001 |
| Intraoperative NSAID | <0.001 | ||
| No | 385 (34.0%) | 60 (14.2%) | |
| Yes | 749 (66.0%) | 364 (85.8%) | |
| Chemotherapy | <0.001 | ||
| No | 142 (12.4%) | 87 (20.5%) | |
| Yes | 1001 (87.6%) | 338 (79.5%) | |
| Tumor Size | <0.001 | ||
| T1 | 663 (61.6%) | 189 (48.5%) | |
| T2 | 344 (32.0%) | 165 (42.3%) | |
| T3–4 | 69 (6.4%) | 36 (9.2%) | |
| Axillary Surgery | <0.001 | ||
| SLNB | 536 (47.1%) | 275 (66.4%) | |
| ALND | 599 (52.7%) | 139 (33.6%) | |
| Uncertain | 2 (0.2%) | 0 (0.0%) | |
| Breast Surgery | 0.102 | ||
| Conservation | 518 (45.3%) | 173 (40.7%) | |
| Mastectomy | 625 (54.7%) | 252 (59.3%) | |
| HER2 | 0.379 | ||
| Negative | 895 (78.4%) | 323 (76.4%) | |
| Positive | 246 (21.6%) | 100 (23.6%) | |
| Molecular Subtypes | <0.001 | ||
| Luminal | 760 (66.5%) | 241 (56.7%) | |
| Her2 positive | 245 (21.4%) | 99 (23.3%) | |
| Triple negative | 138 (12.1%) | 85 (20.0%) |
Abbreviations: ER, Estrogen receptor; PR, progesterone receptor; NLR, neutrophil-to-lymphocyte ratio; LMR, lymphocyte-to-monocyte ratio; PLR, the platelet-to-lymphocyte ratio; ALND, axillary lymph node dissection; SLNB, Sentinel lymph node biopsy for breast cancer.
Figure 1Correlation analysis of various factors.
Figure 2Variable importance of features included in the machine learning algorithm for predicting triple-negative breast cancer five-year mortality.
Forecast Results For Training Group
| Accuracy | Precision | Recall | f1_score | AUC | MSE | |
|---|---|---|---|---|---|---|
| Logistic | 0.737640 | 0.551402 | 0.173529 | 0.263982 | 0.722799 | 0.2624 |
| DecisionTree | 0.734450 | 0.518325 | 0.291176 | 0.372881 | 0.702804 | 0.2656 |
| Forest | 0.770335 | 1.000000 | 0.152941 | 0.265306 | 0.896673 | 0.2297 |
| GradientBoosting | 0.751994 | 0.737705 | 0.132353 | 0.224439 | 0.776836 | 0.2480 |
| gbm | 0.760766 | 0.954545 | 0.123529 | 0.218750 | 0.895408 | 0.2392 |
Figure 3Machine learning algorithm for predicting five-year mortality for triple-negative breast cancer patients in the training group (forest, gbm, gradientboost (Gbdt), Logistic (lr) and DecisionTree (tr)).
Forecast Results For Testing Group
| Accuracy | Precision | Recall | f1_score | AUC | MSE | |
|---|---|---|---|---|---|---|
| Logistic | 0.738854 | 0.578947 | 0.129412 | 0.211538 | 0.715438 | 0.2611 |
| DecisionTree | 0.748408 | 0.568182 | 0.294118 | 0.387597 | 0.691960 | 0.2516 |
| forest | 0.738854 | 1.000000 | 0.035294 | 0.068182 | 0.712767 | 0.2611 |
| GradientBoosting | 0.732484 | 0.555556 | 0.058824 | 0.106383 | 0.731595 | 0.2675 |
| gbm | 0.729299 | 0.500000 | 0.023529 | 0.044944 | 0.708348 | 0.2707 |
Figure 4Machine learning algorithm for prediction of five-year mortality for triple-negative breast cancer patients in the testing group (forest, gbm, gradientboost (Gbdt), Logistic (lr) and DecisionTree (tr)).