| Literature DB >> 34604089 |
Wenting Peng1,2,3,4, Caijin Lin1,2,3, Shanshan Jing3,5, Guanhua Su1,2,3, Xi Jin1,2,3, Genhong Di1,2,3, Zhiming Shao1,2,3.
Abstract
BACKGROUND: The prognosis of lymph node-negative triple-negative breast cancer (TNBC) is still worse than that of other subtypes despite adjuvant chemotherapy. Reliable prognostic biomarkers are required to identify lymph node-negative TNBC patients at a high risk of distant metastasis and optimize individual treatment.Entities:
Keywords: distant metastasis; modeling; prognostic biomarker; transcriptomics; triple-negative breast cancer
Year: 2021 PMID: 34604089 PMCID: PMC8481824 DOI: 10.3389/fonc.2021.746763
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Clinicopathological characteristics of patients and their tumors.
| Characteristics | Number of patients (%) |
| ||
|---|---|---|---|---|
| Whole set | Training set | Validation set | ||
|
| 0.865 | |||
| ≤50 | 86 (42.6%) | 61 (43.0%) | 25 (41.7%) | |
| >50 | 116 (57.4%) | 81 (57.0%) | 35 (58.3%) | |
|
| 0.468 | |||
| Premenopausal | 75 (37.1%) | 55 (38.7%) | 20 (33.3%) | |
| Postmenopausal | 127 (62.9%) | 87 (61.3%) | 40 (66.7%) | |
|
| 0.183 | |||
| I | 35 (17.3%) | 27 (19.0%) | 8 (13.3%) | |
| II | 13 (6.4%) | 9 (6.3%) | 4 (6.7%) | |
| III | 134 (66.3%) | 96 (67.6%) | 38 (63.3%) | |
| Unknown | 20 (9.9%) | 10 (7.0%) | 10 (16.7%) | |
|
| 0.239 | |||
| ≤2cm | 85 (42.1%) | 64 (45.1%) | 21 (35.0%) | |
| >2-5cm | 111 (55.0%) | 75 (52.8%) | 36 (60.0%) | |
| >5cm | 6 (3.0%) | 3 (2.1%) | 3 (5%) | |
|
| 0.820 | |||
| ≤20% | 28 (13.9%) | 20 (14.1%) | 8 (13.3%) | |
| >20% | 169 (83.7%) | 119 (83.8%) | 50 (83.3%) | |
| Unknown | 5 (2.5%) | 3 (2.1%) | 2 (3.3%) | |
|
| 0.644 | |||
| No | 6 (3.0%) | 4 (2.8%) | 2 (3.3%) | |
| Yes | 188 (93.1%) | 131 (92.3%) | 57 (95.0%) | |
| Unknown | 8 (4.0%) | 7 (4.9%) | 1 (1.7%) | |
|
| 0.861 | |||
| No | 180 (89.1%) | 127 (89.4%) | 53 (88.3%) | |
| Yes | 21 (10.4%) | 14 (9.9%) | 7 (11.7%) | |
| Unknown | 1 (0.5%) | 1 (0.7%) | 0 (0.0%) | |
|
| 0.345 | |||
| No | 190 (94.1%) | 135 (95.1%) | 55 (91.7%) | |
| Yes | 12 (5.9%) | 7 (4.9%) | 5 (8.3%) | |
P values were calculated using Pearson’s chi-square test or Fisher’s exact test to compare the clinical and pathological characteristics between the training set and validation set.
Figure 1Flowchart of study design. TNBC, triple-negative breast cancer; FC, fold change; LASSO, least absolute shrinkage and selection operator; ROC, receiver operating characteristic.
Figure 2Volcano plot for differentially expressed mRNAs between patients with and without distant metastasis. In total, 71 differentially expressed mRNAs were screened out with log2(fold change) > 1 or < -1 and P < 0.05. Significantly upregulated and downregulated mRNAs are shown as red and blue dots, respectively.
Genes included in the seven-gene prognostic signature.
| Gene symbol | Log2 FC | Coefficient | HR (95% CI) |
|
|---|---|---|---|---|
| B3GALT5-AS1 | 1.18 | -0.06761697 | 0.93 (0.41-2.15) | 0.87 |
| DNER | 1.60 | 0.18801037 | 1.21 (0.39-3.73) | 0.74 |
| CSN1S1 | 1.61 | 0.28358112 | 1.33 (1.03-4.30) | 0.11 |
| KIF5A | 1.10 | 0.36011127 | 1.43 (0.79-2.61) | 0.24 |
| SIX3 | 1.28 | 0.57677377 | 1.78 (0.92-3.44) | 0.09 |
| NOTUM | 1.81 | 0.70105693 | 2.02 (1.22-3.33) | 0.01 |
| CPS1 | 1.51 | 0.74508978 | 2.11 (1.03-4.30) | 0.04 |
FC, fold change; HR, hazard ratio; CI, confidence interval.
The difference in the expression of seven genes between the group with and without distant metastasis was calculated using the limma package in R software.
The coefficients, hazard ratios, 95% confidence intervals, and P values of seven genes were calculated using a multivariate Cox proportional hazards regression model.
Figure 3Time-dependent receiver operating characteristic (ROC), Kaplan–Meier survival analysis, and risk score analysis for the seven-gene signature in the training set and validation set of the lymph node-negative triple-negative breast cancer (TNBC) cohort. AUC, area under the curve. (A) Time-dependent ROC curves of the seven-gene signature for 3-, 4-, and 5-year distant metastasis-free survival (DMFS). (B) Kaplan–Meier plots of the seven-gene signature illustrating that the patients in the high-risk group showed poorer DMFS than those in the low-risk group. (C) Distribution of genomic risk score, DMFS status of patients, and heat map of seven differentially expressed mRNA expression profiles.
Figure 4Time-dependent receiver operating characteristic (ROC) and Kaplan–Meier survival analysis for the clinical model and combined model in the training set and validation set of the lymph node-negative triple-negative breast cancer (TNBC) cohort. AUC, area under the curve. (A) Time-dependent ROC curves of the clinical model for 3-, 4-, and 5-year distant metastasis-free survival (DMFS). (B) Time-dependent ROC curves of the combined model for 3-, 4-, and 5-year DMFS. (C) Kaplan–Meier plots of the combined model illustrating that the patients in the high-risk group showed poorer DMFS than those in the low-risk group.
Figure 5A predictive nomogram was established in the training set. AUC, area under the curve. (A) The nomogram was built by the seven-gene risk score and clinical characteristics, including age and tumor size. (B) The time-dependent receiver operating characteristic (ROC) curves of the seven-gene model, clinical model, and combined model for 4- and 5-year distant metastasis-free survival (DMFS). The combined model was better than the clinical model for predicting 4-year (P = 0.029) and 5-year (P = 0.038) DMFS. (C) Calibration plots of the nomogram for 4-year DMFS. (D) Decision curve analysis (DCA) of the seven-gene model, clinical model, and combined model for 4-year DMFS.