| Literature DB >> 32977823 |
Xiaoying Zhou1, Chuanguang Xiao2, Tong Han3, Shusheng Qiu2, Meng Wang1, Jun Chu1, Weike Sun2, Liang Li2, Lili Lin4.
Abstract
BACKGROUND: This study intended to determine important genes related to the prognosis and recurrence of breast cancer.Entities:
Keywords: Breast cancer; Logit regression model; Prognosis; Recurrence
Mesh:
Substances:
Year: 2020 PMID: 32977823 PMCID: PMC7519567 DOI: 10.1186/s12957-020-02026-z
Source DB: PubMed Journal: World J Surg Oncol ISSN: 1477-7819 Impact factor: 2.754
Fig. 1a The volcano plot of differentially expressed genes (DEGs). Blue circle represents DEGs, black horizontal line represents FDR < 0.05, and two black vertical lines represent |log2FC| > 1. b Bidirectional hierarchical clustering heatmap based on DEG expression level. White and black bars represent poor and better disease-free survival (DFS) breast cancer tumor samples, respectively
GO function node list (top 3) significantly enriched by upregulated and downregulated DEGs
| Category | Term | Count | ||
|---|---|---|---|---|
| Upregulation | BP | GO:0006955~immune response | 67 | 1.270E−27 |
| GO:0006952~defense response | 45 | 1.130E−13 | ||
| GO:0019882~antigen processing and presentation | 14 | 9.100E−09 | ||
| CC | GO:0044459~plasma membrane part | 75 | 9.160E−08 | |
| GO:0005887~integral to plasma membrane | 45 | 6.820E−06 | ||
| GO:0005886~plasma membrane | 103 | 7.390E−06 | ||
| MF | GO:0019955~cytokine binding | 10 | 2.910E−04 | |
| GO:0008236~serine-type peptidase activity | 10 | 8.588E−03 | ||
| GO:0017171~serine hydrolase activity | 10 | 9.209E−03 | ||
| KEGG | hsa04514:Cell adhesion molecules (CAMs) | 15 | 9.070E−06 | |
| hsa04612:Antigen processing and presentation | 12 | 1.040E−05 | ||
| hsa04062:Chemokine signaling pathway | 17 | 3.180E−05 | ||
| Downregulation | BP | GO:0045137~development of primary sexual characteristics | 5 | 8.024E−03 |
| GO:0009968~negative regulation of signal transduction | 6 | 1.340E−02 | ||
| GO:0007548~sex differentiation | 5 | 1.409E−02 | ||
| CC | GO:0044456~synapse part | 7 | 1.228E−02 | |
| GO:0045202~synapse | 8 | 2.139E−02 | ||
| GO:0005761~mitochondrial ribosome | 3 | 3.516E−02 | ||
| MF | GO:0048306~calcium-dependent protein binding | 3 | 1.372E−02 | |
| GO:0005509~calcium ion binding | 14 | 1.696E−02 | ||
| GO:0008083~growth factor activity | 5 | 2.078E−02 | ||
| KEGG | hsa04514:Cell adhesion molecules (CAMs) | 3 | 2.108E−02 | |
| hsa04150:mTOR signaling pathway | 2 | 2.884E−02 | ||
| hsa03018:RNA degradation | 2 | 3.115E−02 |
The important prognostic feature DEGs screened by Logit model
| ID | B | SE | Df | |
|---|---|---|---|---|
| UBE2L6 | − 2.2813 | 0.7054 | 1 | 0.00122 |
| ABCA3 | − 1.5546 | 0.5185 | 1 | 0.00272 |
| MAP2K6 | − 0.7117 | 0.3443 | 1 | 0.03877 |
| IL1RN | − 0.5372 | 0.2485 | 1 | 0.03064 |
| FOXJ1 | − 0.2375 | 0.0955 | 1 | 0.01285 |
| CCL22 | − 0.5239 | 0.2706 | 1 | 0.04529 |
| APC2 | 0.6777 | 0.3478 | 1 | 0.04513 |
| TRPM2 | 0.9379 | 0.3747 | 1 | 0.01232 |
| KCNIP3 | 0.9632 | 0.3339 | 1 | 0.00392 |
| MRPL13 | 3.0732 | 0.7985 | 1 | 0.00012 |
Note:B regression coefficient, SE Standard error, Df Degree of freedom
Fig. 2The KM curves of 10 important characteristic DEGs. Red represents the high-expression sample group, and blue represents the low-expression sample group
The classification result in TCGA dataset and GSE45725 microarray dataset
| TCGA | Predict | |||
|---|---|---|---|---|
| Class | Better DFS | Poor DFS | Percent (%) | |
| Observed | Better DFS | 172 | 9 | 95.03 |
| Poor DFS | 14 | 38 | 73.08 | |
| Overall percent (%) | 90.13 | |||
| GSE45725 | Better DFS | 100 | 7 | 93.46 |
| Observed | Poor DFS | 6 | 14 | 70.00 |
| Overall percent (%) | 89.76 | |||
DFS Disease-free survival
Fig. 3The KM curves of the correlation between the classification groups in TCGA training set (a) and GSE45725 validation set (b) and the actual survival prognosis based on the LR classification model. The blue and red curves represent the groups of good and poor prognosis samples predicted by the LR classification model, respectively. c Area under the curve (AUC) was calculated from the receiver operating characteristic (ROC) curve. Black and red curves represent the TCGA training set and GSE45725 validation set
Fig. 4The KM curves and AUC were analyzed in the basal type (a), Her2 type (b), LumA type (c), and LumaB type (d) of breast cancer using TCGA dataset. The blue and red curves represent the groups of better and poor DFS samples predicted by the LR classification model, respectively
Fig. 5The column diagram of the expression levels of the 10 important characteristic DEGs in the TCGA training set (a) and the GSE45725 validation set (b) in the better and poor DFS groups, with white and black columns representing the better and poor DFS sample groups, respectively