| Literature DB >> 33080569 |
Shipeng Ning1,2, Hui Li1,2, Kun Qiao1,2, Qin Wang1,2, Meiying Shen1,2, Yujuan Kang1,2, Yanling Yin1,2, Jiena Liu1,2, Lei Liu1,2, Siyu Hou1,2, Jianyu Wang1,2, Shouping Xu1,2,3, Da Pang1,2,3.
Abstract
Breast cancer patients at the same stage may show different clinical prognoses or different therapeutic effects of systemic therapy. Differentially expressed genes of breast cancer were identified from GSE42568. Through survival, receiver operating characteristic (ROC) curve, random forest, GSVA and a Cox regression model analyses, genes were identified that could be associated with survival time in breast cancer. The molecular mechanism was identified by enrichment, GSEA, methylation and SNV analyses. Then, the expression of a key gene was verified by the TCGA dataset and RT-qPCR, Western blot, and immunohistochemistry. We identified 784 genes related to the 5-year overall survival time of breast cancer. Through ROC curve and random forest analysis, 10 prognostic genes were screened. These were integrated into a complex by GSVA, and high expression of the complex significantly promoted the recurrence-free survival of patients. In addition, key genes were related to immune and metabolic-related functions. Importantly, we identified methylation of MEX3A and TBC1D 9 and mutations events. Finally, the expression of UGCG was verified by the TCGA dataset and by experimental methods in our own samples. These results indicate that 10 genes may be potential biomarkers and therapeutic targets for long-term survival in breast cancer, especially UGCG.Entities:
Keywords: 5-year survival time; UGCG; breast cancer; complex; enrichment
Mesh:
Substances:
Year: 2020 PMID: 33080569 PMCID: PMC7655188 DOI: 10.18632/aging.103807
Source DB: PubMed Journal: Aging (Albany NY) ISSN: 1945-4589 Impact factor: 5.682
Figure 1Study flowchart.
Figure 2Gene expression in breast cancer patients. (A) Differentially expressed genes between breast cancer patients and the control group as well as breast cancer patients with high and low survival times. (B) The common differentially expressed genes in the two groups.
Figure 3Identification of key genes that can predict the 5-year survival time of breast cancer patients. (A) Random forest screening for the top 10 genes with a high Gini coefficient of average decline. (B) Expression of the 10 selected genes in samples with a survival time of more than or less than 5 years. (C) Kaplan-Meier analysis of overall survival for the signatures associated with expression of the 10 genes in breast cancer. (D) AUC of the 10 selected genes that affect the survival time of breast cancer.
Figure 4The effect of 10 genes on the prognosis of breast cancer. (A) GSVA integrates 10 genes into a complex, which affects the overall survival rate of breast cancer patients. (B) The risk ratio of the gene complex and clinical information to breast cancer prognosis. (C) Kaplan-Meier relapse analysis of the effect of the complex on breast cancer relapse. (D) A Cox regression model was used to analyse the effect of multiple variables on breast cancer relapse. (E) The difference of 24 kinds of immune cells in breast cancer with longer than 5 years and shorter than 5 years survival time. (F) The correlation curve between the complex and eosinophilia.
Figure 5The biological function and signalling pathway of the key genes affect the prognosis of breast cancer. (A) The bubble chart shows BP and KEGG enriched by key genes. (B) GSEA of genes expressed by breast cancer patients with survival times greater than 5 years. (C) The same BP as GSEA was clustered into three types of biological functions.
Figure 6Potential factors affecting key genes that influence the survival time of breast cancer patients. (A) Methylation level and expression of MEX3A and TBC1D9 in breast cancer patients with survival times greater than or less than 5 years. Mosaic analysis identified the relationship between the expression of MEX3A (B) or TBC1D9 (C) and the prognosis and clinical information of breast cancer. (D). Six key genes were sequenced according to their mutation frequency. Different colours represent different methods of mutation. (E) The transition and crosscut graphs show the distribution of SNV in breast cancer with six transition and crosscut events. The stacked bar graph (bottom) shows the mutation spectrum distribution of each sample. (F) The Lollipop map shows the mutation distribution and protein domain of GATA3 with a high frequency of mutation. (G) The Rainfall map of TCGA-AC-A23H-01A-11D-A159-09 breast cancer sample. Each point is a mutation colour coded according to the SNV classification.
Figure 7TCGA sets and experiments to verify the expression of key genes. (A) Expression of 10 key genes in breast cancer patients with a survival time of more than or less than 5 years with four data sets. (B) TCGA and GSE7390 were used to verify the significant expression of UGCG. (C) The mRNA level of UGCG in breast cancer patients with a survival time greater than or less than 5 years was detected by qRT-PCR. (D) Western blot was used to detect the expression of UGCG in breast cancer patients with a survival time greater than or less than 5 years. (E) Immunohistochemistry images of UGCG levels in breast cancer tissues with a survival time greater than or less than 5 years. Scale bar = 200 μm. *P < 0.05, ***P < 0.001.
Primer sequence of UGCG and GAPDH.
| UGCG_F | TTCTTGGTGCTGTGGCTGATGC |
| UGCG_R | AGAGAGACACCTGGGAGCTTGC |
| GAPDH_F | AGAAGGCTGGGGCTCATTTG |
| GAPDH_R | AGGGGCCATCCACAGTCTTC |