| Literature DB >> 32010179 |
Yu Dong1, Yang Xiao2,3, Qihui Shi1, Chunjie Jiang2,3.
Abstract
Breast cancer is the most common cancer in women, but few biomarkers are effective in clinic. Previous studies have shown the important roles of non-coding RNAs in diagnosis, prognosis, and therapy selection for breast cancer and have suggested the significance of integrating molecules at different levels to interpret the mechanism of breast cancer. Here, we collected transcriptome data including long non-coding RNA (lncRNA), microRNA (miRNA), and mRNA for ~1,200 samples, including 1079 invasive breast carcinoma samples and 104 normal samples, from The Cancer Genome Atlas (TCGA) project. We identified differentially expressed lncRNAs, miRNAs, and mRNAs that distinguished invasive carcinoma samples from normal samples. We further constructed an integrated dysregulated network consisting of differentially expressed lncRNAs, miRNAs, and mRNAs and found housekeeping and cancer-related functions. Moreover, 58 RNA binding proteins (RBPs) involved in biological processes that are essential to maintain cell survival were found in the dysregulated network, and 10 were correlated with overall survival. In addition, we identified two modules that stratify patients into high- and low-risk subgroups. The expression patterns of these two modules were significantly different in invasive carcinoma versus normal samples, and some molecules were high-confidence biomarkers of breast cancer. Together, these data demonstrated an important clinical application for improving outcome prediction for invasive breast cancers.Entities:
Keywords: RNA binding protein; biomarker; integrative analysis; invasive breast carcinoma; lncRNAs; miRNAs
Year: 2020 PMID: 32010179 PMCID: PMC6975227 DOI: 10.3389/fgene.2019.01284
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Clustering based on differentially expressed molecules. Unsupervised hierarchical clustering of all samples based on differentially expressed lncRNAs (A) protein-coding genes (B) and miRNAs (C). The unsupervised hierarchical clustering was performed using an R package, ‘pheatmap’ with the default distance setting, Euclidean distance. (D–F) PCA analysis based on differentially expressed lncRNAs (D), protein-coding genes (E), and miRNAs (F). PCA analysis was performed by the R function ‘prcomp’.
Figure 2Functional analysis for the dysregulated network. (A) The dysregulated lncRNA-miRNA-mRNA network. The network was visualized using Cytoscape. (B) The top 10 enriched GO terms. (C) The top 10 enriched cancer hallmark-related GO terms. (D) The top 10 enriched GAD terms.
Figure 3RBPs in the dysregulated network. (A) Protein–protein interaction (PPI) network of RBPs based on STRING. (B) The top 10 enriched GO MF terms. (C) The top 10 enriched GO BP terms. BP, biological process.
Figure 4Analysis of modules identified from the dysregulated network. (A, B) The two modules identified from the dysregulated network using Cytoscape with default parameters. (C, D) Kaplan-Meier plot of survival for these two modules. (E) Expression patterns of the modules in normal and cancer samples. The average expression value of each molecule crossing all normal/cancer samples was used.