Literature DB >> 32190687

Identification of Long Noncoding RNAs as Predictors of Survival in Triple-Negative Breast Cancer Based on Network Analysis.

Xiao-Xiao Li1,2, Li-Juan Wang1,2, Jie Hou1, Hong-Yang Liu1, Rui Wang1, Chao Wang1, Wen-Hai Xie1,2.   

Abstract

Breast cancer is the most common cancer observed in adult females, worldwide. Due to the heterogeneity and varied molecular subtypes of breast cancer, the molecular mechanisms underlying carcinogenesis in different subtypes of breast cancer are distinct. Recently, long noncoding RNAs (lncRNAs) have been shown to be oncogenic or play important roles in cancer suppression and are used as biomarkers for diagnosis and therapy. In this study, we identified 134 lncRNAs and 6,414 coding genes were differentially expressed in triple-negative (TN), human epidermal growth factor receptor 2- (HER2-) positive, luminal A-positive, and luminal B-positive breast cancer. Of these, 37 lncRNAs were found to be dysregulated in all four subtypes of breast cancers. Subtypes of breast cancer special modules and lncRNA-mRNA interaction networks were constructed through weighted gene coexpression network analysis (WGCNA). Survival analysis of another public datasets was used to verify the identified lncRNAs exhibiting potential indicative roles in TN prognosis. Results from heat map analysis of the identified lncRNAs revealed that five blocks were significantly displayed. High expressions of lncRNAs, including LINC00911, CSMD2-AS1, LINC01192, SNHG19, DSCAM-AS1, PCAT4, ACVR28-AS1, and CNTFR-AS1, and low expressions of THAP9-AS1, MALAT1, TUG1, CAHM, FAM2011, NNT-AS1, COX10-AS1, and RPARP-AS1 were associated with low survival possibility in TN breast cancers. This study provides novel lncRNAs as potential biomarkers for the therapeutic and prognostic classification of different breast cancer subtypes.
Copyright © 2020 Xiao-Xiao Li et al.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32190687      PMCID: PMC7073484          DOI: 10.1155/2020/8970340

Source DB:  PubMed          Journal:  Biomed Res Int            Impact factor:   3.411


1. Introduction

Breast cancer is the most frequently occurring cancer and the second cause of female cancer mortality worldwide. According to a report from the International Agency for Research on Cancer, breast cancer accounts for approximately 11.6% of cancer incidences [1]. Although deaths caused by breast cancer have declined over time due to the development of new diagnostic and therapeutic strategies, the molecular mechanisms underlying this disease remain to be revealed. As breast cancers are notoriously heterogeneous, clinical and morphologic signatures are diversified. Consequently, the exact classification of breast cancer is a challenge for clinicians and scientist. Variations in the expressions of the estrogen receptor (ER), human epidermal growth factor receptor 2 (HER2), and progesterone receptor (PR) can be used to classify breast cancers into four major intrinsic molecular subtypes: triple-negative (TN), luminal A, luminal B, and HER2-positive [2]. TN breast cancers are well-known as lacking ER and PR as well as HER2 [3]. Luminal A breast cancers are defined as ER-positive, HER2-negative, and “low” recurrence risk, partially expressing PR [4]. Luminal B breast cancers are characterized by HER2-positive, ER and PR-negative, and a high histologic grade [5]. HER2-positive breast cancers are defined as HER2-positive and lacking of ER and PR. Along with ER, PR, and HER2, many other molecular genes also have been used to subtype breast cancers. High expression of Ki67, known as a proliferation marker, was usually been used to define breast cancer subtypes [6]. Molecular classifications are using multigene assays to classify breast cancers into low- and high-risk groups for personalized therapy [7, 8]. Many genes have been found as signatures in the diagnosis and prognosis of breast cancer. Decreased levels of uPA and PAI-1 are associated with lower risk of cancer recurrence, and overexpression of Ki67, cyclin D, cyclin E, p27, and p21 indicates uncontrolled tumor cell proliferation [9]. The molecular interaction regulation mechanisms mediating heterogeneity and different molecular subtypes of breast cancer need to be further probed. Long noncoding RNAs (lncRNAs), which are 200 nucleotides in length and play important roles in various biological pathways, have been shown to be oncogenic or play important roles in cancer suppression and are used as biomarkers for diagnosis and therapy [10, 11]. Aberrant expressions of lncRNAs, such as MALAT1 [12], DANCR [13], PDCD4-AS1 and [14], MIR100HG [15], are closely associated with the occurrence and progression of breast cancer. Overexpression of HOTAIR stimulates the invasion and metastasis of breast cancer cells [16]. PIWI-interacting noncoding RNA especially expressed in breast cancer is a response to estrogen regulation in breast cancer cells [17]. lncRNAs can also mediate the progression of breast cancer by interacting with microRNAs. For example, MEG3 acts as a suppressor to inhibit cell epithelial-mesenchymal transition by sponging miR-421 and targeting E-cadherin in breast cancer [18]. Highly expressed H19/low miR-675 and low NEAT1/high miR-204 could differentiate breast cancer subtypes and could be considered as diagnostic and therapeutic biomarkers [19]. lncRNAs can also act as epigenetic regulators affecting histone modifications and stabilize signal complexes and nuclear structures by recruiting chromatin modification factors [20]. Although a growing volume of lncRNAs has been found playing varied roles in the pathogenesis of breast cancer and drug resistance, their biological participation mechanisms are not yet fully understood. TN breast cancer is a heterogeneous subtype of breast cancers with poor prognosis and is beginning to be refined by molecular characteristics. Based on gene expression profiles, six TN breast cancer subtypes were classified [21]. As the high heterogeneous of TN breast cancer, it is necessary to disclose its molecular characteristics and identify effective clinical strategy to improve the therapeutic approach. A series of lncRNAs has been found dysregulated in TN breast cancer [22]. lncRNAs are involved in the regulation of most mammalian protein coding genes, which are responsible for various cellular processes such as cell differentiation, development, proliferation, and apoptosis. Exploitation of the function of dysregulated lncRNAs will provide potential clinical applications for TN's diagnosis and treatment. WGCNA can be used to identify key lncRNAs associated with multiple cancer pathogenesis and progression [23-26]. Survival analysis verified the identified lncRNAs possessing potential indicative roles in TN prognosis. These survival expectation indictors associated with coexpressing coding genes formed coexpressed axes, which could act as attractive therapeutic targets for the treatment of TN breast cancer. The workflow of this study is summarized in Figure 1.
Figure 1

Flow diagram of this study. First, breast cancer expression profiles were downloaded from Gene Expression Omnibus. Second, the pipeline of lncRNA classification was utilized to identify lncRNA expression. Third, constructed and analysis breast cancers associated network-based on WGCNA. Finally, validation of the prediction roles of identified lncRNAs in TN breast cancers was performed by survival analysis.

2. Materials and Methods

2.1. Breast Cancer Expression Profiles Downloaded from Gene Expression Omnibus

The public datasets GSE45827 and GSE58812 were downloaded from the NCBI Gene Expression Omnibus (GEO). GSE45827 includes 41 TN, 30 HER2, 29 luminal A, and 30 luminal B breast cancer patients and 11 normal tissue samples [27]. GSE58812 includes 107 triple-negative breast cancer patients [28]. Normal tissue samples were used as a control to identify differentially expressed coding genes (DEGs) and differentially expressed lncRNAs (DElncRNAs). In order to reduce the bias of different data resources, datasets adopted in this study were both based on the same GPL570 platform. The type of microarray was Affymetrix Human Genome U133 plus 2.0 Array. All raw data preprocessing was conducted in the R software environment. The Affy package was adopted to process the initial datasets [29]. A robust multiarray average function was used to detect and normalize the expression of probes. The Limma package was used for differential expression analysis [30]. The Benjamini and Hochberg method was used to control the false discovery rate (FDR). The adjusted p values were used to control the FDR. The FDR ratio was set as 5%. Adjusted p values < 0.05 and log (fold change) > 1 were selected as the thresholds for DEGs and DElncRNAs.

2.2. lncRNA Classification Pipeline

The pipeline of lncRNA classification was utilized to identify lncRNA expression as previously described [31], with minor modifications. The Affymetrix HG-U133 plus 2.0 probe set ID was mapped to the latest NetAffx annotation files. First, we extracted the probe sets with RefSeq IDs labelled as “NR_”. For the selected probe sets annotated with Ensembl gene IDs, we annotated with lncRNA, processed the transcript noncoding and antisense according to Homo_sapiens.GRCH38.96.gtf. The last probe sets without Ensembl gene IDs were filtered using pseudogenes, rRNAs, microRNAs, snoRNA, and tRNAs according to NetAffx annotation files. Finally, we obtained 2,553 Affymetrix probe IDs, which were annotated as lncRNA transcripts (1,870 annotated lncRNAs).

2.3. Construction of a Weighted Gene Coexpression Network and Identification of Modules Associated with Different Subtypes of Breast Cancer

WGCNA was adopted to identify coexpression modules [32]. The signed weighted correlation network was constructed by the first creating a matrix of pairwise correlations between all pairs of genes [33]. The power of seven was interpreted as a soft threshold of the correlation matrix. Depending on the resulting adjacency matrix, we calculated the topological overlap matrix, which measures the interconnectedness of the coexpression network. Then, genes with highly similar coexpression relationships were grouped together by hierarchical clustering based on the topological overlap matrix. The Dynamic Hybrid Tree Cut algorithm was adopted to define modules, meaning the coexpression genes. Module eigengenes were defined as the first principal component to represent each module. Finally, modules were merged based on correlation above 0.85. After identifying the coexpression modules, we associated modules with each sample type. Modules with correlation > 0.5 (p < 0.01) were picked up as respective subtypes of breast cancer.

2.4. Construction and Analysis of lncRNA-mRNA Coexpression Network

Based on the Pearson correlation coefficient of lncRNA-mRNA, we constructed breast cancer-related lncRNA-mRNA coexpression networks. The threshold of selection for lncRNA-mRNA pairs was set at an absolute value of the Pearson correlation coefficient above 0.6. Topological analysis of the coexpression network was performed using the Cytoscape (version 3.7.1) software [34]. Hub genes were determined using high connectivity of genes by summing the connection strengths with other module genes.

2.5. Gene Ontology Analysis

Gene ontology analysis of identified modules and core genes was performed using DAVID 6.8 [35]. The corrected FDR of a p value < 0.05 was selected as the threshold for enrichment of the GO_FAT terms.

2.6. Validation of Identified lncRNA Roles in TN Cancer Classification

The dataset GSE58812 was used to verify the important roles of identified DElncRNAs. Preprocessing of GSE58812 was conducted in the R software as described above. After obtaining the expression matrix of the datasets, these TN cancer samples were classified into groups based on principal component analysis. The clustering was performed by package ggfortify of R [36]. As the raw reference of GSE58812 classified the 107 TN breast cancer samples into three subtypes, we set the cluster number as 3. According to the clinical data of these TN cancer samples, survival analysis was performed using survminer package [37]. The median of DElncRNA expressions among different TN cancer sample groups was adopted to display the panel of predicting survival expectation.

3. Results

3.1. DEGs and DElncRNAs in Different Subtypes of Breast Cancer Patients

After applying lncRNA classification, we annotated the microarray with 17,242 coding genes and 1,870 lncRNAs. We identified 3,514 coding genes and 96 lncRNAs, 3,181 coding genes and 87 lncRNAs, 2,198 coding genes and 60 lncRNAs, and 2,841 coding genes and 70 lncRNAs differentially expressed in TN, HER2, luminal A, and luminal B breast cancers compared to normal tissues (Figure 2). The four different subtypes of breast cancer shared 37 common differentially expressed lncRNAs (Figure 3(a)). Most of the 37 common differentially expressed noncoding RNAs were downregulated in the four different subtypes of breast cancers compared to normal tissues (Table 1). This means that the downregulated lncRNAs may be the tumor suppressor genes in the normal tissues. Only five lncRNAs (HOTAIR, DLEU2, THAP9-AS1, POLR2J4, and LINC01614) were upregulated (Figures 3(e)–3(f)). LncRNA HOTAIR has been proven to play a significant role in promoting invasion of breast cancer cells [38]. DLEU2 was reported to play a role in response to estrogen regulation in human breast cancer cell line [39]. POLR2J4 has also been used to predict recurrence-free survival in hepatocellular carcinoma [40]. Knockdown of LINC01614 inhibits lung adenocarcinoma cell progression [41]. THAP9-AS1 was the first time observed to be associated with breast cancer.
Figure 2

Identification of differentially expressed coding genes and lncRNAs among the four molecular subtypes of breast cancers. Hierarchical analysis showed differentially expressed coding genes and lncRNAs compared to normal tissue samples. The upper chart was for DEGs and the bottom for DElncRNAs (a–d). TN breast cancers identified 3,514 DEGs and 96 DElncRNAs (a). HER2 breast cancers identified 3,181 DEGs and 87 DElncRNAs (b). Luminal A breast cancers identified 2,198 DEGs and 60 DElncRNAs (c). Luminal B breast cancers identified 2,841 DEGs and 70 DElncRNAs (d).

Figure 3

Venn diagram of DElncRNAs among the four molecular subtypes of breast cancer. Venn diagram shows the interactions of DElncRNAs among the four different molecular subtypes of breast cancer (a). The expression of five upregulated DElncRNAs in the four different subtypes of breast cancer and normal tissues was plotted by ggstatsplot (b–f). The expression values of upregulated DElncRNAs were shown as HOTAIR (b), DLEU2 (c), THAP9-AS1 (d), POLR2J4 (e), and LINC01614 (f).

Table 1

The fold changes of core 37 DElncRNAs in breast cancer compared to normal tissue.

lncRNA symbollncRNA Ensembl IDTNHer2 positiveLuminal A positiveLuminal B positive
Fold changeP.ValFold changeP.ValFold changeP.ValFold changeP.Val
DLEU2ENSG000002316072.172.84E‐062.042.43E‐052.211.32E‐052.102.39E‐06
HOTAIRENSG000002286303.144.22E‐037.905.11E‐092.512.61E‐025.136.74E‐06
LINC01614ENSG000002308385.102.32E‐0711.353.48E‐1111.185.68E‐1016.971.30E‐11
POLR2J4ENSG000002147833.445.91E‐103.176.65E‐103.137.34E‐112.872.66E‐09
THAP9-AS1ENSG000002510222.681.17E‐062.442.61E‐052.811.58E‐072.438.58E‐06
AC005064.1ENSG00000234715-3.032.72E‐09-3.081.10E‐07-2.051.41E‐03-2.902.47E‐08
AC099342.1ENSG00000226097-4.042.82E‐14-4.002.79E‐11-3.792.66E‐10-3.741.84E‐10
AC245123.1ENSG00000277526-2.212.04E‐12-2.181.69E‐10-2.281.34E‐10-2.233.34E‐10
ACVR2B-AS1ENSG00000229589-3.052.04E‐15-3.221.69E‐13-2.881.17E‐10-3.022.71E‐12
ADAMTS9-AS2ENSG00000241684-5.701.03E‐16-5.035.99E‐13-3.066.45E‐07-5.242.51E‐14
AP001816.1ENSG00000254531-2.421.90E‐11-2.104.98E‐06-2.098.57E‐09-2.051.05E‐06
AP002518.1ENSG00000256195-2.104.86E‐09-2.208.35E‐08-2.161.76E‐07-2.201.34E‐07
AP003110.1ENSG00000255084-2.031.19E‐07-2.021.98E‐06-2.101.27E‐07-2.041.60E‐06
CARMNENSG00000249669-6.184.66E‐15-4.672.23E‐11-3.027.51E‐09-4.101.78E‐11
FGF14-AS2ENSG00000272143-2.232.36E‐06-2.901.26E‐13-2.412.13E‐09-2.579.86E‐10
FZD10-AS1ENSG00000250208-2.627.84E‐14-2.663.70E‐14-2.087.17E‐09-2.577.81E‐12
GAS1RRENSG00000226237-2.386.27E‐09-2.683.76E‐10-2.181.95E‐08-2.436.91E‐09
GLIDRENSG00000278175-2.895.85E‐11-3.651.61E‐11-2.023.05E‐07-2.991.61E‐11
HOTAIRM1ENSG00000233429-3.594.33E‐08-3.809.15E‐09-3.392.07E‐09-3.898.96E‐12
HSD11B1-AS1ENSG00000227591-5.503.99E‐13-5.871.55E‐10-5.656.09E‐10-5.941.42E‐10
LINC00358ENSG00000229578-2.257.11E‐11-2.271.52E‐08-2.196.90E‐08-2.216.47E‐08
LINC00408ENSG00000226250-6.381.23E‐22-6.152.14E‐18-4.859.77E‐14-5.026.61E‐15
LINC00968ENSG00000246430-2.171.36E‐06-2.201.63E‐05-2.182.13E‐05-2.211.29E‐05
LINC01140ENSG00000267272-3.152.57E‐07-3.203.98E‐06-2.151.27E‐03-2.658.56E‐06
LINC01192ENSG00000241369-2.382.37E‐08-2.426.29E‐07-2.514.75E‐07-2.572.07E‐07
LINC01697ENSG00000232079-5.657.26E‐18-5.657.39E‐15-4.531.53E‐09-5.737.03E‐15
LINC01720ENSG00000231175-2.051.58E‐11-2.061.88E‐09-2.023.89E‐09-2.031.08E‐09
LINC02053ENSG00000241696-5.706.69E‐18-5.531.07E‐13-5.596.05E‐13-5.072.61E‐12
LINC02535ENSG00000234155-2.098.56E‐11-2.135.70E‐09-2.156.59E‐09-2.121.07E‐08
MEG3ENSG00000214548-6.393.25E‐15-3.673.67E‐06-2.379.12E‐05-3.641.20E‐07
MIR497HGENSG00000267532-2.692.77E‐17-2.821.69E‐14-2.341.84E‐11-2.683.47E‐13
PGM5-AS1ENSG00000224958-2.352.44E‐06-3.224.17E‐09-2.185.62E‐07-2.354.72E‐06
PRICKLE2-AS3ENSG00000226017-2.262.07E‐09-2.136.53E‐07-2.151.11E‐06-2.319.74E‐08
SNHG29ENSG00000175061-3.475.31E‐10-4.405.21E‐09-2.372.28E‐07-3.575.97E‐09
TRHDE-AS1ENSG00000236333-2.683.80E‐08-2.871.82E‐08-2.631.08E‐07-2.683.80E‐08
WARS2-AS1ENSG00000231365-2.211.39E‐11-2.491.46E‐12-2.185.92E‐09-2.314.35E‐09
ZNF436-AS1ENSG00000249087-2.881.03E‐10-3.102.83E‐10-2.576.59E‐09-2.752.95E‐09

The five DElncRNAs upregulated in all the four breast cancer subtypes were marked with “∗”.

3.2. WGCNA Analysis of the Coding Genes and lncRNAs

The combination of these differentially expressed coding genes and lncRNAs formed a matrix dataset including 6,414 genes and 134 lncRNAs among 141 samples. The network was constructed with the soft threshold power of 7 (Figure 4(a)). We identified 15 gene modules based on a minimum module size of 30 merged modules with eigengene correlation above 0.85. Modules related to different breast cancer subtypes are shown in Figure 4(b). A gene dendrogram was obtained using average linkage hierarchical clustering. The color row was painted according to the expression values of genes in the dendrogram. The turquoise module was highly associated with TN, and midnight blue was related to HER2 and luminal B breast cancers. Luminal B breast cancer was highly associated with the grey modules compared to other subtypes of breast cancers. Luminal A has three relatively high modules including black, green, and magenta. Luminal A and luminal B have the same relationship with the green modules (Figure 4(c)). These related modules will provide more information for subtyping of different breast cancers. In addition, the brown and blue modules were, respectively, positively and negatively correlated with normal tissue samples. The 15 gene modules eigengenes were classified into three groups using hierarchical clustering (Figure 4(d)).
Figure 4

Fifteen gene modules were identified based on differentially expressed coding genes and lncRNAs. Selection of soft thresholding powers for WGCNA (a). Red line corresponds to 0.9. As the lowest power satisfies the approximate scale-free topology criterion, number 7 was interpreted as a soft threshold of the correlation matrix (a). Heat map of gene dendrogram assigned to the four different subtypes of breast cancer. The color row was painted according to expression values of genes in the dendrogram (b). Relationships of module eigengenes and different breast cancer subtypes showed in these datasets. Each row in the table corresponds to the identified 15 gene modules, and each column corresponds to the sample types. The correlations of the corresponding module eigengenes, samples, and p values were labelled in the table. The colors of the tables were painted corresponding to the correlation between gene modules and samples (c). 15 gene module eigengenes were classified into three groups by hierarchical clustering (d).

3.3. Construction of Breast Cancer-Related lncRNA-mRNA Coexpression Networks

Based on the Pearson correlation coefficients of lncRNA-mRNA, we constructed breast cancer-related lncRNA-mRNA coexpression networks using WGCNA. The threshold for the selection of lncRNA-mRNA pairs was set at an absolute value of the Pearson correlation coefficient above 0.6. The TN-related lncRNA-mRNA coexpression network derived from the turquoise module consists of 19 lncRNAs and 218 DEGs (Figure 5(a)). Maternally expressed gene 3 (MEG3) was the hub of lncRNA in the constructed network with 219 interaction links. Previous studies showed that MEG3 was downregulated in various types of cancers including breast cancer. Meta-analysis demonstrated that low expression of MEG3 is associated with poor survival in cancer patients [42]. GO analysis of MEG3 linked 219 genes, which enriched cell adhesion, cell proliferation, immune system processes, and other processes. The network could provide useful information for the mechanism of MEG3. CEP55, MELK, and KIF11 were the top 3 degree DEGs in the network. CEP55 is a determinant of cell fate during perturbed mitosis in breast cancer [43].
Figure 5

Results of different breast cancer subtype-related lncRNA-mRNA coexpression networks. The coexpression networks of TN-related module—turquoise (a), luminal A- and luminal B-related module—green (b), normal-related module—brown (c), and normal negative-related module—blue (d). Nodes with a V shape arrow are lncRNAs. Nodes with ellipse are mRNAs. Key lncRNAs and mRNAs in the network are, respectively, highlighted in yellow and red colors. The size of the nodes corresponds to their degrees.

Luminal A- and luminal B-related lncRNA-mRNA coexpression network derived from the green module consists of 9 lncRNAs and 144 DEGs. LINC01116, LINC01087, AC110619, FOXP4-AS1, and TP53TG1 were the top nodes in this network, while MLPH, AGR2, SMIM4, FOXA1, INPP4B, MAGED, P4HTM, and AGR3 mediated the connection of the whole network. Normal tissue-related lncRNA-mRNA coexpression network derived from the brown module consists of 23 lncRNAs and 405 DEGs. The top nodes in this network were ADMTS9-AS2, HSD11B1-AS1, LINC01697, MIR497HG, LINC01140, and CARMN. BTNL9 and TNS1 were the top degree DEGs in the constructed network. TNS1 modulated the activation of Cdc42 to regulate cell invasion in breast cancer [44]. The nonnormal-related lncRNA-mRNA coexpression network derived from the blue module was negatively related with normal samples. This cancer-related lncRNA-mRNA coexpression network consists of 38 lncRNAs and 496 DEGs. LINC00408, AC099342.1, LINC02053, LINC01720, and AC245123.1 were the top nodes in this network, while DEFB106A, LECT2, TRIM49, BOLL, OPHN1, ZNF835, and EPPIN were the top degree DEG nodes in the network.

3.4. Function Annotation of Breast Cancer-Related lncRNAs

We performed function annotation of these constructed lncRNA-mRNA coexpression networks using the DAVID system. The TN-related lncRNA-mRNA coexpression network was involved in cell cycle control and cell proliferation (Figure 6(a)). The current result is consistent with results from previous studies, which showed that overexpression of MEG3 suppresses the proliferation of breast cancer cells [45]. The luminal A- and luminal B-related lncRNA-mRNA coexpression network was involved in carboxylic acid and organic acid transport or metabolic processes and the response to hormone stimulus (Figure 6(b)). That may be the reason why prognosis of these two subtypes of breast cancer is better than that of others. GO analysis of the normal-related lncRNA-mRNA coexpression network enriched blood vessel development and morphogenesis and the angiogenesis term (Figure 6(c)). That means the brown module is essential for normal tissue maintenance. GO analysis of the normal negative-related lncRNA-mRNA coexpression networks enriched protein transport terms such as Golgi vesicle transport and vesicle organization (Figure 6(d)). These lncRNAs interact with coding genes and mediate cancer cell protein transport and cellular morphology during tumorigenesis.
Figure 6

Function annotation of identified DEG-DElncRNA coexpression network. GO analysis shows enriched GO term in TN-related lncRNA-mRNA coexpression networks (a), luminal A- and luminal B-related lncRNA-mRNA coexpression networks (b), normal-related lncRNA-mRNA coexpression network (c), and normal negative-related lncRNA-mRNA coexpression network (d).

3.5. DElncRNAs Act as Survival Expectation Indictors Verified Using Public Datasets

The TN breast cancer samples of dataset GSE58812 were classified into three clusters based on the expression of these DElncRNAs (Figure 7(a)). According to the clinical data of these TN cancer samples, survival analysis of the three clusters displayed different survival probabilities. The KM curves with a p value of pairwise comparisons for the three clusters are shown in Figure 7(b). The survival probability of cluster 3 was 50% lower than those of cluster 1 or 2 (75%). Five blocks that acted as signature predictors of survival in TN breast cancer patients were significantly displayed using heat map analysis (Figure 7(c)). This means that different DElncRNAs play different roles in TN breast cancer progression by promoting or inhibiting cancer progression. The DElncRNAs THAP9-AS1, MALAT1, TUG1, CAHM, FAM2011, NNT-AS1, COX10-AS1, and RPARP-AS1 were lowly expressed in cluster 3 compared with those of cluster 1 and 2 samples. The analysis of the TCGA database showed that higher expression of TUG1 was associated with better prognosis in breast cancer patients [46]. This is consistent with our bioinformatics prediction. LINC00911, CSMD2-AS1, LINC01192, SNHG19, DSCAM-AS1, PCAT4, ACVR28-AS1, and CNTFR-AS1 were highly expressed in cluster 3. The relative expression values of DElncRNAs can be used as a predictor for survival expectation.
Figure 7

Verification of the inductive roles of identified DElncRNAs based on public datasets. The TN cancer samples were classified into three clusters (a). Survival analysis of the three clusters showed different survival probability (b). The panels of identified 97 lncRNAs act as signature predictors of survival of TN breast cancer patients (c).

In order to detect the prediction roles of identified 97 lncRNAs individually, we performed survival analysis of single gene expression and TN breast cancer prognosis. The prediction with significant p values between high and low expression values is displayed in Figure 8. High expression of LINC00911 (Figure 8(a)), H19 (Figure 8(b)), and MIRLRT7BHG (Figure 8(c)), respectively, predicted poor survival in TN breast cancer. High expression of MIR155HG (Figure 8(d)), CHAM (Figure 8(e)), and FAM13A-AS1 (Figure 8(f)), respectively, were associated with higher survival probability. This was consistent with the above panel prediction. The individual expression of LINC00911, H19, and MIRLRT7BHG can be used as a predictor for poor prognosis. MIR155HG, CHAM, and FAM13A-AS1 can be used as a positive predictor for survival probability.
Figure 8

The prediction with significant p values between high and low expression values with TN breast cancer prognosis. High expression of LINC00911 (a), H19 (b), and MIRLRT7BHG (c) predicts poor survival in TN breast cancer. High expression of MIR155HG (d), CHAM (e) and FAM13A-AS1 (f), respectively, is associated with higher survival probability.

3.6. The Coexpressing Coding Genes of Survival Expectation Indictors Extracted from the Above lncRNA-mRNA Coexpression Networks

From the above identified TN, luminal A, luminal B, and normal tissue-associated lncRNA-mRNA coexpression networks, useful interaction information of lncRNAs, and coding genes could be found. LncRNAs LINC00911, LINC01192, PCAT4, and CNTFR-AS1 were listed in the normal samples that were negatively related to the blue module. Extraction of the lncRNA-mRNA interaction network is displayed in Figure 9(a). The four highly expressed lncRNAs associated with 31 coding genes formed a coexpression network. The top degree coding genes were painted with red colors. Several coding genes have been found playing important roles in recurrence, poor diagnosis, and low survival probabilities. Multivariable analyses revealed that LECT2 as one of the predictors for both breast carcinoma recurrence and mortality among smokers [47]. Differentially expressed ZNF835 was associated with ethnicity in colorectal cancer patients [48]. The positive of oligophrenin-1 was significantly correlated with a high Gleason score in prostate cancer [49]. Methyl-DNA binding domain capture technique identified Kcnv1 as a diagnostic marker for early noninvasive detection and subsequent breast cancer surveillance [50]. Brain-derived neurotrophic factor (BDNF) is a potent neurotrophic factor that has been shown to stimulate breast cancer cell growth and metastasis via tyrosine kinase receptors. The methylation of BDNF gene may be a biomarker for suicidal thoughts in patients with breast cancer [51]. BDNF/TrkB pathways activated microRNAs to act as prognostic and predictive biomarkers for detecting patients at a high risk of developing breast cancer [52]. This suggested that lncRNA interacted with Kcnv1 modification methylation levels to mediate survival in TN breast cancer. Increasing expression of SCD5 promoted tumor cell survival in breast cancer [53]. Phosphorylation of hnRNPK S379 participates in the regulation of the migration of triple-negative MDA-MB-231 cells via the EMT signaling pathway [54]. Although there is no information about the function of YWHA, but upregulation of one member of YWHAZ (also known as 14-3-3ζ) had been found associated with poor clinical outcome in breast cancer [55].
Figure 9

The coexpressing coding genes of survival expectation indictors extracted from the identified lncRNA-mRNA coexpression networks. The four highly expressed lncRNAs associated with 31 coding genes to form a coexpression network (a). Top degree coding genes were painted with red color. The splicing factor SRSF10 bridged those two lowly expressed lncRNAs THAP9-AS1 and TUG1 mediated coexpression network (b). SRSF10 was painted with yellow color.

Cluster 3 lowly expressed lncRNAs, THAP9-AS1 and TUG1, were also listed in the normal samples negatively related to the blue module. Extraction of the lncRNA-mRNA interaction network displayed that the splicing factor SRSF10 bridged those two lowly expressed lncRNAs to mediate the coexpression network (Figure 9(b)). SRSF10, as a splicing factor, has been found playing important roles in colon and cervical cancer oncogenesis by mediating alternative splicing [56, 57]. Whether SRSF10 plays a similar role in breast cancer or not requires more experimental confirmation. The survival expectation indictors associated with coexpressing coding genes formed coexpressed axes, which could act as attractive therapeutic targets for the treatment of TN breast cancer.

4. Discussion

Although deaths from breast cancer have declined over time, its molecular mechanism remains to be revealed. Owing to the notorious heterogeneity of breast cancers, it is meaningful to not only classify them based on clinical and morphologic characteristics but also consider intrinsic molecular signatures. Compared with other breast cancers, TN breast cancer was characterized as young onset, high malignancy, easy recurrence, and low survival rates [58]. In addition, TN breast cancer lacks ER and PR as well as HER2 receptor. There are no effective targets of endocrine therapy and targeted therapy for TN breast cancer. It is necessary to identify new molecular targets for TN breast cancer therapy. In our work, we aimed to identify key lncRNAs associated with TN, HER2, luminal A, and luminal B breast cancers. In total, 6,414 coding genes and 134 lncRNAs were found to be dysregulated in different subtypes of breast cancers. A core of 37 lncRNAs was found to be dysregulated in all the four subtypes of breast cancers. Subtypes of breast cancer special modules and lncRNA-mRNA interaction networks were obtained using WGCNA. Moreover, survival analysis of public datasets (GSE58812) verified the identified lncRNAs, which could act as indicators for triple-negative breast cancer prognosis. Studies on lncRNAs in breast cancer are still in the early stages, and the roles of lncRNAs in breast cancer remain to be elucidated. In the notorious heterogeneity of breast cancers, there is still a core of 37 lncRNAs which dysregulated in all the four major subtypes of breast cancers. Most of the core 37 lncRNAs were downregulated compared to the normal tissues. Only five lncRNAs were upregulated in breast cancer. This indicates that the downregulated lncRNAs may be the tumor suppressor genes in normal tissues. For example, MEG3 is listed in the 32 downregulated core lncRNAs as being downregulated in various types of cancers including breast cancer. Meta-analysis demonstrated that low expression of MEG3 is associated with poor survival in cancer patients [42]. The TN-related lncRNA-mRNA coexpression network derived from the turquoise module displayed MEG3 coexpression pairs. CEP55, MELK, and KIF11 were the top three high degree DEGs in the network. CEP55 is a determinant of cell fate during perturbed mitosis in breast cancer [43]. MELK acts as a potential therapeutic target for TN breast cancer and other aggressive malignancies [59]. A recent study demonstrated that MELK can be upregulated by LINC02418 in colorectal cancer [60]. Recently, a few lncRNAs were reported to function as scaffold molecules recruiting chromatin-modifying complexes to regulate gene expression [61]. lncRNAs can also act as miRNA sponge to reverse miRNA suppression of its target genes [62]. Constructions of breast cancer-related lncRNA-mRNA coexpression networks could provide useful information for lncRNA-mRNA interaction mechanism in breast cancer progression. In this study, we found SRSF10 bridged THAP9-AS1 and TUG1 to mediate the coexpression network. As the splicing factor function of SRSF10, the low expression of THAP9-AS1 and TUG1 in cluster 3 may be related to alternative splicing. lncRNAs may serve as biomarkers for diagnostic and prognostic purposes and also as potential therapeutic targets in cancer. Several studies have provided insights into the potential clinical implications of lncRNAs in breast cancer. For example, DSCAM-AS1 mediates tumor progression and tamoxifen resistance [63] and HOTAIR reprograms the polycomb repressive complex binding pattern in breast cancer [64]. The relative expression values of DElncRNAs can be used as predictors for survival expectation in TN breast cancer. Interestingly, the expressions of THAP9-AS1 and DLEU2 were relatively lower in cluster 3 and were the only five upregulated lncRNAs in breast cancer samples. There is a paradox between panel lncRNAs and individual lncRNAs prediction of survival probability. H19 and MIRLRT7BHG lined in the last block which means high expression predict high survival probability. High expression of H19 and MIRLRT7BHG was significant related with poor prognosis, since H19 is an estrogen-inducible gene and plays a key role in cell survival and especially in estrogen-induced cell proliferation such as MCF-7 cells [65]. The expression pattern of H19 in TN breast cancers may be different from ER-positive breast cancer. Disclosing the expression pattern of H19 will provide clues for TN breast cancer diagnosis and progression. Considering the notorious heterogeneity of breast cancers, exact classification needs more molecular markers. In addition, more efforts are needed to detect the relationship between the expressions of these lncRNAs and breast cancer progression. Jiang and colleagues developed a reliable tool to predict tumor recurrence and the benefit of taxane chemotherapy based on an integrated mRNA-lncRNA signature in TN breast cancers [66]. Moreover, further validation in prospective clinical trials of those identified lncRNAs could provide new therapy strategy for the treatment of TN breast cancers. In addition, a huge number of microarray expression profiles of cancer samples are deposited in the public database. With the improvement of annotation information especially, more and more noncoding genes have been found playing important roles in carcinogenesis; mining of those microarray datasets is necessary and would provide novel knowledge about cancer diagnosis and treatment. We should also point out that there were some limitations in this study. First, the datasets used to identify the differentially expressed DEGs and DElncRNAs were downloaded from a public database and were not validated by quantitation experiments. Second, due to the limit of the sample size, the KM curves with a p value of pairwise comparisons for the three TN breast cancer clusters were not statistically different enough. Therefore, further validation of the roles of these lncRNAs would be required in future experiments.

5. Conclusions

In conclusion, we identified 134 lncRNAs to be closely related to different subtypes of breast cancers. Based on WGCNA analysis, we constructed breast cancer-related lncRNA-mRNA coexpression networks. The constructed lncRNA-mRNA coexpression networks provided useful information for uncovering the mechanism of lncRNAs in breast cancer genesis. Bioinformatics analysis revealed that lncRNA-mRNA coexpression networks were involved in cell cycle checkpoint, carboxylic acid and organic acid transport, metabolic processes, response to hormone stimulus, Golgi vesicle transport, and vesicle organization. The survival analysis of public datasets verified the identified lncRNAs, which acted as indicators for TN breast cancer prognosis. These survival expectation indictors are associated with coexpressing coding genes to form coexpressed axes, which could act as attractive therapeutic targets for the treatment of TN breast cancer.
  64 in total

1.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

2.  Transcriptome Analysis of Triple-Negative Breast Cancer Reveals an Integrated mRNA-lncRNA Signature with Predictive and Prognostic Value.

Authors:  Yi-Zhou Jiang; Yi-Rong Liu; Xiao-En Xu; Xi Jin; Xin Hu; Ke-Da Yu; Zhi-Ming Shao
Journal:  Cancer Res       Date:  2016-03-03       Impact factor: 12.701

3.  Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications.

Authors:  T Sørlie; C M Perou; R Tibshirani; T Aas; S Geisler; H Johnsen; T Hastie; M B Eisen; M van de Rijn; S S Jeffrey; T Thorsen; H Quist; J C Matese; P O Brown; D Botstein; P E Lønning; A L Børresen-Dale
Journal:  Proc Natl Acad Sci U S A       Date:  2001-09-11       Impact factor: 11.205

4.  Molecular interplay of the noncoding RNA ANRIL and methylated histone H3 lysine 27 by polycomb CBX7 in transcriptional silencing of INK4a.

Authors:  Kyoko L Yap; Side Li; Ana M Muñoz-Cabello; Selina Raguz; Lei Zeng; Shiraz Mujtaba; Jesús Gil; Martin J Walsh; Ming-Ming Zhou
Journal:  Mol Cell       Date:  2010-06-11       Impact factor: 17.970

5.  limma powers differential expression analyses for RNA-sequencing and microarray studies.

Authors:  Matthew E Ritchie; Belinda Phipson; Di Wu; Yifang Hu; Charity W Law; Wei Shi; Gordon K Smyth
Journal:  Nucleic Acids Res       Date:  2015-01-20       Impact factor: 16.971

6.  Estrogen regulation in human breast cancer cells of new downstream gene targets involved in estrogen metabolism, cell proliferation and cell transformation.

Authors:  J A Vendrell; F Magnino; E Danis; M J Duchesne; S Pinloche; M Pons; D Birnbaum; C Nguyen; C Theillet; P A Cohen
Journal:  J Mol Endocrinol       Date:  2004-04       Impact factor: 5.098

7.  Identification of the copy number variant biomarkers for breast cancer subtypes.

Authors:  Xiaoyong Pan; XiaoHua Hu; Yu-Hang Zhang; Lei Chen; LiuCun Zhu; ShiBao Wan; Tao Huang; Yu-Dong Cai
Journal:  Mol Genet Genomics       Date:  2018-09-10       Impact factor: 3.291

8.  hnRNPK S379 phosphorylation participates in migration regulation of triple negative MDA-MB-231 cells.

Authors:  Hsin-Yu Tsai; Shu-Ling Fu; Ling-Ming Tseng; Jen-Hwey Chiu; Chao-Hsiung Lin
Journal:  Sci Rep       Date:  2019-05-20       Impact factor: 4.379

9.  Amplification of LAPTM4B and YWHAZ contributes to chemotherapy resistance and recurrence of breast cancer.

Authors:  Yang Li; Lihua Zou; Qiyuan Li; Benjamin Haibe-Kains; Ruiyang Tian; Yan Li; Christine Desmedt; Christos Sotiriou; Zoltan Szallasi; J Dirk Iglehart; Andrea L Richardson; Zhigang Charles Wang
Journal:  Nat Med       Date:  2010-01-24       Impact factor: 53.440

10.  RNA sequencing identifies specific PIWI-interacting small non-coding RNA expression patterns in breast cancer.

Authors:  Adnan Hashim; Francesca Rizzo; Giovanna Marchese; Maria Ravo; Roberta Tarallo; Giovanni Nassa; Giorgio Giurato; Gianluca Santamaria; Angela Cordella; Concita Cantarella; Alessandro Weisz
Journal:  Oncotarget       Date:  2014-10-30
View more
  21 in total

1.  A 9-LncRNA Signature for Predicting Prognosis and Immune Response in Diffuse Large B-Cell Lymphoma.

Authors:  Xiaoxuan Wang; Yaxiao Lu; Ziyi Liu; Yidan Zhang; You He; Cong Sun; Lanfang Li; Qiongli Zhai; Bin Meng; Xiubao Ren; Xudong Wu; Huilai Zhang; Xianhuo Wang
Journal:  Front Immunol       Date:  2022-07-06       Impact factor: 8.786

2.  Novel lncRNAs with diagnostic or prognostic value screened out from breast cancer via bioinformatics analyses.

Authors:  Hongxian Wang; Lirong Shu; Nan Niu; Chenyang Zhao; Shuqi Lu; Yanhua Li; Huanyu Wang; Yao Liu; Tianhui Zou; Jiawei Zou; Xiaoqin Wu; Yun Wang
Journal:  PeerJ       Date:  2022-07-14       Impact factor: 3.061

3.  Survival-Associated Metabolic Genes and Risk Scoring System in HER2-Positive Breast Cancer.

Authors:  Chundi Gao; Huayao Li; Chao Zhou; Cun Liu; Jing Zhuang; Lijuan Liu; Changgang Sun
Journal:  Front Endocrinol (Lausanne)       Date:  2022-05-19       Impact factor: 6.055

4.  Comprehensive Analysis of a Ferroptosis-Related lncRNA Signature for Predicting Prognosis and Immune Landscape in Osteosarcoma.

Authors:  Yiming Zhang; Rong He; Xuan Lei; Lianghao Mao; Zhengyu Yin; Xinyu Zhong; Wenbing Cao; Qiping Zheng; Dapeng Li
Journal:  Front Oncol       Date:  2022-06-28       Impact factor: 5.738

5.  Identification of NUF2 and FAM83D as potential biomarkers in triple-negative breast cancer.

Authors:  Xiuming Zhai; Zhaowei Yang; Xiji Liu; Zihe Dong; Dandan Zhou
Journal:  PeerJ       Date:  2020-09-21       Impact factor: 2.984

6.  Ginsenoside Rg3 Prevents Oncogenic Long Noncoding RNA ATXN8OS from Inhibiting Tumor-Suppressive microRNA-424-5p in Breast Cancer Cells.

Authors:  Heejoo Kim; Hwee Won Ji; Hyeon Woo Kim; Sung Hwan Yun; Jae Eun Park; Sun Jung Kim
Journal:  Biomolecules       Date:  2021-01-18

7.  THAP9-AS1/miR-133b/SOX4 positive feedback loop facilitates the progression of esophageal squamous cell carcinoma.

Authors:  Jiwei Cheng; Haibo Ma; Ming Yan; Wenqun Xing
Journal:  Cell Death Dis       Date:  2021-04-14       Impact factor: 8.469

8.  LncRNA SNHG19 Promotes the Development of Non-Small Cell Lung Cancer via Mediating miR-137/E2F7 Axis.

Authors:  Guang-Yin Zhao; Zhao-Feng Ning; Rui Wang
Journal:  Front Oncol       Date:  2021-03-25       Impact factor: 6.244

9.  Complement C7 is Specifically Expressed in Mesangial Cells and is a Potential Diagnostic Biomarker for Diabetic Nephropathy and is Regulated by miR-494-3p and miR-574-5p.

Authors:  Hang Guo; Zhiyue Yan; Yonghui Hu; Xitong Huang; Congqing Pan
Journal:  Diabetes Metab Syndr Obes       Date:  2021-07-05       Impact factor: 3.168

10.  DNMT1 Mediated CAHM Repression Promotes Glioma Invasion via SPAK/JNK Pathway.

Authors:  Yadi Xu; Zelin Li; Tian Huai; Xiuhao Huo; Hongliang Wang; Erbao Bian; Bing Zhao
Journal:  Cell Mol Neurobiol       Date:  2021-07-06       Impact factor: 4.231

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.