Literature DB >> 31059025

Comprehensive analysis of the aberrantly expressed lncRNA‑associated ceRNA network in breast cancer.

Tayier Tuersong1, Linlin Li1, Zumureti Abulaiti2, Shumei Feng3.   

Abstract

Previous studies have suggested that long non‑coding RNAs (lncRNAs) are closely associated with human diseases, particularly cancer, including cancer of the lung, breast and stomach. A variety of lncRNAs are abnormally expressed in cancer and participate in several pathways including cell proliferation and apoptosis; these elements are closely associated with the development of cancer. The Cancer Genome Atlas (TCGA) is an important cancer database. It consists of clinical data, genomic variation, mRNA, microRNA (miRNA) and lncRNAs expression, methylation and other data for various types of human cancer. In the present study, differential expression of RNA was identified using the edgeR package. A total 1,222 RNA sequencing profiles from patients with breast cancer were downloaded from TCGA. A competing endogenous RNA (ceRNA) network was constructed for breast cancer based on miRcode and miRTarBase. The top 10 lncRNAs were selected using Cox regression analysis. Survival analysis was performed using Kaplan‑Meier analysis. A total of 1,028 breast cancer‑associated lncRNAs and 89 miRNAs (fold change >2; P<0.05) were identified; among these, 93 lncRNAs and 19 miRNAs were included in the ceRNA network. Subsequently, 10 basic lncRNAs were selected and their associations with overall survival were identified. In addition, 5 lncRNAs (ADAM metallopeptidase with thrombospondin type 1 motif 9‑antisense RNA 1, AL513123.1, chromosome 10 open reading frame 126, long intergenic non‑protein coding RNA 536 and Wilms tumor 1 antisense RNA) were identified to be significantly associated with overall survival (P<0.05, log rank test). These results suggested that mRNAs, lncRNAs and miRNAs were involved in pathological mechanisms of breast cancer. The newly‑identified ceRNA network included 93 breast cancer‑specific lncRNAs, 19 miRNAs and 27 mRNAs. The results of the present study highlight the potential of lncRNAs in understanding the development and pathogenesis of breast cancer, and suggest novel concepts and an experimental basis for the identification of prognostic biomarkers and therapeutic targets for breast cancer.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31059025      PMCID: PMC6522813          DOI: 10.3892/mmr.2019.10165

Source DB:  PubMed          Journal:  Mol Med Rep        ISSN: 1791-2997            Impact factor:   2.952


Introduction

Breast cancer is one of the most widespread types of cancer in females, and its incidence is increasing yearly. It is expected that by 2018, the United States of America will have ~266,120 incident cases of invasive breast cancer and 63,960 cases of in situ breast cancer. Furthermore, it is estimated that 40,920 women will succumb to breast cancer (1). Similarly, in China, the incidence of breast cancer is the primary cause of mortality in women <45 years, followed by colorectal, liver and esophageal cancer (2). Following systemic treatment, patients with breast cancer continue to experience recurrence and metastasis, and the majority of patients eventually succumb to metastases. Therefore, it is important to identify diagnostic and prognostic markers, and potential therapeutic targets (3,4). The present study explored the mechanisms through which long non-coding RNAs (lncRNAs) act as competing endogenous RNAs (ceRNAs) to regulate target genes and participate in the prognosis and pathogenesis of breast cancer. Over previous decades, 70–90% of the transcribed human genome has been identified. Relevant data indicate that the proportion of genes encoding proteins in the genome is >2%, and non-coding RNA accounts for the majority of the human transcriptome (5). Non-coding RNAs are a large class of RNA molecules that do not encode proteins, but which serve regulatory roles. Non-coding RNAs may be divided into three classes, by length: <50 nucleotides (nt), including microRNA (miRNA), small interfering RNA and Piwi-interacting RNA; 50–500 nt, including ribosomal RNA, transfer RNA, small nuclear RNA, small nucleolar RNA and lncRNA; >500 nt, including long mRNA-like non-coding RNAs and lncRNAs without polyA tails (6). lncRNA is a generic term for a class of RNA molecules with lengths of >200 nucleotides, and is one of the most active fields of study in molecular biology at present. lncRNAs regulate the gene expression of tumor cells through multiple modes of action, and therefore are widely involved in the occurrence and metastasis of tumors (7–9). lncRNAs are thought to serve an important role in the development of cancer; however, only a few have been well-characterized regarding their functional roles (10). Differentially expressed ceRNAs have been identified in a number of diseases, particularly in cancer. Previous studies have demonstrated that ceRNAs affect the proliferation, growth, differentiation, apoptosis and other biological behaviors of cancer cells (11–13). miRNA response elements (MREs) are partially complementary sequences located on mRNAs that bind to miRNAs, and may inhibit the expression of miRNA target genes. miRNAs regulate hundreds of mRNAs, and one miRNA may be regulated by hundreds of mRNAs. Numerous types of RNA molecules constitute a ceRNA regulatory network, and the more MREs that are shared between them, the greater the potential for communication or co-regulation of a target gene (14). Theoretically, almost every RNA molecule possesses at least one MRE binding site that binds to an miRNA, thereby forming a ceRNA association with the corresponding miRNA (15). Among these, the interaction mechanisms between lncRNA and miRNA include: miRNA binding to and degrading lncRNA; lncRNA acting as miRNA sponge by binding to adsorbed miRNA; and truncation of lncRNA to generate miRNA (16). There is evidence that lncRNAs interact with miRNAs and act as ceRNAs to regulate the expression of miRNAs. CeRNAs serve important roles together with cancer-associated genes, and they have demonstrated potential in clinical tumor diagnosis and treatment (17,18). As the largest database of cancer gene information currently available, The Cancer Genome Atlas (TCGA) includes 33 types of cancers in situ, advancing the understanding of the molecular basis of the onset of these diseases and improving the ability to diagnose, treat and prevent cancer. TCGA comprises multiple levels of tumor data, including genomic, transcriptomic, proteomic and epigenetic data (19). In the present study, according to the analysis of RNA expression profiles among 1,109 breast cancer tumor and 113 non-tumor tissue samples, a lncRNA-miRNA-mRNA regulatory network was successfully constructed. Furthermore, Cox regression analysis was used to select the top 10 differentially expressed lncRNAs (DElncRNAs), to understand the function of lncRNAs and their potential mechanisms in breast cancer. Survival analyses were also performed to identify prognostic genes.

Materials and methods

Study population

RNA sequencing data was downloaded from TCGA (https://www.cancergenome.nih.gov/). As of February 19, 2018, a total of 1,092 breast cancer cases were collected in the database. Univariate Cox analysis was performed using the library (survival) package in R software (version 3.4.3; R: A Language and Environment for Statistical Computing) (20) from the TCGA database. The present study adhered to the TCGA publication guidelines.

Screening differentially expressed genes

Breast cancer mRNA and miRNA sequencing data were derived from 1,222 samples, including 1,109 tumor samples (cohort Tumor) and 113 normal samples (cohort Normal). Normal sample and tumor sample data were then merged, and all expression values equal to zero were removed. Comparing the data from normal tissues with breast cancer samples, Perl language was used to extract the mRNA matrix file data from the RNA sequencing data (lncRNA, mRNA) downloaded by TCGA in the Windows Environment, running the program with cmd.exe, and then the data were converted into ensemble ID to obtain the gene symbol matrix file. The gene symbol matrix file contained the lncRNA and mRNA matrix, and gets-lncRNA symbol.pl and get-mRNA symbol.pl scripts were used to obtain the lncRNA symbol and mRNA symbol matrix data, respectively. DElncRNAs were measured using lncRNA symbol matrix and ‘edgeR’ package in R software (version 3.4.3), with thresholds of |log2 fold change (FC)| >2.0 and adjusted P<0.01. DEmiRNAs were defined as those with |log2FC| >2.0 and adjusted P<0.01.

Constructing the ceRNA network in breast cancer

DElncRNAs and DEmiRNAs were predicted using the miRcode (http://www.miRcode.org/), starBase (http://starbase.sysu.edu.cn/) (21), using miRTarBase online software to perform target gene predictions on the screened differential miRNAs (22). TargetScan (http://www.targetscan.org/vert_72/) and miRDB (http://mirdb.org/) databases were used to predict the targeting relationship between DEmiRNA and DEmRNA. The target DEmiRNA is entered into the database, which then shows all the target genes that may interact with the target microRNA, and then further screens the genes involved by the construction of a ceRNA network (23). To understand the functions of miRNA and lncRNA with the ceRNA network, gene co-expression network analysis was used, and the results were visualized with Cytoscape 3.6.0 (24).

Top 10 aberrantly expressed lncRNAs in breast cancer

The top 10 lncRNA were selected by receiver operating characteristics (ROC) curve and area under the curve (AUC) analysis. The prognostic roles of lncRNAs were examined with Kaplan-Meier curve analysis, and the log-rank test was conducted to distinguish survival time. P<0.05 was considered to indicate a statistically significant difference.

Analysis the lncRNAs with gene expression profiling interactive analysis (GEPIA)

Similarly, lncRNA expression levels in para-noncancerous tissues and cancer tissues were compared using the GEPIA database (http://gepia.cancer-pku.cn) (25), which was used to analyze the RNA sequencing data of 33 types of cancers from TCGA database.

Survival analysis

Using the survival time data for breast cancer in TCGA, the ‘Survival’ package in R software (version 3.4.3) was used to analyze the specific lncRNA, miRNA and mRNA associated with survival. Kaplan-Meier survival analysis was performed to analyze the correlation between DERNAs signature and breast cancer patient prognosis. P<0.05 was considered to indicate a statistically significant difference.

Results

DElncRNAs, DEmRNAs and DEmiRNAs

RNA expression levels in 1,109 breast cancer tumor and 113 normal tissue samples were investigated. Genes with |log2 FC| >1.5 and adjusted P<0.01 were considered as differentially expressed. The significant DEmiRNAs and DEmRNAs was identified in breast cancer samples. A total of 2,155 DEmRNAs, 89 DEmiRNAs and 1,028 lncRNAs were identified from TCGA data set using the ‘edgeR’ package in R. Of these, 1,434 mRNAs, 68 miRNAs and 777 lncRNAs were upregulated and 721 mRNAs, 21 miRNAs and 251 lncRNAs were downregulated in breast cancer tissues compared with normal tissues. The RNAs hierarchical clustering analyses are presented in Fig. 1A, and it was demonstrated that the expression levels of these three types of RNAs were significantly differentiated compared with the normal tissues. Finally, volcano plots were generated, and differences between the normal and tumor groups were identified (Fig. 1B).
Figure 1.

Heatmap and volcano plots demonstrating differential lncRNA, mRNA and miRNA expression between normal and cancer samples. (A) Heatmap of DElncRNAs, DEmiRNAs and DEmRNAs. Sample clusters are included above the heatmap. Clusters of DElncRNAs, DEmiRNAs and DEmRNAs are noted on the left of the heatmap. Red represents upregulated genes and green represents downregulated genes. Normal samples (blue) are on the left side of each heatmap, and tumor samples (pink) are on the right. (B) Volcano plots reflecting number, significance and reliability of differentially expressed lncRNAs, miRNAs and mRNAs. Red dots indicate upregulation and green dots indicate downregulation of lncRNAs, miRNAs and mRNAs. Black dots show the lncRNAs, miRNAs and mRNAs with expression of |log2FC|<2. The x-axis represents an adjusted FDR and the y-axis represents the value of log2 FC. Aberrantly expressed lncRNAs were calculated by DESeq R. In total, 777 upregulated and 251 downregulated lncRNAs, 68 upregulated and 21 downregulated miRNAs, and 1,434 upregulated and 721 downregulated mRNAs were identified. lncRNA, long non-coding RNA: miRNA, microRNA, DE, differentially expressed; FC, fold change; FDR, false discovery rate.

miRNA targeting lncRNAs predicted by miRcode and starBase

A total of 89 breast cancer-associated miRNAs were identified to be expressed in breast cancer and normal tissues. A total of 19 miRNAs were selected from 89 breast cancer-associated miRNAs in TCGA (|log2 FC| >3; P<0.001; Table I). It was identified that 19 miRNAs interacted with 93 lncRNAs (Table II).
Table I.

Breast cancer-specific miRNAs in the competing endogenous RNA network.

miRNAExpression changelog2 FC (T/N)P-valueFDR
hsa-miR-141Upregulated2.314549762.06×10−886.36×10−87
hsa-miR-200aUpregulated2.1977448635.86×10−741.32×10−72
hsa-miR-145Downregulated−2.2401049394.89×10−1903.59×10−188
hsa-miR-182Upregulated2.455997012.74×10−715.75×10−70
hsa-miR-206Downregulated−3.8928022441.7×10−269.3×10−26
hsa-miR-204Downregulated−2.5489017572.02×10−613.29×10−60
hsa-miR-21Upregulated2.266979845.18×10−1152.54×10−113
hsa-miR-375Upregulated2.6572223818.83×10−305.36×10−29
hsa-miR-183Upregulated3.0402476853.28×10−1061.28×10−104
hsa-miR-122Upregulated7.3771923928.45×10−152.46×10−14
hsa-miR-96Upregulated3.378896785.04×10−1132.28×10−111
hsa-miR-187Upregulated3.2560992831.11×10−245.49×10−24
hsa-miR-301bUpregulated2.9664318037.76×10−407.59×10−39
hsa-miR-429Upregulated2.7671545059.75×10−822.49×10−80
hsa-miR-210Upregulated3.1462524395.59×10−527.62×10−51
hsa-miR-144Downregulated−2.7909136554.88×10−1001.59×10−98
hsa-miR-137Upregulated2.5333833796.50×10−111.48×10−10
hsa-miR-184Upregulated4.3105807661.02×10−234.84×10−23
hsa-miR-100Downregulated−2.0058721183.19×10−807.8×10−79

hsa, Homo sapiens; miR, microRNA; FC, fold change; T/N, tumor/normal; FDR, false discovery rate.

Table II.

DElncRNAs interacting with the 19 DEmiRNAs retrieved from the miRcode database.

lncRNAmiRNAs
AGAP11hsa-miR-141, hsa-miR-200a, hsa-miR-145, hsa-miR-182, hsa-miR-206, hsa-miR-204, hsa-miR-21, hsa-miR-375
C2orf48hsa-miR-183, hsa-miR-204, hsa-miR-122
SHANK2-AS3hsa-miR-96, hsa-miR-145, hsa-miR-187, hsa-miR-204, hsa-miR-122
C20orf166-AS1hsa-miR-301b, hsa-miR-183, hsa-miR-429, hsa-miR-375
C15orf54hsa-miR-301b, hsa-miR-182, hsa-miR-206, hsa-miR-429, hsa-miR-375
AC127496.1hsa-miR-96, hsa-miR-182, hsa-miR-183, hsa-miR-204, hsa-miR-210, hsa-miR-122
MIR7-3HGhsa-miR-145, hsa-miR-204
LINC00305hsa-miR-144, hsa-miR-204
C10orf91hsa-miR-429, hsa-miR-204, hsa-miR-122
WT1-AShsa-miR-96, hsa-miR-141, hsa-miR-200a, hsa-miR-145, hsa-miR-182, hsa-miR-206, hsa-miR-429, hsa-miR-375
LINC00518hsa-miR-141, hsa-miR-200a, hsa-miR-145, hsa-miR-206, hsa-miR-204, hsa-miR-375
LINC00221hsa-miR-301b, hsa-miR-96, hsa-miR-182, hsa-miR-204, hsa-miR-21
TCL6hsa-miR-301b, hsa-miR-96, hsa-miR-137, hsa-miR-144, hsa-miR-145, hsa-miR-182, hsa-miR-183, hsa-miR-187, hsa-miR-206, hsa-miR-204, hsa-miR-210, hsa-miR-122, hsa-miR-375
AC009093.1hsa-miR-183, hsa-miR-122
C17orf102hsa-miR-96, hsa-miR-145, hsa-miR-206, hsa-miR-429, hsa-miR-21
AC135178.1hsa-miR-122
AF241725.1hsa-miR-145
MUC2hsa-miR-145, hsa-miR-182, hsa-miR-183, hsa-miR-184, hsa-miR-429, hsa-miR-210, hsa-miR-122
RMRPhsa-miR-206, hsa-miR-122
C1orf137hsa-miR-204
AL391421.1hsa-miR-137, hsa-miR-144
TDRG1hsa-miR-122
C10orf126hsa-miR-141, hsa-miR-200a, hsa-miR-206, hsa-miR-375
MUC19hsa-miR-301b, hsa-miR-96, hsa-miR-137, hsa-miR-144, hsa-miR-145, hsa-miR-182, hsa-miR-184, hsa-miR-187, hsa-miR-206, hsa-miR-429, hsa-miR-204, hsa-miR-122, hsa-miR-375
UCA1hsa-miR-96, hsa-miR-182, hsa-miR-184, hsa-miR-206, hsa-miR-122
LINC00488hsa-miR-96, hsa-miR-144, hsa-miR-206, hsa-miR-21, hsa-miR-122
LINC00243hsa-miR-96, hsa-miR-145, hsa-miR-182, hsa-miR-206, hsa-miR-122, hsa-miR-375
AL356479.1hsa-miR-429
AL356310.1hsa-miR-206
AC080037.1hsa-miR-429
AL513123.1hsa-miR-141, hsa-miR-200a, hsa-miR-183
LGALS8-AS1hsa-miR-122
SMCR2hsa-miR-145, hsa-miR-183, hsa-miR-204
PHEX-AS1hsa-miR-301b, hsa-miR-96, hsa-miR-145, hsa-miR-182, hsa-miR-204, hsa-miR-122
LINC00466hsa-miR-96, hsa-miR-137, hsa-miR-141, hsa-miR-200a, hsa-miR-144, hsa-miR-183, hsa-miR-206, hsa-miR-429, hsa-miR-204, hsa-miR-21
CHL1-AS2hsa-miR-183
TSSC1-IT1hsa-miR-137, hsa-miR-141, hsa-miR-200a, hsa-miR-183
LINC00337hsa-miR-145, hsa-miR-182, hsa-miR-375
LINC00113hsa-miR-145
LINC00351hsa-miR-21, hsa-miR-375
ADIPOQ-AS1hsa-miR-144, hsa-miR-145, hsa-miR-182, hsa-miR-183, hsa-miR-184, hsa-miR-122, hsa-miR-375
NAALADL2-AS2hsa-miR-183, hsa-miR-206
LINC00355hsa-miR-141, hsa-miR-200a, hsa-miR-122
LINC00392hsa-miR-183
HOTAIRhsa-miR-301b, hsa-miR-206, hsa-miR-204, hsa-miR-21, hsa-miR-375
SRGAP3-AS2hsa-miR-145, hsa-miR-206
HCG23hsa-miR-145
LINC00200hsa-miR-183, hsa-miR-204
LINC00404hsa-miR-141, hsa-miR-200a
SACS-AS1hsa-miR-144, hsa-miR-187
EMX2OShsa-miR-182, hsa-miR-183, hsa-miR-184, hsa-miR-210
ATXN8OShsa-miR-145, hsa-miR-183, hsa-miR-204, hsa-miR-210, hsa-miR-122hsa-miR-375
LINC00210hsa-miR-96, hsa-miR-206, hsa-miR-204
TLR8-AS1hsa-miR-182, hsa-miR-187, hsa-miR-206, hsa-miR-204
LINC00460hsa-miR-206, hsa-miR-429
POU6F2-AS2hsa-miR-137, hsa-miR-187, hsa-miR-375
BOK-AS1hsa-miR-184
MAGI2-AS3hsa-miR-137, hsa-miR-141, hsa-miR-200a, hsa-miR-144, hsa-miR-145, hsa-miR-429, hsa-miR-204, hsa-miR-210, hsa-miR-122
CHL1-AS1hsa-miR-137, hsa-miR-187
FNDC1-IT1hsa-miR-144
DSCAM-AS1hsa-miR-137, hsa-miR-141, hsa-miR-200a, hsa-miR-204, hsa-miR-122
LINC00484hsa-miR-141, hsa-miR-200a, hsa-miR-187, hsa-miR-206, hsa-miR-122
ARHGEF7-AS2hsa-miR-187, hsa-miR-210, hsa-miR-122, hsa-miR-375
RBMS3-AS3hsa-miR-96, hsa-miR-182, hsa-miR-204
LINC00445hsa-miR-375
NDP-AS1hsa-miR-145, hsa-miR-206hsa-miR-122
AC009121.1hsa-miR-141, hsa-miR-200a
CLRN1-AS1hsa-miR-137, hsa-miR-206, hsa-miR-429, hsa-miR-204
PDZRN3-AS1hsa-miR-141, hsa-miR-200a
AC080129.1hsa-miR-122
AL109754.1hsa-miR-122
MME-AS1hsa-miR-100, hsa-miR-182, hsa-miR-429
LSAMP-AS1hsa-miR-183, hsa-miR-375
ADAMTS9-AS1hsa-miR-301b, hsa-miR-96, hsa-miR-144, hsa-miR-145, hsa-miR-182, hsa-miR-21
SYNPR-AS1hsa-miR-96, hsa-miR-182, hsa-miR-375
ADAMTS9-AS2hsa-miR-301b, hsa-miR-96, hsa-miR-137, hsa-miR-141, hsa-miR-200a, hsa-miR-144, hsa-miR-145
hsa-miR-182, hsa-miR-183, hsa-miR-184, hsa-miR-204, hsa-miR-122, hsa-miR-375
AC007731.1hsa-miR-183
AL139002.1hsa-miR-301b, hsa-miR-210
PEX5L-AS1hsa-miR-141, hsa-miR-200a
AC061992.1hsa-miR-301b, hsa-miR-204, hsa-miR-122
AP000553.1hsa-miR-122
LINC00461hsa-miR-96, hsa-miR-137, hsa-miR-141, hsa-miR-200a, hsa-miR-144, hsa-miR-145, hsa-miR-204, hsa-miR-210, hsa-miR-122
ALDH1L1-AS2hsa-miR-301b, hsa-miR-145, hsa-miR-210
MAST4-IT1hsa-miR-204
LINC00536hsa-miR-96, hsa-miR-137, hsa-miR-182, hsa-miR-204
OPCML-IT1hsa-miR-184, hsa-miR-375
C8orf49hsa-miR-301b, hsa-miR-100, hsa-miR-184, hsa-miR-429, hsa-miR-122, hsa-miR-375
AL589642.1hsa-miR-141, hsa-miR-200a, hsa-miR-145, hsa-miR-182, hsa-miR-183, hsa-miR-429, hsa-miR-204, hsa-miR-210, hsa-miR-122
AC040173.1hsa-miR-96, hsa-miR-144, hsa-miR-182, hsa-miR-183, hsa-miR-429
LINC00524hsa-miR-204
LINC00052hsa-miR-145, hsa-miR-187
PWRN1hsa-miR-137, hsa-miR-144, hsa-miR-145, hsa-miR-184, hsa-miR-21, hsa-miR-122
LINC00261hsa-miR-301b, hsa-miR-144, hsa-miR-145, hsa-miR-182, hsa-miR-183, hsa-miR-206, hsa-miR-429, hsa-miR-204, hsa-miR-375

DE, differentially expressed; lncRNAs, long non-coding RNAs; hsa, Homo sapiens; miR, microRNA.

miRNA targets predicted by miRTarBase

Based on the miRNAs described in Table I, miRNA-targeted mRNA were identified by searching the TargetScan, miRDB and miRTarBase databases. A total of 27 mRNAs were identified including titin, chordin like 1, hydroxycarboxylic acid receptor 2, transforming growth factor β receptor 2, A-kinase anchoring protein 12, sprout RTK signaling antagonist 2, kelch like family member 40, sterile α motif domain containing 5, transcription elongation factor A like 7, KIT proto-oncogene receptor tyrosine kinase, cyclin B1, cell adhesion molecule L1 like, WAS protein family member 3, fibroblast growth factor (FGF) 2, SHC binding and spindle associated 1, serine rich and transmembrane domain containing 1, karyopherin α 2 (KPNA2), neurotrophic receptor tyrosine kinase 2 (NTRK2), cyclin E2, secreted frizzled related protein 1 (SFRP1), par-6 family cell polarity regulator β, ELAV like RNA binding protein 2, FGF receptor 3, cadherin 2, C-C motif chemokine ligand 20, solute carrier family 1 member 1 and homeobox B5 (Table III).
Table III.

Breast cancer-specific mRNAs in the competing endogenous RNA network.

mRNAGene IDExpression changelog2 FC (T/N)P-valueFDR
TTNENSG00000155657Downregulated−4.9500856070.0010.001
CHRDL1ENSG00000101938Downregulated−4.3846451645.38×10−2468.81×10−244
HCAR2ENSG00000182782Downregulated−3.3595955652.72×10−2333.83×10−231
TGFBR2ENSG00000163513Downregulated−2.1481904754.53×10−1824.04×10−180
AKAP12ENSG00000131016Downregulated−2.6305463551.93×10−1801.68×10−178
SPRY2ENSG00000136158Downregulated−2.3733438483.08×10−1792.63×10−177
KLHL40ENSG00000157119Downregulated−7.1408244955.06×10−1503.06×10−148
SAMD5ENSG00000203727Downregulated−2.9216015541.42×10−1345.06×10−150
TCEAL7ENSG00000182916Downregulated−2.0876728751.82×10−1217.64×10−120
KITENSG00000157404Downregulated−2.9048761015.47×10−1202.24×10−118
CCNB1ENSG00000134057Upregulated2.6415586335.60×10−1192.22×10−117
CHL1ENSG00000134121Downregulated−2.8934842533.51×10−1151.31×10−113
WASF3ENSG00000132970Downregulated−2.1003710971.12×10−1113.94×10−110
FGF2ENSG00000138685Downregulated−2.6633806212.96×10−1028.7×10−101
SHCBP1ENSG00000171241Upregulated2.6743702723.03×10−876.84×10−86
SERTM1ENSG00000180440Downregulated−3.5968254323.8×10−817.75×10−80
KPNA2ENSG00000182481Upregulated2.2327714918.08×10−811.64×10−79
NTRK2ENSG00000148053Downregulated−2.5322101343.89×10−706.46×10−69
CCNE2ENSG00000175305Upregulated2.4957448826.12×10−701.01×10−68
SFRP1ENSG00000104332Downregulated−2.497175911.76×10−461.59×10−45
PARD6BENSG00000124171Upregulated2.2638527418.63×10−355.35×10−34
ELAVL2ENSG00000107105Upregulated2.6933397722.05×10−341.25×10−33
FGFR3ENSG00000068078Upregulated2.4724709415.52×10−323.06×10−31
CDH2ENSG00000170558Upregulated2.6390301074.82×10−282.26×10−27
CCL20ENSG00000115009Upregulated2.5686831561.40×10−245.69×10−24
SLC1A1ENSG00000106688Upregulated2.3522490293.02×10−211.06×10−20
HOXB5ENSG000001200755Upregulated2.0011212611.91×10−155.15×10−15

FC, fold change; T/N, tumor/normal; FDR, false discovery rate.

Construction of a ceRNA network in breast cancer

To improve the understanding of the role of DElncRNAs in breast cancer, a ceRNA network was constructed based on co-expressed lncRNAs-miRNAs and miRNAs-mRNAs. As demonstrated in Fig. 2, the ceRNA network was composed of 93 lncRNAs, 27 mRNAs and 19 miRNAs.
Figure 2.

Competing endogenous RNA network in breast cancer. The red nodes represent increased level of expression, while the green nodes represent decreased level of expression. Rectangles represent miRNAs, diamonds represent lncRNAs and ellipses represent mRNAs. The gray edges denote lncRNA-miRNA-mRNA interactions. lncRNA, long non-coding RNA; miR, microRNA.

ROC analysis of breast cancer-specific lncRNAs

The 93 miRNAs targeting lncRNAs in breast cancer predicted by miRcode and starBase were selected. Then, Perl script was used to obtain the survival analysis data, and the clinicalexp file (lncRNA name, patient survival time, survival state) was combined with the survival data for univariate Cox analysis. In the univariate Cox analysis data, when the hazard ratio HR >1, this indicated that the higher the gene expression, the higher the risk of mortality, that is, the expression of the gene is contrary to survival. HR <1, indicated that the higher the expression of gene, the lower the risk of mortality, that is, the higher the survival rate. Survival-associated genes were then listed in order of ascending P-value, and the top 13 lncRNAs with P<0.05 were selected. lncRNAs were selected subsequent to entering the screening gene code in R software (version 3.4.3). Finally, the top 10 lncRNAs associated with breast cancer survival analysis were screened (Table IV; Fig. 3). These included: ADAM metallopeptidase with thrombospondin type 1 motif 9-antisense RNA 1 (ADAMTS9-AS1), AC061992.1, Wilms tumor 1 antisense RNA (WT1-AS), long intergenic non-protein coding RNA 536 (LINC00536), AL391421.1, SLIT-ROBO Rho GTPase activating protein 3 antisense RNA 2 (SRGAP3-AS2), Prader-Willi region non-protein coding RNA 1 (PWRN1), family with sequence similarity 230 member G (AC007731.1), chromosome 10 open reading frame 126 (C10orf126) and AL513123.1. Of these, the levels of ADAMTS9-AS1 and PWRN1 were downregulated in breast cancer tissues. The other 8 lncRNAs were upregulated in cancer tissues compared with in normal tissues. A total of 6 lncRNAs exhibited high prognostic values for distinguishing tumor tissues from normal tissues, with AUC values of >0.99.
Table IV.

Top 10 aberrantly expressed lncRNAs and AUC values in breast cancer.

lncRNAGene IDExpression changelog2 FC (T/N)AUCP-value
ADAMTS9-AS1ENSG00000241158Downregulated−2.7462167550.888<0.05
AC061992.1ENSG00000266970Upregulated2.1401338280.867<0.05
WT1-ASENSG00000183242Upregulated4.9013600821.102<0.05
LINC00536ENSG00000249917Upregulated2.7210252851.110<0.05
AL391421.1ENSG00000204049Upregulated3.1497014970.940<0.05
SRGAP3-AS2ENSG00000228723Upregulated4.6082975440.889<0.05
PWRN1ENSG00000259905Downregulated−2.2742433941.243<0.05
AC007731.1ENSG00000188280Upregulated2.8400147011.182<0.05
C10orf126ENSG000002043655Upregulated3.6880824051.183<0.05
AL513123.1ENSG00000236347Upregulated2.9055686211.194<0.05

lncRNA; long-non-coding RNA; FC, fold change; T/N, tumor/normal; AUC, area under the curve.

Figure 3.

ROC curves and heatmap of the top 10 lncRNAs sorted by AUC in breast cancer. (A) The red line represents cases with high expression, and blue line represents cases with low expression. The x-axis indicates overall survival time (days), and the y-axis indicates the survival rate. Kaplan-Meier analysis was performed, and the curves were generated by R. (B) The red line represents the sensitivity curve, and the black line represents the identifying line. The x-axis indicates the false-positive rate. The y-axis indicates the true-positive rate. (C) Heatmap of the top 10 lncRNAs. Expression intensity increases from green to red, where green represents low expression, and red represents high expression. ROC, receiver operating characteristic; lncRNAs, long non-coding RNAs; AUC, area under the curve.

Additional analysis for the selected lncRNAs expression

GEPIA results (Fig. 4A) revealed upregulation of ADAMTS9-AS1 in kidney renal clear cell carcinoma (KIRC), kidney renal papillary cell carcinoma (KIRP) and prostate adenocarcinoma (PRAD). As demonstrated in Fig. 4B, the results also identified that WT1-AS levels were significantly increased in acute myeloid leukemia, ovarian cancer (OV), uterine corpus endometrial carcinoma (UCEC) and uterine carcinosarcomas compared with in normal tissues. Notably, the increased expression of LINC00536 was demonstrated in breast invasive carcinoma, while a decreased expression was demonstrated in testicular germ cell tumors (TGCT) (Fig. 4C). Furthermore, increased SRGAP3-AS2 expression was identified in adenoid cystic carcinoma, cholangiocarcinoma, OV and UCEC, while significant downregulation of SRGAP3-AS2 was observed in lung adenocarcinoma and lung squamous cell carcinoma (LUSC) (Fig. 4D). PWRN1 was observed to be downregulated in thyroid carcinoma and upregulated in TGCT, pheochromocytoma and paraganglioma (Fig. 4E). A decreased expression of C10orf126 was identified in KIRC, KIRP and liver hepatocellular carcinoma (Fig. 4F).
Figure 4.

lncRNAs expression among various types of cancer involved in The Cancer Genome Atlas based on GEPIA. (A) ADAMTS9-AS1, (B) WT1-AS, (C) LINC00536, (D) SRGAP3-AS2, (E) PWRN1 and (F) C10orf126. There is no description of the genes AC061992.1, AL391421.1, AC007731.1 and AL513123.1 in GEPIA. The y-axis indicates the log2 (transcripts per million + 1) for lncRNA expression. Red-colored bars represent the tumor tissue samples and the green-colored bars indicate the normal tissue samples. These charts were derived from GEPIA. lncRNAs; long non-coding RNAs; GEPIA, Gene Expression Profiling Interactive Analysis; KIRC, kidney renal clear cell carcinoma; KIRP, kidney renal papillary cell carcinoma; PRAD, prostate adenocarcinoma; LAML, acute myeloid leukemia; OV, ovarian cancer; UCEC, uterine corpus endometrial carcinoma; UCS, uterine carcinosarcomas; BRCA, breast invasive carcinoma; TGCT, testicular germ cell tumors; ACC, adenoid cystic carcinoma; CHOL, cholangiocarcinoma; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma; THCA, thyroid carcinoma; PCPG, pheochromocytoma and paraganglioma; LIHC, liver hepatocellular carcinoma. ADAMTS9-AS1, ADAM metallopeptidase with thrombospondin type 1 motif 9-antisense RNA 1; WT1-AS, Wilms tumor 1 antisense RNA; LINC00536, long intergenic non-protein coding RNA 536; SRGAP3-AS2, SLIT-ROBO Rho GTPase activating protein 3 antisense RNA 2; PWRN1, Prader-Willi region non-protein coding RNA 1; C10orf126, chromosome 10 open reading frame 126.

The top 10 lncRNAs were selected using ROC analysis. As demonstrated in Fig. 5, the expression levels of 5 DElncRNAs, including ADAMTS9-AS1, AL513123.1, C10orf126, LINC00536 and WT1-AS, were associated with overall survival (P<0.05). A total of 2 miRNAs (hsa-miR-204 and hsa-miR-301b) were associated with overall survival (P<0.05; Fig. 6). In addition, 3 mRNA (KPNA2, NTRK2 and SFRP1) were associated with overall survival (P<0.05; Fig. 7).
Figure 5.

Kaplan-Meier survival curves for 5 differentially expressed long non-coding RNAs associated with overall survival in breast cancer. (A) ADAMTS9-AS1 (P=5.52×10−103), (B) AL513123.1 (P=1.17×10−08), (C) C10orf126 (P=4.35×10−11), (D) LINC00536 (P=5.03×10−32) and (E) WT1-AS (P=2.11×10−37). The x-axis represents overall survival time (years) and the y-axis represents survival function. ADAMTS9-AS1, ADAM metallopeptidase with thrombospondin type 1 motif 9-antisense RNA 1; C10orf126, chromosome 10 open reading frame 126; LINC00536, long intergenic non-protein coding RNA 536; WT1-AS, Wilms tumor 1 antisense RNA.

Figure 6.

Kaplan-Meier survival curves for two differentially expressed miRNAs associated with overall survival in breast cancer. (A) hsa-miR-204 (P=2.02×10−61) and (B) hsa-miR-301b (P=7.76×10−40). The x-axis represents overall survival time (years) and the y-axis represents survival function. hsa, Homo sapiens; miRNA, microRNA.

Figure 7.

Kaplan-Meier survival curves for 3 mRNA associated with overall survival in breast cancer. (A) KPNA2 (P=8.08×10−81), (B) NTRK2 (P=3.89×10−70) and (C) SFRP1 (P=1.76×10−46). The x-axis represents overall survival time (years) and the y-axis represents survival function. KPNA2, karyopherin α 2; NTRK2, neurotrophic receptor tyrosine kinase 2; SFRP1, secreted frizzled related protein 1.

Discussion

In previous years, much attention has been paid to breast cancer pathogenesis. Nevertheless, clinical outcomes remain highly heterogeneous. Nik-Zainal et al (26) analyzed genome-wide data from 560 patients with breast cancer and identified that 93 oncogenes involved in the encoded proteins may cause breast cancer. Prognosis is an important indicator of disease treatment. Bioinformatics methods have been used to investigate the prognostic significance of lncRNAs in breast cancer. Several studies has also indicated that the aberrant expression of lncRNAs contributes to the development of different types of cancer: Xiao et al (27) revealed that MALAT1 may be used as a ceRNA to promote the expression of ZEB2 by expanding miR-200s, and may be a therapeutic target for kidney cancer; Wang et al (28) demonstrated that UCA1 and AATBC are not only included in the ceRNA network, but are also associated with overall survival in muscle-invasive bladder cancer; Chen et al (25) suggested that the top 10 aberrantly expressed lncRNAs identified in their study served important roles in LUSC through an lncRNA-mRNA network; and Li et al (29) described a novel gastric cancer-specific ceRNA network including 11 lncRNAs, 9 miRNAs and 41 mRNAs. ceRNA transcripts may be used to competitively bind the same MRE with miRNAs, thereby relieving or mitigating the inhibitory effects of miRNAs on target genes. MREs are the basis for miRNA function, and other non-coding genes also interact with miRNAs through MREs. Recently, Zhou et al (30) used a Pearson's correlation analysis of miRNA-gene pairs to construct breast cancer-specific ceRNA networks. Regulation of phosphatase and tensin homolog expression in a 3′ untranslated region-and miRNA-dependent manner in breast cancer has been identified in another study (31). Furthermore, ceRNA crosstalk may be associated with regulatory mechanisms in breast cancer. Shen et al (32) and Yang et al (33) demonstrated that FAM83H-AS1 appeared to be a novel prognostic biomarker in breast cancer. Cheang et al (34) revealed that patients with breast cancer with high HER-2 protein expression levels and Ki67 index exhibited poor recurrence and disease-specific survival rates. Yuan et al (35) identified that the ceRNA crosstalk network in triple negative breast cancer included the differential expression of 22 hub mRNAs, 11 miRNAs and 14 lncRNAs. The present study obtained lncRNA, mRNA and miRNA data from TCGA to construct breast cancer-associated ceRNA networks, studied the regulatory mechanisms of lncRNA as ceRNAs in the progression of cancer, and also investigated their potential as prognostic biomarkers and therapeutic targets in breast cancer. A ceRNA network with differential expression of 27 mRNAs, 19 miRNAs and 93 lncRNAs was constructed. The present study investigated the lncRNA expression profiles of a large cohort of patients in TCGA. A total of 93 DElncRNAs were identified in breast cancer samples compared with the normal samples. Among these, the top 10 lncRNA were significantly associated with overall survival. Furthermore, it was observed that the lncRNAs ADAMTS9-AS1, AL513123.1, C10orf126, LINC00536 and WT1-AS were included in the ceRNA network. Therefore, we hypothesized that these lncRNAs may serve significant roles in the pathogenesis and prognosis of breast cancer. Wang et al (36) revealed that antisense lncRNA ADAMTS9-AS1 was associated with the nearby coding gene ADAMTS9, which was involved in ovarian cancer progression. Li et al (37) identified that the lncRNA ADAMTS9-AS1 served as a novel prognostic biomarker for clinical application in esophageal squamous cell carcinoma. Crippa et al (38) hypothesized that LINC00536 disruption may have contributed to the onset of the clinical trichorhinophalangeal syndrome-like phenotype. Wang et al (39) suggested that WT1 was overexpressed in human hepatocellular carcinoma tissues, and was negatively correlated with overall survival in patients with hepatocellular carcinoma. In the present study, a novel ceRNA network was constructed to identify the associations between miRNAs and lncRNAs. miRNAs also serve important roles in cell differentiation, biological development and disease progression. Several previous data have demonstrated that the mutual regulation between lncRNAs and miRNAs and their downstream target genes are closely associated with the occurrence and development of cancer (40–42). Zhang et al (43) used reverse transcription quantitative polymerase chain reaction and in situ hybridization for breast cancer tissue samples and cell lines. Cell phenotype experiments were performed to verify that lncRNA-GAS5 exon 4 bound miR-21 and inhibited the occurrence and development of breast cancer cells (43). The present study demonstrated that hsa-miR-204 was downregulated and hsa-miR-301b was upregulated in patients with breast cancer compared with healthy controls, and was associated with overall survival. Yuan et al (44) revealed that miR-204 suppressed the proliferation and metastasis of gastric cancer cells. Todorova et al (45) identified that miR-204 promoted prostate cancer-associated androgen-responsive genes and androgen receptor (AR) target genes and AR co-regulated molecules. Abmutalib et al (46) hypothesized that hsa-miR-301b may be involved in regulating lymph node metastasis in papillary thyroid carcinoma via interactions with hepatic leukemia factor, hypoxia-inducible factor and REL/nuclear factor kappa-light-chain-enhancer of activated B cells. Geng et al (47) demonstrated that overexpression of hsa-miR-301b significantly affected the cell cycle of human lung adenocarcinoma A549 cells. ceRNAs have been implicated in several biological processes, and abnormalities in the ceRNA network may lead to tumorigenesis. In summary, the present study investigated lncRNA-mediated ceRNA interactions using miRNA, lncRNA and mRNA expression profiles in cancer and normal tissues. The results suggested that cancer-specific lncRNAs in breast cancer may be involved in the regulation of a complex ceRNA network. These data may provide novel insights into the clinical significance and regulatory mechanisms of lncRNA-mediated ceRNA networks, and identify novel lncRNAs as potential prognostic biomarkers and therapeutic targets for the diagnosis and treatment of breast cancer.
  45 in total

1.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

Review 2.  Non-coding RNAs in human disease.

Authors:  Manel Esteller
Journal:  Nat Rev Genet       Date:  2011-11-18       Impact factor: 53.242

3.  Coding-independent regulation of the tumor suppressor PTEN by competing endogenous mRNAs.

Authors:  Yvonne Tay; Lev Kats; Leonardo Salmena; Dror Weiss; Shen Mynn Tan; Ugo Ala; Florian Karreth; Laura Poliseno; Paolo Provero; Ferdinando Di Cunto; Judy Lieberman; Isidore Rigoutsos; Pier Paolo Pandolfi
Journal:  Cell       Date:  2011-10-14       Impact factor: 41.582

4.  Long noncoding RNA associated with microvascular invasion in hepatocellular carcinoma promotes angiogenesis and serves as a predictor for hepatocellular carcinoma patients' poor recurrence-free survival after hepatectomy.

Authors:  Sheng-Xian Yuan; Fu Yang; Yuan Yang; Qi-Fei Tao; Jin Zhang; Gang Huang; Yun Yang; Ruo-Yu Wang; Sen Yang; Xi-Song Huo; Ling Zhang; Fang Wang; Shu-Han Sun; Wei-Ping Zhou
Journal:  Hepatology       Date:  2012-12       Impact factor: 17.425

5.  A ceRNA hypothesis: the Rosetta Stone of a hidden RNA language?

Authors:  Leonardo Salmena; Laura Poliseno; Yvonne Tay; Lev Kats; Pier Paolo Pandolfi
Journal:  Cell       Date:  2011-07-28       Impact factor: 41.582

Review 6.  Targeted therapies for breast cancer.

Authors:  Michaela J Higgins; José Baselga
Journal:  J Clin Invest       Date:  2011-10-03       Impact factor: 14.808

7.  Effects of reproductive and demographic changes on breast cancer incidence in China: a modeling analysis.

Authors:  Eleni Linos; Demetri Spanos; Bernard A Rosner; Katerina Linos; Therese Hesketh; Jian Ding Qu; Yu-Tang Gao; Wei Zheng; Graham A Colditz
Journal:  J Natl Cancer Inst       Date:  2008-09-23       Impact factor: 13.506

8.  miRcode: a map of putative microRNA target sites in the long non-coding transcriptome.

Authors:  Ashwini Jeggari; Debora S Marks; Erik Larsson
Journal:  Bioinformatics       Date:  2012-06-19       Impact factor: 6.937

9.  Ki67 index, HER2 status, and prognosis of patients with luminal B breast cancer.

Authors:  Maggie C U Cheang; Stephen K Chia; David Voduc; Dongxia Gao; Samuel Leung; Jacqueline Snider; Mark Watson; Sherri Davies; Philip S Bernard; Joel S Parker; Charles M Perou; Matthew J Ellis; Torsten O Nielsen
Journal:  J Natl Cancer Inst       Date:  2009-05-12       Impact factor: 13.506

Review 10.  On the classification of long non-coding RNAs.

Authors:  Lina Ma; Vladimir B Bajic; Zhang Zhang
Journal:  RNA Biol       Date:  2013-04-15       Impact factor: 4.652

View more
  11 in total

1.  An innovative systematic approach introduced the involved lncRNA-miR-mRNA network in cell cycle and proliferation after conventional treatments in breast cancer patients.

Authors:  Maryam Mohsenikia; Solmaz Khalighfard; Ali Mohammad Alizadeh; Vahid Khori; Maziar Ghandian Zanjan; Mohammadreza Zare; Ramesh Omranipour; Elham Patrad; Hengamesadat Razavi; Ziba Veisi Malekshahi; Zahra Bagheri-Hosseinabadi
Journal:  Cell Cycle       Date:  2022-05-15       Impact factor: 5.173

2.  PWRN1 Suppressed Cancer Cell Proliferation and Migration in Glioblastoma by Inversely Regulating hsa-miR-21-5p.

Authors:  Jianxin Jiang; Xiaolin Wang; Jun Lu
Journal:  Cancer Manag Res       Date:  2020-07-02       Impact factor: 3.989

3.  LncRNA ADAMTS9-AS1 knockdown restricts cell proliferation and EMT in non-small cell lung cancer.

Authors:  Zhongwen Li; Guojun Yue; Tingyou Zhang; Jinzhi Wu; Xin Tian
Journal:  Histol Histopathol       Date:  2021-06-04       Impact factor: 2.303

4.  Exploring specific prognostic biomarkers in triple-negative breast cancer.

Authors:  Chang Bao; Yunkun Lu; Jishun Chen; Danni Chen; Weiyang Lou; Bisha Ding; Liang Xu; Weimin Fan
Journal:  Cell Death Dis       Date:  2019-10-24       Impact factor: 8.469

5.  Quantitative Expression of SFN, lncRNA CCDC18-AS1, and lncRNA LINC01343 in Human Breast Cancer as the Regulator Biomarkers in a Novel ceRNA Network: Based on Bioinformatics and Experimental Analyses.

Authors:  Mehrnoush Rishehri; Tahereh Etemadi; Leila Pisheh; Ghazaleh Koufigar; Mansoureh Azadeh
Journal:  Genet Res (Camb)       Date:  2022-09-12       Impact factor: 1.375

6.  Screening and validation of platelet activation-related lncRNAs as potential biomarkers for prognosis and immunotherapy in gastric cancer patients.

Authors:  Mingjie Yuan; Yanfei Jia; Yuanxin Xing; Yunshan Wang; Yunyun Liu; Xiangdong Liu; Duanrui Liu
Journal:  Front Genet       Date:  2022-09-14       Impact factor: 4.772

7.  Competing Endogenous RNA in Colorectal Cancer: An Analysis for Colon, Rectum, and Rectosigmoid Junction.

Authors:  Lucas Maciel Vieira; Natasha Andressa Nogueira Jorge; João Batista de Sousa; João Carlos Setubal; Peter F Stadler; Maria Emília Machado Telles Walter
Journal:  Front Oncol       Date:  2021-06-10       Impact factor: 6.244

8.  3'-UTR Shortening Contributes to Subtype-Specific Cancer Growth by Breaking Stable ceRNA Crosstalk of Housekeeping Genes.

Authors:  Zhenjiang Fan; Soyeon Kim; Yulong Bai; Brenda Diergaarde; Hyun Jung Park
Journal:  Front Bioeng Biotechnol       Date:  2020-04-29

9.  An in-silico method leads to recognition of hub genes and crucial pathways in survival of patients with breast cancer.

Authors:  Sepideh Dashti; Mohammad Taheri; Soudeh Ghafouri-Fard
Journal:  Sci Rep       Date:  2020-10-30       Impact factor: 4.379

10.  Integrated analysis of lncRNAs and mRNAs reveals key trans-target genes associated with ETEC-F4ac adhesion phenotype in porcine small intestine epithelial cells.

Authors:  Serafino M A Augustino; Qinglei Xu; Xueqin Liu; Siyuan Mi; Liangyu Shi; Yibing Liu; Hui Wen; Di Wang; Lei Liu; Qin Zhang; Ying Yu
Journal:  BMC Genomics       Date:  2020-11-10       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.