Literature DB >> 32727450

Integration analysis of long non-coding RNA (lncRNA) role in tumorigenesis of colon adenocarcinoma.

Arash Poursheikhani1,2, Mohammad Reza Abbaszadegan1,2,3, Negin Nokhandani4, Mohammad Amin Kerachian5,6,7.   

Abstract

BACKGROUND: Colon adenocarcinoma (COAD) is one of the most common gastrointestinal cancers globally. Molecular aberrations of tumor suppressors and/or oncogenes are the main contributors to tumorigenesis. However, the exact underlying mechanisms of COAD pathogenesis are clearly not known yet. In this regard, there is an urgent need to indicate promising potential diagnostic and prognostic biomarkers in COAD patients.
METHODS: In the current study, level 3 RNA-Seq and miR-Seq data and corresponding clinical data of colon adenocarcinoma (COAD) were retrieved from the TCGA database. The "limma" package in R software was utilized to indicate the differentially expressed genes. For in silico functional analysis, GO and KEGG signaling pathways were conducted. PPI network was constructed based on the STRING online database by Cytoscape 3.7.2. A ceRNA network was also constructed by "GDCRNATools" package in R software. Kaplan-Meier survival analysis (log-rank test) and ROC curve analysis were used to indicate the diagnostic and prognostic values of the biomarkers.
RESULTS: The differential expression data demonstrated that 2995 mRNAs, 205 lncRNAs, and 345 miRNAs were differentially expressed in COAD. The GO and KEGG pathway analysis indicated that the differentially expressed mRNAs were primarily enriched in canonical processes in cancer. The PPI network showed that the CDKN2A, CCND1, MYC, E2F, CDK4, BRCA2, CDC25B, and CDKN1A proteins were the critical hubs. In addition, the Kaplan-Meier analysis revealed that 215 mRNAs, 14 lncRNAs, and 39 miRNAs were associated with overall survival time in the patients. Also, the ceRNA network data demonstrated that three lncRNAs including MIR17HG, H19, SNHG1, KCNQ1OT1, MALAT1, GAS5, SNHG20, OR2A1-AS1, and MAGI2-AS3 genes were involved in the development of COAD.
CONCLUSIONS: Our data suggested several promising lncRNAs in the diagnosis and prognosis of patients with COAD.

Entities:  

Keywords:  Colorectal cancer; Long non-coding RNAs; MicroRNA; Tumorigenesis

Mesh:

Substances:

Year:  2020        PMID: 32727450      PMCID: PMC7392656          DOI: 10.1186/s12920-020-00757-2

Source DB:  PubMed          Journal:  BMC Med Genomics        ISSN: 1755-8794            Impact factor:   3.063


Background

Colon adenocarcinoma (COAD) is one of the most common gastrointestinal (GI) cancers and is the second leading cause of cancer-related death, globally [1, 2]. It is demonstrated that COAD occurs in approximately 5% of overall population at any given time in the world [3]. Despite the current screenings and therapies such as endoscopic resection and radical surgery, nearly half of the patients are diagnosed as advanced cases of COAD, experiencing tumor recurrence and relapse. COAD tumorigenesis has complicated multi-step processes including colon epithelial cell proliferation, aberration in differentiation, apoptosis resistance, survival, and invasion mechanisms [4]. Molecular aberrations of tumor suppressors and/or oncogenes are also one of the main contributors in different types of tumors especially COAD tumorigenesis [5]. However, due to complicacy of the underlying molecular pathways, the exact pathogenic contributors of COAD have not yet been clarified. Hence, there is an urgent need to indicate promising diagnostic and prognostic biomarkers for COAD. Recent investigations have highlighted the role of non-coding RNAs in the tumorigenesis of various malignancies. Among different kinds of non-coding RNAs, long non-coding RNA (lncRNA) is a putative class of non-coding RNA with more than 200 nucleotides in length, without any open-reading-frame (ORF) to encode proteins [5, 6]. Interestingly, a large body of evidence indicates that lncRNAs plays critical roles in a variety of biological processes including cell proliferation, cellular development, differentiation, carcinogenesis, and metastasis through modulating gene expression at the transcriptional and posttranscriptional levels directly or by recruiting chromatin remodeling factors [6-8]. Aberrant expression of lncRNAs has been well-documented in different sorts of cancers [9]. Dysregulation of lncRNA HOTAIR, H19, MALAT1, SNHG7, GAS8-AS, and NEAT1 were extensively well-studied and have been demonstrated to contribute in tumorigenesis and poor prognosis [5, 9–13]. Numerous investigations have shown that the lncRNAs can exert their function by competing endogenous RNA (ceRNA) crosstalk. For instance, it has been shown that lncRNA SCARNA2 was overexpressed in COAD tissues and it remarkably correlated with chemoresistance. Mechanistically, SCARNA2 via targeting miR-342-3p, upregulates EGFR and BCL2 expression in COAD cells [14]. Furthermore, overexpression of lncRNA SNHG1 has been shown to promote epithelial-mesenchymal transition (EMT) by binding to miR-497/miR-195-5p in COAD cells [15]. Also, lncRNA BDNF-AS was downregulated in COAD patients and served as a tumor suppressor gene. Unsurprisingly, ectopic expression of BDNF-AS suppressed cell proliferation and migration via epigenetically downregulating GSK-3β expression through EZH2 [16]. Moreover, several investigations have considered lncRNAs as therapeutic opportunities in COAD. For instance, it has been demonstrated that overexpression of LINC00152 can promote Fascin actin-bundling protein 1 (FSCN1) expression via sponging miR-632 and miR-185-3p, which consequently leads to proliferation and metastasis in COAD [17]. A recent study has demonstrated that targeting lncRNA FLANC by 1,2-dioleoyl-sn-glycero-3-phosphatidylcholine nanoparticles loaded with a specific small interfering RNA, decreased metastasis without any significant toxicity. They proposed that FLANC may act as a novel therapeutic strategy in COAD [18]. Additionally, many researches have suggested the potency of lncRNAs as biomarkers in the blood and serum. They suggested microvesicles and exosomes as carriers, being protected and stabilized in circulation [19]. In the current study, we comprehensively investigate lncRNAs, miRNAs, and mRNAs expressions from a public database, “Cancer Genome Atlas (TCGA)” and we constructed a ceRNA network in COAD. Also, we proposed novel potential biomarkers for COAD.

Methods

Sample and data collection

Clinical data of COAD were retrieved from the TCGA database (https://portal.gdc.cancer.gov/repository). The inclusion criteria were: (1) the histopathological diagnosis was COAD; (2) having complete demographic data including age, vital status, race, ethnicity, pathological stage, TNM classification, and overall survival time. Totally, 459 COAD were enrolled in this study. Two hundred and thirty participants had age > 68 years and 229 patients had age ≤ 68 and 243 and 216 patients were male and female, respectively. Among 459 patients, only 4 patients were Hispanic or Latino and 271 were non-Hispanic or non-Latino. Two hundred and fourteen patients were white, 29 patients were Black or African American, 11 were Asian and 1 American Indian/Alaska native. Pathological stages of I, II, III, and IV were 76, 178, 129 and 65, respectively. The clinical characteristics are summarized in Table 1.
Table 1

Clinicopathological characteristics of COAD patients

CharacteristicsN(%)
Age (year) (mean ± SD)66.92 (13)
Age > 6823050.1
Age ≤ 6822949.9
Sex
 Male24352.9
 Female21647.1
Ethnicity
 Hispanic or Latino40.9
 Not Hispanic or Latino27159
 NA18440.1
Race
 American Indian or Alaska Native10.2
 Asian112.4
 Black or African American5912.9
 White21446.6
 NA17437.9
Vital status
 Alive35777.8
 Dead10222.2
Pathologic (stage)
 Stage I7616.5
 Stage II17838.7
 Stage III12928.1
 Stage IV6514.1
Pathologic (T)
 T1112.4
 T27817
 T331368.2
 T45612.2
 Tis10.2
Pathologic (M)
 M033773.4
 M16514.2
 MX5010.9
 NA71.5
Pathologic (N)
 N027058.8
 N110623.1
 N28318

NA Not Applicable

Clinicopathological characteristics of COAD patients NA Not Applicable

RNA-Seq and miR-Seq data analysis

RNA-Seq and miR-Seq Level 3 data were collected from the TCGA database. The raw count of the reads of RNA-Seq and miR-Seq data was normalized by Voom and TMM normalization methods. All the analyses were conducted in R software. The “limma” package in R software was utilized to indicate the differentially expressed mRNAs (DEmRNAs), lncRNAs (DElncRNAs), and miRNAs (DEmiRNAs) between normal solid tissues and primary tumors. The concluded data were filtered based on the |log2 fold change (FC)| > 1 for DEmRNA, DElncRNA, and DEmiRNA. P-value < 0.05 and false discovery rate (FDR) < 0.05 were considered as significant thresholds.

Functional enrichment analysis and protein-protein interaction (PPI) network

For in silico functional enrichment analysis, gene ontology (GO) in three domains including biological processes, cellular components, and molecular functions, in addition to Kyoto Encyclopedia of Genes and Genomes (KEGG) signaling pathways were conducted. The GO and KEGG outputs were visualized by R software (ggplot2 package). The PPI network was constructed based on the STRING online database by Cytoscape 3.7.2. Molecular Complex Detection (MCODE) was used to analyze and predict the interactions (score value > 0.4).

LncRNA-miRNA-mRNA ceRNA network construction

LncRNA-miRNA-mRNA ceRNA network was constructed by “GDCRNATools” (http://bioconductor.org/packages/devel/ bioc/html/GDCRNATools.html) package in R software based on starbase database [14]. The nodes and edges were virtualized by Cytoscape 3.7.2.

Statistical analysis

All the differentially expressed data were analyzed by using R software (3.5.2) through the “GDCRNATools” package. Kaplan-Meier survival analysis (log-rank test) was used to indicate the relation between over or downregulation of RNA, based on median expression with patient’s survival time. ROC curve analysis, univariate, and multivariate Cox regression analysis were conducted by SPSS v21. P-value < 0.05 was considered as a significant threshold.

Results

Differentially expressed genes

Our data demonstrated that 2995 mRNAs (1094 up-regulated and 1901 down-regulated) were differentially expressed in COAD. Moreover, 205 lncRNAs (128 up-regulated and 77 down-regulated) were identified that were deferentially expressed in patients. Three hundred and forty-five miRNAs containing 183 up-regulated and 162 down-regulated have been found with differential expression in the COAD samples. The data are presented in Figs. 1, 2 and Tables 2, 3.
Fig. 1

Bar graph of the differentially expressed genes in the COAD samples. TEC: To be Experimentally Confirmed; TR: T cell receptor; IG: Immunoglobulin

Fig. 2

Volcano plot of the differentially expressed genes and miRNAs. a Volcano plot of differentially expressed lncRNAs and mRNAs. Overexpressed genes are demonstrated in red and down-regulated genes are demonstrated in green. b Volcano plot of differentially expressed miRNA. Overexpressed and down-regulated genes are demonstrated in red and green, respectively

Table 2

Top 20 upregulated mRNAs, lncRNAs, and miRNAs

symbollogFCAveExprtPValueFDRB
mRNA
 ENSG00000167755KLK67.792.0411.380.000.0050.14
 ENSG00000170373CST17.342.5710.840.000.0045.36
 ENSG00000137673MMP77.024.169.010.000.0030.34
 ENSG00000167767KRT806.754.5016.830.000.00104.28
 ENSG00000185269NOTUM6.672.828.130.000.0023.79
 ENSG00000123500COL10A16.432.489.400.000.0033.40
 ENSG00000062038CDH36.335.9621.720.000.00157.90
 ENSG00000164379FOXQ15.944.4811.520.000.0051.46
 ENSG00000165376CLDN25.905.637.410.000.0018.76
 ENSG00000164283ESM 15.682.1023.050.000.00172.24
 ENSG00000105989WNT25.642.3517.660.000.00113.19
 ENSG00000060718COL11A15.573.749.200.000.0031.81
 ENSG00000186007LEMD15.370.4714.010.000.0075.14
 ENSG00000181577C6orf2235.364.4914.020.000.0075.26
 ENSG00000108244KRT235.333.675.860.000.009.46
 ENSG00000015413DPEP15.296.347.840.000.0021.56
 ENSG00000175832ETV45.276.2419.640.000.00135.08
 ENSG00000115507OTX15.260.5915.240.000.0087.64
 ENSG00000178773CPNE75.043.8810.470.000.0042.15
 ENSG00000185479KRT6B5.021.248.600.000.0027.26
LncRNA
 ENSG00000214039LINC024187.401.889.020.000.0030.43
 ENSG00000230316FEZF1-AS16.45−0.0911.220.000.0048.72
 ENSG00000253929CASC215.30−0.6013.650.000.0071.58
 ENSG00000281406BLACAT15.151.4815.000.000.0085.11
 ENSG00000229404LINC008584.84−0.9510.670.000.0043.88
 ENSG00000275216AL161431.14.740.988.300.000.0025.05
 ENSG00000259485LINC022534.70−0.549.850.000.0036.98
 ENSG00000236081ELFN1-AS14.542.1911.880.000.0054.68
 ENSG00000237686AL109615.34.440.1018.560.000.00122.90
 ENSG00000245694CRNDE4.171.1812.860.000.0063.93
 ENSG00000254560BBOX1-AS14.020.219.710.000.0035.91
 ENSG00000204876AC021218.13.273.709.130.000.0031.18
 ENSG00000226476LINC017483.02−0.287.360.000.0018.54
 ENSG00000262188LINC019783.010.539.070.000.0030.80
 ENSG00000253414AC124067.22.980.789.770.000.0036.38
 ENSG00000214049UCA12.942.885.570.000.007.82
 ENSG00000265688MAFG-AS12.942.2017.510.000.00111.62
 ENSG00000230061TRPM2-AS2.811.055.770.000.009.02
 ENSG00000253161LINC016052.800.6310.920.000.0046.10
 ENSG00000255026AC136475.32.711.186.520.000.0013.26
logFCAveExprtPValueFDRB
miRNA
 hsa-miR-374a-3p7.699.5515.730.000.0090.87
 hsa-miR-135b-5p6.455.858.480.000.0026.20
 hsa-miR-21-5p6.3217.4427.960.000.00219.16
 hsa-miR-19b-3p6.317.4813.270.000.0066.88
 hsa-miR-142-3p6.2010.8111.470.000.0050.44
 hsa-miR-19a-3p6.165.3610.030.000.0038.20
 hsa-miR-424-5p6.076.7512.540.000.0059.99
 hsa-miR-142-5p6.046.1212.360.000.0058.34
 hsa-miR-542-3p5.747.0014.220.000.0075.91
 hsa-miR-5775.595.625.910.000.009.83
 hsa-miR-29b-3p5.409.4013.880.000.0072.73
 hsa-miR-126-5p5.247.0613.040.000.0064.66
 hsa-miR-32-5p5.234.6314.040.000.0074.18
 hsa-miR-33a-5p5.135.147.480.000.0019.30
 hsa-miR-582-3p5.088.2811.930.000.0054.45
 hsa-miR-203b-3p5.057.077.030.000.0016.42
 hsa-miR-101-3p5.0112.4721.160.000.00148.02
 hsa-miR-18a-5p4.994.509.330.000.0032.60
 hsa-miR-4294.938.3311.450.000.0050.22
 hsa-miR-374a-5p4.914.7712.570.000.0060.33
Table 3

Top 20 downregulated mRNAs, lncRNAs, and miRNAs

symbollogFCAveExprtPValueFDRB
mRNA
 ENSG00000104267CA2−5.615.14−18.170.000.00118.92
 ENSG00000248144ADH1C−5.613.53−17.140.000.00107.73
 ENSG00000007306CEACAM7−5.626.08−11.410.000.0050.10
 ENSG00000269404SPIB−5.710.57−34.770.000.00298.79
 ENSG00000168079SCARA5−6.001.08−30.100.000.00250.25
 ENSG00000109182CWH43−6.01−0.28−23.360.000.00176.45
 ENSG00000080493SLC4A4−6.012.61−23.500.000.00178.10
 ENSG00000016490CLCA1−6.374.95−9.290.000.0032.04
 ENSG00000142959BEST4−6.530.29−45.790.000.00402.65
 ENSG00000196616ADH1B−6.690.74−26.470.000.00210.91
 ENSG00000091138SLC26A3−6.795.53−11.680.000.0052.58
 ENSG00000197273GUCA2A−6.812.83−24.000.000.00183.57
 ENSG00000167080B4GALNT2−6.820.89−21.910.000.00160.29
 ENSG00000204936CD177−7.451.83−26.620.000.00212.57
 ENSG00000100604CHGA−7.451.25−25.150.000.00196.43
 ENSG00000167434CA4−7.771.59−23.020.000.00172.71
 ENSG00000071203MS4A12−7.930.78−24.350.000.00187.48
 ENSG00000174992ZG16−8.122.90−18.310.000.00120.48
 ENSG00000016602CLCA4−8.412.08−22.490.000.00166.82
 ENSG00000103375AQP8−9.020.89−27.390.000.00220.97
LncRNA
 ENSG00000186594MIR22HG−1.923.37−17.540.000.00112.01
 ENSG00000227258SMIM2-AS1−1.921.43−9.980.000.0037.78
 ENSG00000167912AC090152.1−1.960.17−10.720.000.0044.25
 ENSG00000224259LINC01133−1.973.44−12.970.000.0064.65
 ENSG00000167117LINC00483−2.013.21−11.420.000.0050.12
 ENSG00000225953SATB2-AS1−2.022.74−8.480.000.0025.84
 ENSG00000266036AC016888.1−2.120.23−13.730.000.0072.38
 ENSG00000229155LINC02038−2.140.02−16.450.000.00100.45
 ENSG00000268388FENDRR−2.173.13−12.790.000.0062.96
 ENSG00000237070AC005550.3−2.190.16−9.160.000.0031.33
 ENSG00000276855AC015922.4−2.220.04−12.810.000.0063.38
 ENSG00000258837AL133370.1−2.370.49−7.890.000.0021.86
 ENSG00000198788MUC2−2.408.10−4.330.000.001.46
 ENSG00000229619MBNL1-AS1−2.411.31−19.670.000.00135.40
 ENSG00000259342AC025580.1−2.461.56−12.250.000.0057.93
 ENSG00000225335AC016027.1−2.500.61−26.880.000.00215.20
 ENSG00000188242PP7080−2.754.85−14.610.000.0081.01
 ENSG00000224189HAGLR−2.912.84−15.850.000.0093.94
 ENSG00000226777FAM30A−3.26−0.28−13.100.000.0066.15
 ENSG00000256643LINC02441−3.29−0.05−13.320.000.0068.27
logFCAveExprtPValueFDRB
miRNA
 hsa-miR-378a-5p−4.206.76−15.800.000.0091.92
 hsa-miR-1180-3p−4.233.03−11.140.000.0047.49
 hsa-miR-150-3p−4.270.56−9.240.000.0031.83
 hsa-miR-671-3p−4.281.31−16.200.000.0095.92
 hsa-let-7d-3p−4.348.50−18.590.000.00120.89
 hsa-miR-125a-5p−4.397.80−16.530.000.0099.38
 hsa-miR-1976−4.442.84−18.100.000.00115.64
 hsa-miR-1306-5p−4.462.46−17.480.000.00109.21
 hsa-miR-149-5p−4.493.27−13.050.000.0064.85
 hsa-miR-766-3p−4.502.50−18.100.000.00115.63
 hsa-miR-194-3p−4.517.68−13.590.000.0070.05
 hsa-miR-133a-3p−4.593.71−8.320.000.0024.94
 hsa-miR-197-3p−4.977.87−22.640.000.00163.76
 hsa-miR-642a-5p−5.071.39−11.470.000.0050.39
 hsa-miR-6511b-3p−5.370.31−17.290.000.00107.15
 hsa-miR-139-5p−5.384.32−17.290.000.00107.26
 hsa-miR-328-3p−5.633.77−21.460.000.00151.16
 hsa-miR-129-5p−5.760.92−13.350.000.0067.73
 hsa-miR-139-3p−6.101.99−14.440.000.0078.35
 hsa-miR-486-5p−6.125.62−14.350.000.0077.43
Bar graph of the differentially expressed genes in the COAD samples. TEC: To be Experimentally Confirmed; TR: T cell receptor; IG: Immunoglobulin Volcano plot of the differentially expressed genes and miRNAs. a Volcano plot of differentially expressed lncRNAs and mRNAs. Overexpressed genes are demonstrated in red and down-regulated genes are demonstrated in green. b Volcano plot of differentially expressed miRNA. Overexpressed and down-regulated genes are demonstrated in red and green, respectively Top 20 upregulated mRNAs, lncRNAs, and miRNAs Top 20 downregulated mRNAs, lncRNAs, and miRNAs

GO enrichment and KEGG pathway analysis

GO enrichment analysis demonstrated that the differentially expressed mRNAs were enriched in different biological processes such as leukocyte migration, extracellular matrix organization, T cell activation, mitotic nuclear division, and adaptive immune response. Furthermore, GO analysis in cellular component revealed that the differentially expressed mRNAs predominantly contributed to collagen-containing extracellular matrix, basement membrane, microvillus, apical part of cell, and external side of plasma membrane. GO molecular function domain indicated that the genes were mainly enriched in glycosaminoglycan binding, heparin binding, sulfur compound binding, extracellular matrix structural constituent, and cytokine activity. GO outputs are presented in Fig. 3. In addition, KEGG pathway analysis indicated that the differentially expressed genes in the COAD patients remarkably participated in pathways involving in cancer, cell cycle, PPAR signaling pathway, PI3K-Akt signaling pathway, Wnt signaling pathway, and p53 signaling pathway (Fig. 4).
Fig. 3

GO enrichment analysis of the differentially expressed mRNAs in COAD (Top 10 GO enrichment are presented)

Fig. 4

KEGG signaling pathway analysis of the differentially expressed mRNAs in COAD (Top 20 KEGG terms are presented)

GO enrichment analysis of the differentially expressed mRNAs in COAD (Top 10 GO enrichment are presented) KEGG signaling pathway analysis of the differentially expressed mRNAs in COAD (Top 20 KEGG terms are presented)

PPI network construction

The PPI network was constructed based on the STRING database to better understand the roles of the differentially expressed mRNAs. The data demonstrated that CDKN2A, CCND1, MYC, E2F, CDK4, BRCA2, CDC25B, and CDKN1A were the protein-protein interaction (PPI) critical hubs (Fig. 5).
Fig. 5

Protein-protein interaction (PPI) network of the differentially mRNAs in COAD (score > 0.4) with Node:118, eadge:1745, MCADE score: 29.82

Protein-protein interaction (PPI) network of the differentially mRNAs in COAD (score > 0.4) with Node:118, eadge:1745, MCADE score: 29.82

Kaplan-Meier survival analysis of differentially expressed genes

Kaplan-Meier survival analysis was used to indicate the association of differentially expressed mRNAs, lncRNAs, miRNA, and prognosis of COAD patients. The data showed that 215 mRNAs, 14 lncRNAs, and 39 miRNAs were associated with overall survival time in the patients. The top 10 hits of each group are presented in Table 4.
Table 4

Top 10 mRNAs, lncRNAs, and miRNAs that were associated with overall survival

symbolHRlower95upper95p Value
mRNA
 ENSG00000204314PRRT12.111.433.120.00
 ENSG00000179528LBX22.091.423.080.00
 ENSG00000108852MPP22.081.413.070.00
 ENSG00000225968ELFN11.991.352.930.00
 ENSG00000258839MC1R1.941.322.860.00
 ENSG00000187730GABRD1.941.312.860.00
 ENSG00000163083INHBB1.921.302.830.00
 ENSG00000204389HSPA1A1.911.292.810.00
 ENSG00000124191TOX21.881.282.770.00
 ENSG00000198467TPM21.831.242.700.00
LncRNA
 ENSG00000262251AC087388.11.861.262.740.00
 ENSG00000226419SLC16A1-AS11.831.242.690.00
 ENSG00000236081ELFN1-AS11.741.182.570.01
 ENSG00000267523AC008735.21.661.122.450.01
 ENSG00000226332AL354836.11.661.122.440.01
 ENSG00000273142AC073335.21.511.022.220.04
 ENSG00000278709NKILA1.511.022.220.04
 ENSG00000254815AP006284.11.501.022.220.04
 ENSG00000234432AC092171.31.491.012.190.05
 ENSG00000228109MELTF-AS11.481.002.180.05
miRNA
 hsa-miR-130a-3p1.841.242.720.00
 hsa-miR-210-3p1.791.212.650.00
 hsa-miR-193a-3p1.781.212.630.00
 hsa-miR-887-3p1.761.192.590.01
 hsa-miR-34a-5p1.691.142.500.01
 hsa-miR-34c-5p1.661.122.450.01
 hsa-miR-26b-5p1.651.112.430.01
 hsa-miR-193b-5p1.631.102.400.02
 hsa-miR-328-3p1.621.102.400.02
 hsa-miR-1271-5p1.611.092.380.02
Top 10 mRNAs, lncRNAs, and miRNAs that were associated with overall survival

Diagnostic analysis of differentially expressed lncRNAs

AUC analysis was conducted to demonstrate the diagnostic value of each lncRNAs in the COAD samples. All 205 differentially expressed lncRNAs indicated significant diagnostic values. The top 50 hits of the lncRNAs are summarized in Table 5.
Table 5

Top 50 lncRNAs that had significant diagnostic value

LncRNAsymbolAreaSEp-valueLower BoundUpper BoundExpression
ENSG00000249859PVT11.000.000.001.001.00High
ENSG00000265688MAFG-AS11.000.000.000.991.00High
ENSG00000237686AL109615.30.990.000.000.981.00High
ENSG00000232956SNHG150.980.010.000.971.00High
ENSG00000281406BLACAT10.980.010.000.970.99High
ENSG00000236081ELFN1-AS10.980.010.000.970.99High
ENSG00000245694CRNDE0.980.010.000.970.99High
ENSG00000163597SNHG160.980.010.000.960.99High
ENSG00000186594MIR22HG0.970.010.000.010.04Low
ENSG00000225335AC016027.10.970.020.000.000.06Low
ENSG00000253929CASC210.970.010.000.960.98High
ENSG00000268388FENDRR0.970.010.000.020.05Low
ENSG00000255717SNHG10.970.010.000.950.98High
ENSG00000203497PDCD4-AS10.970.020.000.000.07Low
ENSG00000256643LINC024410.960.020.000.010.07Low
ENSG00000280798LINC002940.960.010.000.010.06Low
ENSG00000270820AC016727.10.960.010.000.020.06Low
ENSG00000272686AC006333.20.960.010.000.020.06Low
ENSG00000230316FEZF1-AS10.960.010.000.940.98High
ENSG00000262001DLGAP1-AS20.960.010.000.940.98High
ENSG00000177410ZFAS10.960.010.000.940.98High
ENSG00000224189HAGLR0.960.010.000.020.06Low
ENSG00000253161LINC016050.960.010.000.940.98High
ENSG00000270959LPP-AS20.960.010.000.030.06Low
ENSG00000196756SNHG170.960.010.000.930.98High
ENSG00000272106AL691432.20.960.010.000.020.07Low
ENSG00000228109MELTF-AS10.950.010.000.930.98High
ENSG00000261373VPS9D1-AS10.950.010.000.940.97High
ENSG00000229619MBNL1-AS10.950.010.000.030.07Low
ENSG00000234753FOXP4-AS10.950.010.000.930.97High
ENSG00000281376ABALON0.950.010.000.930.97High
ENSG00000276855AC015922.40.950.010.000.030.07Low
ENSG00000229155LINC020380.950.020.000.010.10Low
ENSG00000226380AC016831.10.950.010.000.930.97High
ENSG00000253414AC124067.20.950.010.000.930.97High
ENSG00000266680AL135905.20.950.020.000.020.09Low
ENSG00000256940AP001453.20.940.020.000.910.97High
ENSG00000243479MNX1-AS10.940.010.000.910.97High
ENSG00000245910SNHG60.940.010.000.910.96High
ENSG00000272502AC104958.20.940.010.000.920.96High
ENSG00000172965MIR4435-2HG0.940.020.000.890.98High
ENSG00000236144TMEM147-AS10.930.010.000.910.96High
ENSG00000214039LINC024180.930.010.000.910.96High
ENSG00000272462U91328.20.930.010.000.040.09Low
ENSG00000280206AC026401.30.930.020.000.900.97High
ENSG00000205664BX890604.10.930.020.000.910.96High
ENSG00000262585LINC019790.930.010.000.910.95High
ENSG00000262188LINC019780.930.010.000.900.96High
ENSG00000166770ZNF667-AS10.930.010.000.050.10Low
ENSG00000232442MHENCR0.930.010.000.900.95High
Top 50 lncRNAs that had significant diagnostic value

Novel lncRNA biomarkers

After merging the overall survival, and the diagnostic value data, we found that 14 lncRNAs had high ranks in prognostic and diagnostic areas which can be considered as COAD biomarkers. The data are presented in Table 6.
Table 6

the lncRNAs as diagnostic and prognostic biomarkers in COAD

LncRNAsymbolAreaSEp-valueExpressionHRpValue
ENSG00000262251AC087388.10.890.020.00High1.860.00
ENSG00000226419SLC16A1-AS10.880.020.00High1.830.00
ENSG00000236081ELFN1-AS10.980.010.00High1.740.01
ENSG00000254290AC124067.40.790.020.00High0.580.01
ENSG00000265415AC099850.30.890.020.00High0.600.01
ENSG00000267523AC008735.20.780.030.00High1.660.01
ENSG00000226332AL354836.10.860.020.00High1.660.01
ENSG00000268388FENDRR0.970.010.00Low0.600.01
ENSG00000260920AL031985.30.870.020.00High0.640.03
ENSG00000278709NKILA0.790.020.00High1.510.04
ENSG00000254815AP006284.10.760.030.00High1.500.04
ENSG00000273142AC073335.20.830.020.00High1.510.04
ENSG00000234432AC092171.30.880.020.00High1.490.05
ENSG00000228109MELTF-AS10.950.010.00High1.480.05
the lncRNAs as diagnostic and prognostic biomarkers in COAD Kaplan-Meier and ROC curve analysis were conducted for the top three lncRNAs (AC087388.1, SLC16A1-AS1, and ELFN1-AS1) from aforementioned analysis shown in Fig. 6. Moreover, univariate and multivariate analysis were conducted to demonstrate the power of the lncRNAs and to diminish the covariate effects. Univariate and multivariate analysis are summarized in Table 7.
Fig. 6

Kaplan-Meier and ROC curve analysis of the AC087388.1, SLC16A1-AS1, and ELFN1-AS1). a Kaplan-Meier curve of AC087388.1. b Kaplan-Meier curve of SLC16A1-AS. c Kaplan-Meier curve of ELFN1-AS1. d ROC curve of the lncRNAs

Table 7

Univariate and multivariate survival analyses of AC087388.1, SLC16A1-AS1, and ELFN1-AS1

AC087388SLC16A1-AS1ELFN1-AS1
Univariate analysisMultivariate analysisMultivariate analysisMultivariate analysis
HR95% CIPvalueHR95% CIPvalueHR95% CIPvalueHR95% CIPvalue
ENSG000002622511.451.141.850.001.531.012.310.04
ENSG000002264191.221.011.470.041.951.273.000.00
ENSG000002360811.151.011.310.041.861.232.810.00
Stage1&2/3&42.731.804.150.001.680.962.910.071.580.902.760.111.640.952.850.08
Pathologic_T (T1&2/T3&4)2.951.376.370.012.200.865.630.102.420.956.190.062.380.936.070.07
pathologic_M (M0/Mx)3.122.084.680.002.151.363.400.002.261.433.570.002.341.493.700.00
Pathologic_N (N0&1/N2)3.282.194.930.001.711.032.820.041.651.002.730.051.681.012.790.04
Sex (Female/Male)1.090.741.610.680.930.621.410.730.920.611.390.690.900.591.360.61
Age (≤65/> 65)1.761.182.630.012.251.463.480.002.491.613.860.002.291.493.540.00
Kaplan-Meier and ROC curve analysis of the AC087388.1, SLC16A1-AS1, and ELFN1-AS1). a Kaplan-Meier curve of AC087388.1. b Kaplan-Meier curve of SLC16A1-AS. c Kaplan-Meier curve of ELFN1-AS1. d ROC curve of the lncRNAs Univariate and multivariate survival analyses of AC087388.1, SLC16A1-AS1, and ELFN1-AS1 According to ceRNA hypothesis, which implicates that lncRNAs regulate mRNA expression level by competing the shared miRNAs in cells, a ceRNA network was built based on lncRNAs, mRNAs, and miRNAs expression in the samples based on starbase online tool in R software. The nodes and edges were drawn by Cytoscape 3.7.2. The ceRNA network data demonstrated important lncRNAs including MIR17HG, H19, SNHG1, KCNQ1OT1, MALAT1, GAS5, SNHG20, OR2A1-AS1, and MAGI2-AS3, which have implied in the development of COAD (Fig. 7).
Fig. 7

LncRNA-miRNA-mRNA ceRNA network construction of the differentially expressed genes in COAD (Red: LncRNA, Yellow: miRNA, and Green: mRNA)

LncRNA-miRNA-mRNA ceRNA network construction of the differentially expressed genes in COAD (Red: LncRNA, Yellow: miRNA, and Green: mRNA)

Discussion

LncRNAs regulate critical and canonical biological functions in different types of normal human cells and in a variety of tumor cells [20]. An escalating number of investigations have reported the function of lncRNAs in tumor proliferation, cell invasion and migration, chemotherapy resistance, and stemness capability in tumorigenesis and progression of COAD [21-23]. However, the exact underlying mechanisms of lncRNAs in progression of COAD are still unclear. So far, several different biological regulatory functions have been proposed for lncRNAs. Some previous studies have demonstrated that lncRNAs regulate mRNA expression via binding and sponging miRNA known as competing endogenous RNA theory, which generates a new aspect in the lncRNA regulatory mechanism [24, 25]. To the best of our knowledge, only a few investigations have displayed ceRNA networks between lncRNAs and miRNAs in COAD. Thus, a clear image of lncRNAs-miRNAs links still remains uncharacterized. In the current study, we studied the differentially expressed genes including lncRNAs, miRNAs, and mRNAs in the COAD patients based on TCGA database. Gene set enrichment by GO and KEGG signaling pathway identified the differentially expressed genes which were significantly enriched in cell proliferation, differentiation, protein phosphorylation, and signaling pathways. Furthermore, KEGG signaling pathway analysis demonstrated several canonical signaling pathways including Wnt, PI3K/Akt and PPAR signaling pathways that have been shown to contribute in tumor progression [26, 27]. A mounting of evidence has emphasized on Wnt/ β-catenin signaling pathway, promoting tumor growth, invasion and metastasis, and chemoresistance in COAD [28, 29]. For instance, it has been demonstrated that lncRNA H19 overexpression induces the EMT of colorectal cancer (CRC) cells by sponging miR-29b-3p to directly upregulate PGRN and activate Wnt axis [30]. Moreover, the up-regulation of lncRNA colorectal cancer-associated lncRNA (CCAL) promotes CRC progression through suppressing the activator protein 2α (AP-2α) to initiate Wnt/β-catenin signaling pathway [31]. In the present study, the KEGG analysis indicated that the peroxisome proliferator-activated receptor (PPAR) pathway contributes in Wnt signaling. It has been shown that the PPAR signaling pathway reduces cell proliferation and inhibits tumorigenesis in different types of cancers. Down-regulation of PPAR-α has been correlated with poor clinicopathological features of CRC that was remarkably higher in well to moderately differentiated adenocarcinoma than in mucinous adenocarcinoma [32]. In addition, lncRNA TINCR modulates PPAR signaling pathway through binding to miR-107 to up-regulate CD36 in CRC [33]. Recently, the PPAR aberration expression and its prime roles in gastrointestinal tract has been extensively reviewed [34]. It has been shown that PI3K/Akt signaling pathway had prominent roles in carcinogenesis of a variety of cancers particularly COAD. LncRNA AB073614 can take under control CRC growth and invasion by PI3K/Akt signaling pathway [35]. In addition, lncRNA SNHG7 elevated GALNT7 level and induced PI3K/Akt/mTOR pathway by sponging miR-34a in CRC cells [36]. Our ceRNA network data demonstrated important lncRNAs including MIR17HG, H19, SNHG1, KCNQ1OT1, MALAT1, GAS5, SNHG20, OR2A1-AS1, and MAGI2-AS3 which previously have been highlighted in the development of COAD. LncRNA MAGI2-AS3 have been discovered to play a crucial role as a tumor suppressor in breast cancer by targeting Fas/FasL in tumor cells [37]. Moreover, MAGI2-AS3 hampers hepatocellular carcinoma cell growth and its invasion through sponging miR-374b-5p to up-regulate SMG1 axis [38]. On the other hand, overexpression of MAGI2-AS3 has been explained to promote tumor progression by absorbing miR-141/200a and consequently, up-regulating ZEB1 which is an EMT promoting transcription factor, in gastric cancer cells [39]. MAGI2-AS3 up-regulation has also been shown to induce CRC proliferation and migration by modulating miR-3163 through upregulating TMEM106B [40]. LncRNA SNHG1 is a prominent lncRNA that is involved in a variety of cancers. SNHG1 expression is associated with unfavorable overall survival and tumor recurrence in patients with COAD. Moreover, SNHG1 promote cell growth and cell migration via upregulating EZH2 and miR-154a-5p in COAD [41]. LncRNA KCNQ1OT1 can promote EMT by decreasing miRNA-217 expression to upregulate ZEB1 axis in COAD [42]. Furthermore, KCNQ1OT1 has been demonstrated to promote chemoresistance of oxaliplatin by iR-34a/ATG4B pathway and it is associated with poor prognosis in COAD [43]. A previous study showed that lncRNA MALAT1 was remarkably upregulated in COAD cells. MALAT1 can promote metastasis of COAD via RUNX2 as a survival factor in tumor cells [44]. MALAT1 evokes EMT and angiogenesis via sponging miR-1265p to upregulate VEGFA, SLUG, and TWIST [45]. Several investigations demonstrated that lncRNA GAS5 can act as a tumor suppressor gene by different actions. It has been illustrated that GAS5 inhibited angiogenesis and metastasis via regulating Wnt signaling pathway in COAD cells [46]. Finally, lncRNA SNHG20 has been reported overexpressed prominently in CRC tissues in comparison to normal ones. Overexpression of SNHG20 was correlated with poor prognosis in the patients [47]. Although, there are several similar studies, the novelties of the current study include; an extensive exploration of lncRNA, mRNA and miRNA signatures, revealing the diagnostic and prognostic value of lncRNA, and constructing a COAD lncRNA-miRNA-mRNA ceRNA network. Hence, our data elucidated that, the suggested lncRNAs can be considered as potential promising biomarkers, which could drive tumorigenesis through hijacking canonical signaling pathways in COAD.

Conclusions

Our data highlighted the importance of lncRNA regulatory networks that might provide a promising therapeutic approach for clinical application by considering lncRNA hubs as potential efficient biomarkers.
  47 in total

1.  Long non-coding RNA TP73‑AS1 promotes colorectal cancer proliferation by acting as a ceRNA for miR‑103 to regulate PTEN expression.

Authors:  Zeming Jia; Jian Peng; Zhi Yang; Jie Chen; Ling Liu; Dongren Luo; Panxiang He
Journal:  Gene       Date:  2018-11-22       Impact factor: 3.688

2.  LncRNA HOTAIR is a Prognostic Biomarker for the Proliferation and Chemoresistance of Colorectal Cancer via MiR-203a-3p-Mediated Wnt/ß-Catenin Signaling Pathway.

Authors:  Zhigang Xiao; Zhan Qu; Zhikang Chen; Zhixue Fang; Ke Zhou; Zhongcheng Huang; Xiong Guo; Yang Zhang
Journal:  Cell Physiol Biochem       Date:  2018-04-16

3.  Construction and analysis of dysregulated lncRNA-associated ceRNA network in colorectal cancer.

Authors:  Yiping Zhu; Yinzhu Bian; Qun Zhang; Jing Hu; Li Li; Mi Yang; Hanqing Qian; Lixia Yu; Baorui Liu; Xiaoping Qian
Journal:  J Cell Biochem       Date:  2018-12-07       Impact factor: 4.429

4.  The Effect of LncRNA H19/miR-194-5p Axis on the Epithelial-Mesenchymal Transition of Colorectal Adenocarcinoma.

Authors:  Chang-Feng Li; Yong-Chao Li; Yun Wang; Li-Bo Sun
Journal:  Cell Physiol Biochem       Date:  2018-10-02

5.  The lncRNA CRNDE promotes colorectal cancer cell proliferation and chemoresistance via miR-181a-5p-mediated regulation of Wnt/β-catenin signaling.

Authors:  Peng Han; Jing-Wen Li; Bo-Miao Zhang; Jia-Chen Lv; Yong-Min Li; Xin-Yue Gu; Zhi-Wei Yu; Yun-He Jia; Xue-Feng Bai; Li Li; Yan-Long Liu; Bin-Bin Cui
Journal:  Mol Cancer       Date:  2017-01-13       Impact factor: 27.401

Review 6.  Exosomes: composition, biogenesis, and mechanisms in cancer metastasis and drug resistance.

Authors:  Ladan Mashouri; Hassan Yousefi; Amir Reza Aref; Ali Mohammad Ahadi; Fatemeh Molaei; Suresh K Alahari
Journal:  Mol Cancer       Date:  2019-04-02       Impact factor: 27.401

7.  MALAT1 regulates the transcriptional and translational levels of proto-oncogene RUNX2 in colorectal cancer metastasis.

Authors:  Qing Ji; Guoxiang Cai; Xuan Liu; Yi Zhang; Yan Wang; Lihong Zhou; Hua Sui; Qi Li
Journal:  Cell Death Dis       Date:  2019-05-16       Impact factor: 8.469

8.  Increased long noncoding RNA SNHG20 predicts poor prognosis in colorectal cancer.

Authors:  Cong Li; Li Zhou; Jun He; Xue-Qing Fang; Shao-Wen Zhu; Mao-Ming Xiong
Journal:  BMC Cancer       Date:  2016-08-19       Impact factor: 4.430

9.  The lncRNA NEAT1 activates Wnt/β-catenin signaling and promotes colorectal cancer progression via interacting with DDX5.

Authors:  Meng Zhang; Weiwei Weng; Qiongyan Zhang; Yong Wu; Shujuan Ni; Cong Tan; Midie Xu; Hui Sun; Chenchen Liu; Ping Wei; Xiang Du
Journal:  J Hematol Oncol       Date:  2018-09-05       Impact factor: 17.388

10.  LncRNA MAGI2-AS3 Is Regulated by BRD4 and Promotes Gastric Cancer Progression via Maintaining ZEB1 Overexpression by Sponging miR-141/200a.

Authors:  Dandan Li; Jingjie Wang; Meixin Zhang; Xinhui Hu; Jiajun She; Xuemei Qiu; Xudong Zhang; Li Xu; Ying Liu; Shanshan Qin
Journal:  Mol Ther Nucleic Acids       Date:  2019-11-15       Impact factor: 8.886

View more
  24 in total

1.  lncRNA GAS8-AS1 regulates cancer cell proliferation and predicts poor survival of patients with gastric cancer.

Authors:  Chao Li; Hui Wang; Song Meng; Jian Hong; Libin Yao; Yong Shao; Xiaocheng Zhu
Journal:  Oncol Lett       Date:  2021-12-14       Impact factor: 2.967

2.  Multimetric feature selection for analyzing multicategory outcomes of colorectal cancer: random forest and multinomial logistic regression models.

Authors:  Catherine H Feng; Mary L Disis; Chao Cheng; Lanjing Zhang
Journal:  Lab Invest       Date:  2021-09-18       Impact factor: 5.662

3.  Long non-coding RNA-KCNQ1OT1 mediates miR-423-5p/microfibril-associated protein 2 axis in colon adenocarcinoma.

Authors:  Xunhai Yin; Aimin Jiang; Zhibin Ma; Xi Lu; Dongyue Li; Yingying Chen
Journal:  Histol Histopathol       Date:  2021-10-27       Impact factor: 2.303

4.  Expression characteristics of long non-coding RNA in colon adenocarcinoma and its potential value for judging the survival and prognosis of patients: bioinformatics analysis based on The Cancer Genome Atlas database.

Authors:  Ruofan Li; Xu Gao; Haitao Sun; Lixin Sun; Xiaojian Hu
Journal:  J Gastrointest Oncol       Date:  2022-06

5.  THUMPD3-AS1 facilitates cell growth and aggressiveness by the miR-218-5p/SKAP1 axis in colorectal cancer.

Authors:  Yuwei Pu; Jinrong Wei; Yong Wu; Kui Zhao; Yongyou Wu; Shu Wu; Xiaodong Yang; Chungen Xing
Journal:  Cell Biochem Biophys       Date:  2022-05-10       Impact factor: 2.989

6.  Long noncoding RNA MAGI2-AS3 regulates the H2O2 level and cell senescence via HSPA8.

Authors:  Yingmin Zhang; Xinhua Qiao; Lihui Liu; Wensheng Han; Qinghua Liu; Yuanyuan Wang; Ting Xie; Yiheng Tang; Tiepeng Wang; Jiao Meng; Aojun Ye; Shunmin He; Runsheng Chen; Chang Chen
Journal:  Redox Biol       Date:  2022-06-30       Impact factor: 10.787

7.  A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations.

Authors:  Zhuangwei Shi; Han Zhang; Chen Jin; Xiongwen Quan; Yanbin Yin
Journal:  BMC Bioinformatics       Date:  2021-03-21       Impact factor: 3.169

8.  Long non-coding RNA AC087388.1 as a novel biomarker in colorectal cancer.

Authors:  Arash Poursheikhani; Mohammad Reza Abbaszadegan; Mohammad Amin Kerachian
Journal:  BMC Cancer       Date:  2022-02-21       Impact factor: 4.430

9.  Immune-Related Gene Expression Analysis Revealed Three lncRNAs as Prognostic Factors for Colon Cancer.

Authors:  Xiao-Liang Xing; Ti Zhang; Zhi-Yong Yao; Chaoqun Xing; Chunxiao Wang; Yuan-Wu Liu; Minjiang Huang
Journal:  Front Genet       Date:  2021-07-09       Impact factor: 4.599

10.  Biomarker Discovery for the Carcinogenic Heterogeneity Between Colon and Rectal Cancers Based on lncRNA-Associated ceRNA Network Analysis.

Authors:  Xin Qi; Yuxin Lin; Xingyun Liu; Jiajia Chen; Bairong Shen
Journal:  Front Oncol       Date:  2020-10-30       Impact factor: 6.244

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.