| Literature DB >> 31402962 |
Cun Liao1, Xue Huang2, Yizhen Gong1, Qiuning Lin3.
Abstract
The aim of the present study was to investigate the interactions among messenger RNAs (mRNAs), microRNAs (miRNAs), and long noncoding RNAs (lncRNAs) in colorectal cancer (CRC), in order to examine its underlying mechanisms. The raw gene expression data was downloaded from the Gene Expression Omnibus (GEO) database. An online tool, GEO2R, which is based on the limma package, was used to identify differentially expressed genes. The co-expression between lncRNAs and mRNAs was identified utilizing the weighted gene co-expression analysis package of R to construct a coding non-coding (CNC) network. The function of the genes in the CNC network was determined by performing Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathways enrichment analysis. The interactions among miRNAs, mRNAs and lncRNAs were predicted using Lncbase and mirWalk to construct the competing endogenous RNA (ceRNA) network. The expression of the genes involved in the ceRNA network was further validated in The Cancer Genome Atlas dataset. A total of 3,183 dysregulated mRNAs, 78 dysregulated miRNAs and 2,248 dysregulated lncRNAs were screened in two GEO datasets. Combined with the results of the dysregulated genes, 169 genes were selected to construct the CNC network. 'p53 signaling pathway' and the 'cell cycle' were the most significant enriched pathways in the genes involved in the CNC network. Finally, a validated ceRNA network composed of 2 lncRNAs (MIR22HG and RP11-61I13.3), 5 miRNAs (hsa-miR-765, hsa-miR-198, hsa-miR-125a-3p, hsa-miR-149-3p and hsa-miR-650) and 5 mRNAs (ANK2, BTK, GBP2, PCSK5 and PDK4) was obtained. In conclusion, MIR22HG may regulate PCSK5, BTK and PDK4, and RP11-61I13.3 may regulate the ANK2, GBP2, PCSK5 through sponging miRNAs to act on the progression of CRC, and the potential function of these genes have been revealed. However, the diagnostic and prognostic value of these genes requires further validation.Entities:
Keywords: The Cancer Genome Atlas; bioinformatics analysis; colorectal cancer; competing endogenous RNA; weighted gene co-repression analysis
Year: 2019 PMID: 31402962 PMCID: PMC6676736 DOI: 10.3892/ol.2019.10605
Source DB: PubMed Journal: Oncol Lett ISSN: 1792-1074 Impact factor: 2.967
Clinical information of the patients included in the GSE109454 dataset (18).
| Sex | Age, years | TNM stage ( | CEA, ng/ml |
|---|---|---|---|
| Male | 55 | II | 12.43 |
| Male | 61 | IVA | 37.84 |
| Female | 56 | IIIB | 23.60 |
| Male | 59 | IIIA | 39.93 |
| Female | 53 | II | 15.51 |
| Female | 63 | IIIA | 18.89 |
CEA, carcinoembryonic antigen; TNM, Tumor-Node-Metastasis.
Clinical information of the patients included in the GSE41655 dataset (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE41655).
| Sex | Age, years | Pathological grade ( | pTNM stage ( |
|---|---|---|---|
| Male | 35 | Normal epithelium | |
| Male | 60 | Normal epithelium | |
| Male | 53 | Normal epithelium | |
| Male | 68 | Normal epithelium | |
| Female | 61 | Normal epithelium | |
| Female | 55 | Normal epithelium | |
| Male | 29 | Normal epithelium | |
| Male | 37 | Normal epithelium | |
| Female | 38 | Normal epithelium | |
| Female | 22 | Normal epithelium | |
| Female | 62 | Normal epithelium | |
| Male | 69 | Normal epithelium | |
| Female | 46 | Normal epithelium | |
| Male | 51 | Normal epithelium | |
| Female | 39 | Normal epithelium | |
| Male | 49 | Low-grade dysplasia | |
| Female | 51 | Low-grade dysplasia | |
| Male | 64 | High-grade dysplasia | |
| Male | 69 | Low-grade dysplasia | |
| Male | 54 | Low-grade dysplasia | |
| Male | 60 | Low-grade dysplasia | |
| Male | 61 | Low-grade dysplasia | |
| Female | 74 | Low-grade dysplasia | |
| Male | 64 | Low-grade dysplasia | |
| Male | 61 | High-grade dysplasia | |
| Female | 70 | High-grade dysplasia | |
| Male | 83 | High-grade dysplasia | |
| Female | 53 | High-grade dysplasia | |
| Male | 49 | High-grade dysplasia | |
| Female | 29 | High-grade dysplasia | |
| Male | 49 | High-grade dysplasia | |
| Female | 70 | High-grade dysplasia | |
| Male | 80 | High-grade dysplasia | |
| Male | 65 | High-grade dysplasia | |
| Male | 56 | High-grade dysplasia | |
| Female | 76 | High-grade dysplasia | |
| Male | 56 | High-grade dysplasia | |
| Male | 74 | High-grade dysplasia | |
| Male | 61 | High-grade dysplasia | |
| Male | 35 | Low-grade dysplasia | |
| Male | 59 | Low-grade dysplasia | |
| Female | 53 | High-grade dysplasia | |
| Female | 68 | Low-grade dysplasia | |
| Male | 61 | Low-grade dysplasia | |
| Female | 43 | Low-grade dysplasia | |
| Male | 37 | Low-grade dysplasia | |
| Male | 55 | High-grade dysplasia | |
| Male | 82 | High-grade dysplasia | |
| Female | 62 | Low-grade dysplasia | |
| Male | 72 | Low-grade dysplasia | |
| Male | 62 | Low-grade dysplasia | |
| Female | 34 | Low-grade dysplasia | |
| Female | 61 | Low-grade dysplasia | |
| Female | 58 | Low-grade dysplasia | |
| Female | 53 | Low-grade dysplasia | |
| Male | 76 | Low-grade dysplasia | |
| Male | 29 | Low-grade dysplasia | |
| Male | 37 | Low-grade dysplasia | |
| Male | 38 | Low-grade dysplasia | |
| Female | 22 | Low-grade dysplasia | |
| Male | 69 | Low-grade dysplasia | |
| Male | 69 | Low-grade dysplasia | |
| Male | 65 | Low-grade dysplasia | |
| Male | 49 | High-grade dysplasia | |
| Male | 69 | Low-grade dysplasia | |
| Male | 63 | Low-grade dysplasia | |
| Male | 57 | Low-grade dysplasia | |
| Female | 56 | Low-grade dysplasia | |
| Female | 69 | Low-grade dysplasia | |
| Female | 72 | Low-grade dysplasia | |
| Female | 65 | Low-grade dysplasia | |
| Male | 75 | Low-grade dysplasia | |
| Male | 57 | Low-grade dysplasia | |
| Female | 35 | Low-grade dysplasia | |
| Male | 69 | Adenocarcinoma | T2N0M0 |
| Male | 63 | Adenocarcinoma | T4N1M1 |
| Male | 64 | Adenocarcinoma | T3N1M0 |
| Female | 75 | Adenocarcinoma | T1N0M0 |
| Male | 49 | Adenocarcinoma | T3N1M0 |
| Female | 70 | Adenocarcinoma | T3N1M0 |
| Male | 80 | Adenocarcinoma | T1N0M0 |
| Male | 65 | Adenocarcinoma | T2N0M0 |
| Male | 56 | Adenocarcinoma | T3N0M0 |
| Female | 76 | Adenocarcinoma | T2N1M0 |
| Male | 56 | Adenocarcinoma | T1N0M0 |
| Female | 68 | Adenocarcinoma | T2N1M0 |
| Male | 74 | Adenocarcinoma | T4N1M0 |
| Male | 61 | Adenocarcinoma | T4N1M0 |
| Male | 70 | Adenocarcinoma | T3N1M0 |
| Female | 80 | Adenocarcinoma | T1N0M0 |
| Male | 71 | Adenocarcinoma | T2N0M0 |
| Male | 35 | Adenocarcinoma | T3N0M0 |
| Male | 59 | Adenocarcinoma | T3N0M0 |
| Male | 60 | Adenocarcinoma | T3N0M0 |
| Female | 68 | Adenocarcinoma | T2N0M0 |
| Male | 61 | Adenocarcinoma | T3N1M0 |
| Male | 51 | Adenocarcinoma | T3N1M0 |
| Male | 55 | Adenocarcinoma | T4N2M1 |
| Male | 66 | Adenocarcinoma | T3N1M0 |
| Male | 72 | Adenocarcinoma | T4N2M0 |
| Female | 34 | Adenocarcinoma | T4N2M0 |
| Female | 63 | Adenocarcinoma | T1N0M0 |
| Male | 63 | Adenocarcinoma | T1N0M0 |
| Male | 69 | Adenocarcinoma | T4N1M0 |
| Female | 69 | Adenocarcinoma | T2N0M0 |
| Male | 61 | Adenocarcinoma | T4N1M0 |
| Male | 45 | Adenocarcinoma | T2N0M0 |
pTNM, pathological Tumor-Node-Metastasis.
Figure 1.Identification of differentially expressed genes between tumor and normal samples in the GSE109454 dataset. (A) The volcano plot. The green dots indicate the significantly downregulated genes, the red dots indicate the significantly upregulated genes, while the blue dots indicate the genes with no significant difference. The horizontal axis represents the log2 (fold change) and the vertical axis represents the -log10 (P-value). (B and C) Heatmaps of the (B) differentially expressed mRNAs and (C) differentially expressed lncRNAs. Each row presents a dysregulated RNA transcript and each column represents a sample. Orange represents upregulation and blue represents downregulation.
Figure 2.Identification of DEMs between tumor and normal samples in the GSE41655 dataset. (A) The volcano plots of the DEMs between colorectal adenoma samples and normal colorectal samples. (B) The volcano plots of the DEMs between colorectal adenocarcinoma samples and normal colorectal samples. The green dots indicate the significantly downregulated genes, the red dots indicate the significantly upregulated genes and the blue dots indicate the genes with no significant difference. The horizontal axis represents the log2 (fold change) and the vertical axis represents the -log10 (P-value). (C) Heatmaps of DEMs between colorectal adenoma samples and normal colorectal samples. (D) Heatmaps of DEMs between colorectal adenocarcinoma samples and normal colorectal samples. Each row presents a dysregulated RNA transcript and each column represents a sample. Orange represents upregulation and blue represents downregulation. DEMs, differentially expressed mRNAs.
Figure 3.(A) Hierarchical clustering dendrogram of all the probe sets used in the weighted gene co-expression network analysis based on the 1-TOM dissimilarity measure. Each line of the dendrogram represents an individual probe. The multi-color bar represents the 39 modules identified using the dynamic tree cut algorithm. (B) Correlations between module eigengenes and tumor status. The numbers within the heatmap indicate correlations and P-values for the module-trait associations, respectively. Red wells indicate positive correlation, blue wells indicate negative correlation and white wells indicate no correlation.
Number of genes in the 39 modules.
| Module colors | Frequency |
|---|---|
| Black | 426 |
| Blue | 2,249 |
| Brown | 1,417 |
| Cyan | 274 |
| Dark green | 133 |
| Dark grey | 103 |
| Dark magenta | 56 |
| Dark olive green | 59 |
| Dark orange | 84 |
| Dark red | 145 |
| Dark turquoise | 103 |
| Green | 507 |
| Green yellow | 293 |
| Grey | 191 |
| Grey 60 | 223 |
| Light cyan | 246 |
| Light green | 177 |
| Light yellow | 169 |
| Magenta | 349 |
| Midnight blue | 264 |
| Orange | 97 |
| Pale turquoise | 60 |
| Pink | 391 |
| Plum 1 | 39 |
| Purple | 313 |
| Red | 439 |
| Royal blue | 154 |
| Saddle brown | 63 |
| Salmon | 282 |
| Sienna 3 | 52 |
| Sky blue | 72 |
| Sky blue 3 | 47 |
| Steel blue | 63 |
| Tan | 291 |
| Turquoise | 3,809 |
| Violet | 59 |
| White | 80 |
| Yellow | 909 |
| Yellow green | 48 |
Figure 4.Coding non-coding network. Red represents upregulation and blue represents downregulation. The V-shape represents lncRNA and circles represent mRNA.
Gene function enrichment analysis of the genes included in the coding non-coding network.
| Category | Term | Count | P-value | Genes |
|---|---|---|---|---|
| KEGG | Cell cycle | 5 | 1.66×10−3 | CDK1, MAD2L1, CCNB2, E2F5, ATR |
| p53 signaling pathway | 3 | 3.18×10−2 | CDK1, CCNB2, ATR | |
| BP | Cell division | 13 | 7.04×10−8 | CDK1, MAD2L1, CCNB2, KIF11, OIP5, NEK2, KIF20B, NUF2, SDCCAG3, CENPF, AURKA, UBE2C |
| Mitotic nuclear division | 10 | 2.17×10−6 | CDK1, KIF11, CCNB2, OIP5, NEK2, KIF20B, NUF2, CENPF, CENPW, AURKA | |
| Chromosome segregation | 6 | 1.58×10−5 | KIF11, OIP5, NEK2, NUF2, CENPF, CENPW | |
| DNA strand elongation involved in DNA replication | 4 | 4.21×10−5 | GINS1, RFC3, GINS4, POLD2 | |
| Positive regulation of telomere maintenance via telomerase | 4 | 4.34×10−4 | DKC1, NEK2, ATR, CCT6A | |
| DNA replication | 6 | 7.71×10−4 | CDK1, RFC3, GINS4, POLD2, ATR, DSCC1 | |
| DNA repair | 7 | 7.81×10−4 | CDK1, RAD51AP1, FANCD2, PARPBP, RUVBL2, ACTL6A, ATR | |
| CENP-A containing nucleosome assembly | 4 | 1.04×10−3 | CENPL, OIP5, CENPQ, CENPW | |
| DNA duplex unwinding | 4 | 1.11×10−3 | GINS1, GINS4, RUVBL2, RAD54B | |
| Sister chromatid cohesion | 5 | 1.35×10−3 | CENPL, MAD2L1, CENPQ, NUF2, CENPF | |
| Positive regulation of telomerase RNA localization to Cajal body | 3 | 2.15×10−3 | DKC1, RUVBL2, CCT6A | |
| Spindle organization | 3 | 2.45×10−3 | KIF11, AURKA, AUNIP | |
| Cell cycle | 6 | 3.38×10−3 | DTYMK, SDCCAG3, AURKA, ATR, CDKN3, SUV39H2 | |
| G2/M transition of mitotic cell cycle | 5 | 3.79×10−3 | CDK1, PLK4, CCNB2, NEK2, AURKA | |
| Regulation of mitotic nuclear division | 3 | 5.50×10−3 | MKI67, NEK2, KIF20B | |
| Anaphase-promoting complex-dependent catabolic process | 4 | 5.92×10−3 | CDK1, MAD2L1, AURKA, UBE2C | |
| Cytoplasmic translation | 3 | 5.96×10−3 | RPL6, RPL8, FTSJ1 | |
| Reciprocal meiotic recombination | 3 | 8.52×10−3 | CCNB1IP1, RAD54B, TRIP13 | |
| Histone H4 acetylation | 3 | 9.08×10−3 | NCOA1, RUVBL2, ACTL6A | |
| Positive regulation of type I hypersensitivity | 2 | 01.3871 | FCER1A, BTK | |
| Mitotic nuclear envelope disassembly | 3 | 1.78×10−2 | CDK1, CCNB2, RAE1 | |
| Regulation of mitotic centrosome separation | 2 | 1.85×10−2 | KIF11, NEK2 | |
| Interstrand cross-link repair | 3 | 2.18×10−2 | RAD51AP1, FANCD2, ATR | |
| Mitotic centrosome separation | 2 | 2.30×10−2 | KIF11, AURKA | |
| Regulation of growth | 3 | 2.52×10−2 | ARMC10, RUVBL2, ACTL6A | |
| Cell proliferation | 6 | 2.77×10−2 | CDK1, MKI67, DKC1, DLGAP5, DTYMK, CENPF | |
| Positive regulation of DNA-directed DNA polymerase activity | 2 | 3.21×10−2 | RFC3, DSCC1 | |
| Box C/D snoRNP assembly | 2 | 3.21×10−2 | NUFIP1, RUVBL2 | |
| Protein ubiquitination involved in ubiquitin-dependent protein catabolic process | 4 | 3.44×10−2 | CDK1, MAD2L1, AURKA, UBE2C | |
| Positive regulation of establishment of protein localization to telomere | 2 | 4.10×10−2 | DKC1, CCT6A | |
| Negative regulation of ubiquitin-protein ligase activity involved in mitotic cell cycle | 3 | 4.31×10−2 | CDK1, MAD2L1, UBE2C | |
| Positive regulation of ubiquitin-protein ligase activity involved in regulation of mitotic cell cycle transition | 3 | 4.87×10−2 | CDK1, MAD2L1, UBE2C |
Figure 5.Competing endogenous RNA network. Red represents upregulation and blue represents downregulation. The V-shape represents lncRNA, circles represent mRNA and triangles represent microRNA.
Figure 6.Gene expression of the selected long non-coding RNAs validated in The Cancer Genome Atlas dataset. The dysregulation of (A) MIR22HG, (B) CHL1-AS2 and (C) RP11-61I13.3. The red box corresponds to tumor samples and the grey box corresponds to non-tumor samples. The vertical axis represents the relative expression of genes. *P<0.05. COAD, colon adenocarcinoma; T, tumor; N, normal.
Figure 7.Gene expression of the selected mRNAs validated in The Cancer Genome Atlas dataset. The dysregulation of (A) ANK2, (B) ARMC10, (C) BTK, (D) CENPF, (E) DKC1, (F) FGL2, (G) GBP2, (H) GINS4, (I) PAICS, (J) PARPBP, (K) PCSK5, (L) PDK4, (M) RAD54B, (N) RAE1 and (O) TOMM34. The red box corresponds to tumor samples and the grey box corresponds to non-tumor samples. The vertical axis represents the relative expression of genes. *P<0.05. COAD, colon adenocarcinoma; T, tumor; N, normal.
Figure 8.Competing endogenous RNA network constructed from validated genes. Red represents upregulation and blue represents downregulation. V-shape represents lncRNA, circle represents mRNA, and triangle represents miRNA.