| Literature DB >> 33832120 |
Gu Xi1, Xu Ziyu1, Liu Yiting2, Liu Zonghang1, Zheng Lifeng1.
Abstract
ABSTRACT: Colon cancer patients suffer from high incidence and mortality rates worldwide. More novel molecular biomarkers should be used for the diagnosis and treatment of colon cancer. Long noncoding RNAs (lncRNAs) are found to be involved in colon cancer tumorigenesis and metastasis. This study aimed to identify novel lncRNAs in colon cancer.Two independent datasets (GSE70880 and GSE110715) were downloaded from the Gene Expression Omnibus database and merged with the sva package. R software was used to distinguish differentially expressed lncRNAs and mRNAs in the merged dataset. The competing endogenous RNA (ceRNA) network was constructed using differentially expressed lncRNAs and mRNAs with Cytoscape. Differentially expressed RNAs in the ceRNA network were further verified using the Cancer Genome Atlas database. Gene oncology analysis, Kyoto Encyclopedia of Genes and Genomes pathway enrichment and survival analysis were also performed to identify hub genes.A total of 99 differentially expressed lncRNAs and 95 differentially expressed mRNAs were identified in the merged database. Ten lncRNAs, 8 miRNAs, and 6 mRNAs were involved in the ceRNA network, in which LINC00114 and UCA1 were highly expressed in colon cancer. They were both associated with early tumor stages and might be used for the early diagnosis of colon cancer. High expression of LINC00114 can lead to poor overall survival of colon cancer patients. Furthermore, new pathways such as LINC00114/miR-107/PCKS5, UCA1/miR-107/PCKS5, and UCA1/miR-129-5p/SEMA6A were identified.Two novel lncRNAs (LINC00114 and UCA1) in colon cancer were identified by bioinformatics analysis. They might contribute to the occurrence and development of colon cancer. In addition, LINC00114 may be involved in the overall survival of colon cancer patients.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33832120 PMCID: PMC8036122 DOI: 10.1097/MD.0000000000025369
Source DB: PubMed Journal: Medicine (Baltimore) ISSN: 0025-7974 Impact factor: 1.817
Details of datasets and merged dataset from the GEO database.
| Series accession | Platform | Tumor samples | Normal samples | Total differentially expressed genes | Differentially expressed lncRNA | Differentially expressed mRNA |
| GSE70880 | GPL19748 | 20 | 20 | 884 (Up:379; Down:505) | 113 (Up:61; Down:52) | 771 (Up:318; Down:453) |
| GSE110715 | GPL18180 | 6 | 6 | 498 (Up:199; Down:299) | 277 (Up:108; Down:169) | 221 (Up:91; Down:130) |
| Merged | – | 26 | 26 | 194 (Up:83; Down:111) | 99 (Up:53; Down:46) | 95 (Up:30; Down:65) |
GEO = Gene Expression Omnibus.
The clinical information of colon cancer patients in TCGA database.
| Characteristics | Number of cases | Percentages (%) |
| Gender | ||
| Female | 180 | 46.75 |
| Male | 205 | 53.25 |
| Age | ||
| >60 | 273 | 70.91 |
| ≤60 | 112 | 29.09 |
| Vital status | ||
| Alive | 314 | 81.56 |
| Dead | 71 | 18.44 |
| Stage | ||
| Stage I | 66 | 17.14 |
| Stage II | 151 | 39.22 |
| Stage III | 103 | 26.75 |
| Stage IV | 54 | 14.03 |
| Unknow | 11 | 3.86 |
| Topography | ||
| T1 | 9 | 2.34 |
| T2 | 68 | 17.66 |
| T3 | 263 | 68.31 |
| T4 | 44 | 11.43 |
| Tis | 1 | 0.26 |
| Lymph node stage | ||
| N0 | 231 | 60 |
| N1 | 88 | 22.86 |
| N2 | 66 | 17.14 |
| Metastasis | ||
| M0 | 286 | 74.29 |
| M1 | 54 | 14.03 |
| Mx | 45 | 11.69 |
TCGA = the Cancer Genome Atlas.
Figure 1Volcano plot of differentially expressed genes in datasets from GEO database. (A) GSE70880 dataset. (B) GSE110715 dataset. (C) The merged dataset. Green dots indicates downregulated genes, red dots indicates upregulated genes, black dots indicates no significant difference in expression. FC = fold change, GEO = Gene Expression Omnibus.
A total of 194 differentially expressed genes are identified from the merged dataset.
| lncRNAs | Upregulated | LINC02595,LINC01614,LINC01655,CRNDE,AC009262,TRPM2,AL365226,LINC01186,LINC02418,AL513123,LINC00858,FEZF1,LINC01555,LINC02253,HECW1-IT1,LINC02487,FAM222A,AP000526,LINC01124,PVT1,LINC01594,AC104958,DLEU7,LINC02577,FIRRE,AC007128,LINC02331,LINC02257,AL135999,ARNTL2,UCA1,LINC00114,DUXAP8,MNX1,ZFAS1,LINC02263,CRAT37,CASC19,AL365181,AC099792,AL136987,AP000542,MIR44352HG,AC007405,CYTOR,MAP3K20,ELFN1,LUCAT1,AC100861,ISM1,LINC00920,LINC01235 |
| Downregulated | AF064860,DIRC3,AC104117,SLC25A21,AL391704,AC104260,TARID,NKAIN3IT1,AF165147,HAGLR,AC017015,AC026391,PCAT18,AL158211,B3GALT5,LINC02023,AL391807,AL591501,AC017067,ADAMTS9,CDKN2B,LINC02024,CARMN,GDNF,AL442638,AC073862,MIR4307HG,AL136317,LINC00702,CYP1B1,LINC01474,FENDRR,PWRN1,DPP10,AL024497,AL590302,AC006007,ADAMTS9,AC022034,AC100826,MAMDC2,AC007182,PGM5,HAND2,AC087463,AL136369 | |
| mRNAs | Upregulated | CEMIP,MMP1,WNT2,LEMD1,CHI3L1,SLC7A11,NKD2,INHBA,GJB3,MTHFD1L,TLX1,NEBL,HCAR3,NOX4,HSPH1,TRPM2,COL8A1,MUC5AC,ADAMTS2,CCNB1,GRIN2B,AKR1C8P,HECW2,MAP3K20,SIX1,TM4SF19,RNF43,SGIP1,KCNJ15,EDNRA |
| Downregulated | BLK,UNC5C,SLC22A18AS,DDR2,SEMA6A,ADIPOQ,ST6GALNAC6,PLP1,HSD17B2,MYH11,KRT222,PPARGC1B,CDON,TENM1,AC020907,HDAC9,TMEM72,SLC9A9,PAPPA2,KRTAP222,TMEFF2,TTLL7,DPF3,RAB9B,REP15,PDE7B,ANK2,AC124312,SLC16A9,LRRC3B,LPP,CSRP1,SLIT3,PPARGC1A,ADCYAP1R1,DCLK1,NOVA1,SPEG,P2RY1,FBXO32,DISP2,RIC3,EDIL3,PCSK5,ROR1,MYOCD,LMO3,STMN2,LGI1,FOXP2,GSTM3,JCHAIN,TNXB,B3GALT5,CFL2,UGT2A3,LMOD1,RNF150,INSL5,VSTM2A,MAL,PTCHD1,XKR4,TPH1,LYVE1 |
Figure 2Heatmap of differentially expressed lncRNAs and mRNAs in the merged dataset (26 colon cancer samples and 26 non-tumor samples). (A) Heatmap of differentially expressed lncRNAs. (B) Heatmap of differentially expressed mRNAs. The green color indicates downregulated genes; the red color indicates upregulated genes; the black color indicates no significant difference in expression.
Figure 3CeRNA network. Red diamond indicates lncRNAs. Green triangle indicates miRNAs and blue circle indicates mRNAs. ceRNA = competing endogenous RNA.
Figure 4Functional analysis of mRNAs in ceRNA network. (A) Go analysis of mRNAs. (B) KEGG pathway analysis of mRNAs. (C) PPI network of mRNAs. ceRNA = competing endogenous RNA, Go = Gene ontology, KEGG = Kyoto Encyclopedia of Genes and Genomes, PPI = protein–protein interaction.
Figure 5KM analysis result of LINC00114. KM = Kaplan–Meier.
Figure 6KM analysis result of SLC16A9. KM = Kaplan–Meier.
Figure 7LnRNAs expression levels. (A and B) The expression level of LINC00114 and UCA1 between tumors and normal tissues using TCGA database. (C and D) The expression level of LINC00114 and UCA1 in paired colon cancer tissues using TCGA database. (E and F) The correlations between LINC00114, UCA1 expression levels and clinicopathological variables. TCGA = the Cancer Genome Atlas.