Literature DB >> 34227920

Comprehensive analysis of lncRNA-miRNA-mRNA regulatory networks for microbiota-mediated colorectal cancer associated with immune cell infiltration.

Xiangzhou Tan1,2,3, Linfeng Mao1,3, Changhao Huang1,3, Weimin Yang1,3, Jianping Guo1,3, Zhikang Chen1,3, Zihua Chen1,3.   

Abstract

Recent findings have identified microbiota as crucial participants in many disease conditions, including cancers. Competing endogenous RNA (ceRNA) is regarded as a candidate mechanism involving relevant biological processes. We therefore constructed a ceRNA network using the TCGA and GEO database, to determine the potential mechanisms of microbiota-mediated colorectal carcinogenesis and progression. We found a total of 75 lncRNAs, 8 miRNAs, and 9 mRNAs in the probiotics-mediated ceRNA network and a total of 49 lncRNAs, 4 miRNAs, and 3 mRNA in the pathobiont-mediated ceRNA network, which could induce the microbiota-mediated carcinogenesis and progression. The GO and KEGG analysis indicated that the ceRNA network is mainly enriched in the metabolic process, and two unique pathways (the p53 signaling pathway and microRNA in cancer), respectively. A four-gene signature (FRMD6-AS2, DIRC3, LIFR-AS1, and MRPL23-AS1) was suggested as an independent prognostic factor. Four lncRNAs (LINC00355, KCNQ1OT1, LINC00491, and HOTAIR) were associated with poor survival. Three small molecule candidate anticancer drugs (Pentoxyverine, Rimexolone, and Doxylamine) were identified. A four-gene signature (FAM129A, BCL2, PMAIP1, and RPS6) is significantly correlated with immune infiltration level. This study provides a promising biomarker reservoir to explore the mechanism by which microbiota regulate the ceRNA network involving the immune response, and further participate in colorectal carcinogenesis and progression.

Entities:  

Keywords:  Cerna network; bioinformatics; colorectal cancer; immune infiltration; lncRNA; miRNA; microbiota

Mesh:

Substances:

Year:  2021        PMID: 34227920      PMCID: PMC8806860          DOI: 10.1080/21655979.2021.1940614

Source DB:  PubMed          Journal:  Bioengineered        ISSN: 2165-5979            Impact factor:   3.269


Introduction

Colorectal cancer (CRC) is one of the most frequently diagnosed cancers worldwide, and it is also a leading cause of cancer mortality [1]. The burden of CRC is predicted to substantially increase due to the adoption of the western lifestyle in the next two decades [2]. Dietary habits are one of the crucial factors in colorectal carcinogenesis, which involves various processes [3], such as eliciting inflammation and producing immune responses. Dietary behaviors can tremendously influence the composition of the intestinal microbiota, leading to dysbiosis and affecting the susceptibility to intestinal diseases [4]. Dysbiosis is characterized by the overgrowth of pathogenic bacteria and the absence of beneficial bacteria, etc. The beneficial bacteria in the gut microbiota include Saccharomyces boulardi and Lactobacillus rhamnosus GG (LGG) [5]. The therapeutic feeding of probiotics was able to serve as adjuvants for the checkpoint immunotherapy to improve cancer care [6,7]. However, an increase in pathobionts is regarded as more pronounced than the decrease in probiotics during the development of adenocarcinoma [8]. Recently, Pleguezuelos et al. first reported the pathobionts with the colibactin-producing pks pathogenicity island have a direct role in the occurrence of oncogenic mutations [9,10]. Common pathobionts in the gut microbiome include Fusobacterium nucleatum (Fn), Escherichia coli, and Bacteroides fragilis, etc [11]. Undoubtedly, both pathobionts and probiotics contribute to colorectal carcinogenesis, although they play opposite roles in this process. Noncoding RNA is the predominant RNA in the human transcriptome. Although these noncoding RNAs do not translate into protein, it was reported in recent years that they accomplish a great variety of biological functions [12,13]. As one type of noncoding RNA, lncRNA is associated with colorectal carcinogenesis, tumor progression, and intestinal microbiota [14,15]. Recent studies have reported that not only the primary structures (i.e. nucleotide sequence) but also the secondary structures of lncRNAs are related to biological processes. The secondary structures of lncRNAs could act as a guide or scaffold via binding chromatin-modifying protein complexes [16,17]. By interacting with key histone-modification enzymes, lncRNAs could also directly participate in cancer epigenetic regulation [18,19]. In addition, the mechanism of competing endogenous RNA (ceRNA) is proposed as one of the most important regulatory pathways to explain how lncRNAs influence protein expression. The lncRNAs, known as miRNA ‘sponges’ or ‘decoys’, are able to compete for binding to the same miRNAs via attracting the miRNA recognition/response elements (MREs), subsequently relieving the inhibitory activity of miRNAs on mRNA targets [16,17]. Such a mechanism of lncRNA modulating the action of miRNA was first found in 2010 [20]. Therefore, we hypothesized that microbiota could regulate the expression level of lncRNA, and further mediate the function of mRNA by competitively binding to the corresponding miRNA, which is also the so-called ceRNA regulatory network [21,22]. For instance, Fang et al. elucidated similar molecular mechanisms of microbial products in the development of CRC [23]. However, no study has yet proposed a constructive reservoir for exploring the ceRNA regulatory system in microbiota-mediated CRC pathogenesis. In this study, we established ceRNA networks of microbiota-mediated CRC and comprehensively analyzed the biological activities regarding the networks, including enrichment analysis, survival analysis, Cox regression analysis, protein-protein interaction, etc. This is the first study to investigate the ceRNA network of the microbiota-mediated CRC. It could provide promising genetic candidates for future studies on the mechanism of CRC pathogenesis.

Materials and methods

Data collection and processing

All RNA data were downloaded from the TCGA database (GDC Data Portal http://portal.gdc.cancer.gov) and the GEO database (https://www.ncbi.nlm.nih.gov/). Those RNA probe sets were re-annotated using the Ensembl database (http://www.ensembl.org). The mRNA and lncRNA expression profiles were only obtained from TCGA. The miRNA expression profiles were detected from both the TCGA database and the GEO database. The GEO database was searched on 20 June 2020 with the keywords ‘microbe’, ‘bacteria’, ‘microbiota’, ‘colon’, ‘rectum’, ‘colorectal’, ‘cancer’, ‘carcinoma’, ‘neoplasm’, ‘miRNA’, and species such as ‘Homo sapiens’. Sixty-eight records corresponding to 24 series were obtained from the GEO database. After screening, 2 datasets (GSE79383 and GSE122182) were included [24,25], which analyzed the miRNA profiles of human colorectal tissue or cells lines in different microbiota environments. The characteristics of CRC patients in the TCGA database are shown in Table S1.

Differentially expressed RNA analysis

The differentially expressed lncRNAs, miRNAs, and mRNAs from the TCGA database were identified using the edgeR package of R software. Expression differences were defined using fold-change (FC) and the false discovery rate (FDR). |logFC| >2 and FDR <0.05 were considered statistically significant for lncRNAs, miRNAs, and mRNAs. RNA expression data were normalized by edgeR. The differentially expressed miRNAs (DEMs) from the GEO database were identified by GEO2R (http://www.ncbi.nlm.nih.gov/geo/geo2r/). |FC| >2 and p-values <0.05 were considered statistically significant. Finally, DEMs of the ceRNA network were obtained by the overlap of DEM from both the GEO database and the TCGA database.

CeRNA network construction

The ceRNA network was established based on the lncRNA-miRNA-mRNA axes. DEMs of the ceRNA network were transformed into human mature miRNA names from starBase v. 2.0 (http://sysu.edu.cn). LncRNA-miRNA interaction pairs were predicted based on the DEMs using the miRcode database (http://www.mircode.org). The miRcode is an online tool to computationally aid hypothesis generation starting from an lncRNA or miRNA of interest. MiRNA-mRNA interaction pairs were identified based on the DEMs using the miRDB, miTarBase, and TargetScan databases [26-28]. The miRTarBase database provides experimentally validated miRNA-target interactions while the other two databases provide the computationally predicted miRNA-target interactions. The target mRNAs were included only when they were reported in all three databases. Based on lncRNA-miRNA pairs and miRNA-mRNA pairs, the ceRNA networks of CRC in different microbiota were reconstructed through Cytoscape (v. 3.7.2) [29].

Functional and pathway enrichment analysis

DAVID is an online functional annotation tool (https://david.ncifcrf.gov/) [30], and it was used to analyze Gene Ontology (GO) function and the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis on differentially expressed RNAs in the ceRNA network. P-values <0.05 were considered statistically significant.

Identification of prognostic signatures

The clinical information in CRC was retrieved from the TCGA database, the genes expression data were then combined with the clinical data. The univariate and multivariate Cox proportional hazard regression analyses were performed to identify CRC prognosis-mediated RNAs. The hazard ratio of the clinical features (age, gender, stage, AJCC-Tumor classification, AJCC-lymph node classification, and AJCC-Metastasis classification) and the expression of DEGs were calculated with the survival package in R. P-values <0.05 were considered statistically significant.

Survival analysis

The clinical information and gene expression data for survival analysis were retrieved from the TCGA database. According to the median expression value of each RNA in the ceRNA network, the CRC samples were classified into 2 groups: high-expression groups and low-expression groups. The Kaplan-Meier survival curves were generated by the survival R package. P-values <0.05 were considered statistically significant.

Construction of protein-protein interaction (PPI) network using the STRING database and validation of hub genes

The PPI networks of microbiota-mediated signature were constructed by searching the differentially expressed RNAs of ceRNA networks in the retrieval of interacting genes/proteins (STRING) database [31]. Each node in the PPI network represents a protein or gene, and the edges between nodes represent their physical or functional interactions. The threshold of interaction scores was defined as median confidence ≥0.4. By calculating the degree of connectivity, the genes with degree ≥1 were considered hub genes. The expression of hub genes was then validated by using gene expression profiling interactive analysis (GEPIA) [32].

Identification of candidate small molecule drugs

The identified hub genes were used query the Connectivity Map (CMap) database to screen the potential drugs against microbiota-mediated CRC [33]. The connectivity scores refer to the efficacy of small molecule drugs. A positive score means that the drug is capable of inducing CRC, while a negative score means that the drug is able to reverse the disease progression, which indicates a potentially therapeutic drug. The interactive chemical structure models of top three candidate molecular drugs were investigated in PubChem database.

Immune cells infiltration analysis

The abundance of tumor-infiltrating immune cells (TIICs) of the identified hub genes was estimated using the TIMER database [34]. The distributions of immune cells, including CD8 + T cells, CD4 + T cells, B cells, neutrophils, macrophages, and dendritic cells (DCs) were exhibited to explore the relationship between gene expression and immune infiltration. The panel also displayed gene expression levels against tumor purity. The gene expression level was displayed with log2 TPM.

Results

In this study, we identified a set of microbiota-mediated biomarkers and constructed ceRNA networks in CRC, which provide novel perspectives for studying the potential mechanisms. In total, 75 DELs, 8 DEMs, and 9 DEGs in the probiotic-related ceRNA network were screened out, while 49 DELs, 4 DEMs, and 3 DEGs in the pathobiont-related ceRNA network were recognized. Furthermore, the candidate biomarkers were comprehensively analyzed by several functional analyses to explore the biological characteristics and therapeutic targets of the ceRNA networks.

Data set acquisition and identification of differentially expressed RNAs in CRC and the different microbiota

A flowchart is presented in Figure 1. From the TCGA database, 40 normal adjacent noncancerous tissues and 382 colorectal adenocarcinoma samples with both clinical data and RNA expression data were collected. When comparing normal tissue to adenocarcinoma tissue, there were 269 down-regulated lncRNAs, 799 up-regulated lncRNAs, 138 down-regulated miRNAs, 217 up-regulated miRNAs, 984 down-regulated mRNAs, and 1125 up-regulated mRNAs. Figure S1A, S1B, S1C, S2A, S2B, and S2C show the heat maps and volcano plots of DELs, DEMs, and DEGs in the TCGA database. From the GEO database, 2 datasets (GSE79383 and GSE122182) of miRNA expression related to microbiota were collected, including 1 dataset (GSE79383) for probiotics and 1 dataset (GSE122182) for pathobionts. By comparing the DEMs in the TCGA and GEO databases, there were 4 down-regulated miRNAs and 22 up-regulated miRNAs in the GSE79383 datasets and 15 down-regulated miRNAs and 14 up-regulated miRNAs in the GSE122182 datasets. Figures S1D, S1E, S2D, and S2E show the heat maps and volcano plots of DEMs in the GEO database. Also, the Venn diagrams of DEMs in Figure S3A and S3B show the commonly DEMs in different microbiota environments.
Figure 1.

The flow chart of bioinformatics analysis. Caco-2, human epithelial colorectal adenocarcinoma cells; LGG, Lactobacillus rhamnosus GG; B.caccae, Bacteroides caccae; CRC, Colorectal cancer; CR, Colorectum; Fn, Fusobacterium nucleatum; GDC, Genomic Data Commons Data Portal

The flow chart of bioinformatics analysis. Caco-2, human epithelial colorectal adenocarcinoma cells; LGG, Lactobacillus rhamnosus GG; B.caccae, Bacteroides caccae; CRC, Colorectal cancer; CR, Colorectum; Fn, Fusobacterium nucleatum; GDC, Genomic Data Commons Data Portal

CeRNA network construction and analysis

The ceRNA networks of CRC in a probiotic environment and a pathogenic environment are presented in Figures 2 and 3, respectively. In the probiotic environment, 75 DELs, 8 DEMs, and 9 DEGs were identified by comparing the lncRNA-miRNA interaction pairs and the miRNA-mRNA interaction pairs. The expression values of DEMs in the probiotic environment from both the TCGA database and the GEO database are shown in Table 1, including hsa-mir-429, hsa-mir-141, hsa-mir-140, hsa-mir-22, hsa-mir-132, hsa-mir-454, hsa-mir-153, and hsa-mir-143. Among them, the regulating directions of four miRNAs (hsa-mir-429, hsa-mir-140, hsa-mir-132, and hsa-mir-153) in the GEO database are opposite to those of miRNAs in the TCGA database, which follow the hypothesis that probiotics can reverse gene expression in tumor cells, and subsequently turn over the effects of CRC carcinogenesis and tumor progression. In the pathogenic environment, 49 DELs, 4 DEMs, and 3 DEGs were identified using the same methods. Table 2 displays the expression values of DEMs in the pathogenic environment, including hsa-mir-223, hsa-mir-32, hsa-mir-96, and hsa-mir-106a. The regulating directions of three miRNAs (hsa-mir-223, hsa-mir-96, and hsa-mir-106a) in the GEO database are identical to those of miRNAs in the TCGA database, which follow the principles that pathobionts can promote the expression of oncogenes or inhibit the expression of tumor suppressors, further promoting the effect of CRC carcinogenesis and tumor progression.
Table 1.

The expression of 8 DEmiRNAs in probiotic environment from TCGA and GEO database

miRNATCGA (colorectal cancer vs normal)
GEO (with probiotics VS without probiotics)
Role
LogFCFDRLogFCP.valueGroup Set
hsa-mir-4294.1140571.62E-19−3.318060.01654LGG VS controlOppose
hsa-mir-1415.4141777.61E-423.0569370.021687LGG+ B.caccae VS controlSupport
hsa-mir-140−1.316753.01E-083.035690.022295LGG+ B.caccae VS controlOpposite
hsa-mir-222.1811229.85E-142.7359550.033153LGG+ B.caccae VS controlSupport
hsa-mir-132−1.359599.53E-082.8105660.030002LGG+ B.caccae VS controlOppose
hsa-mir-4546.0984123.47E-223.6010230.010924LGG+ B.caccae VS controlSupport
   3.3420030.016062LGG VS controlSupport
hsa-mir-1535.7051823.50E-05−2.772440.03157LGG+ B.caccae VS controlOppose
hsa-mir-1433.6230694.96E-083.0971310.02177LGG VS controlSupport

Control group, without bacteria; FDR, false discovery rate; FC, fold change; Role, the role of probiotics on colorectal carcinogenesis and tumor progression; LGG, Lactobacillus rhamnosus GG; B.caccae, Bacteroides caccae

Table 2.

The expression of 4 DEmiRNAs in pathogenic environment from TCGA and GEO database

miRNATCGA (colorectal cancer vs normal)
GEO (with pathobionts VS without pathobionts)
Role
LogFCFDRLogFCP.valueGroup Set
hsa-mir-2233.0573512.23E-071.3987190.030564CR tissue + Fn VS controlSupport
hsa-mir-323.8413356.25E-22−1.4905550.003800CRC tissue + Fn VS controlOppose
hsa-mir-965.4579905.53E-182.3667780.009546CRC tissue + Fn VS controlSupport
hsa-mir-106a4.3147183.32E-071.27741660.020353CR tissue + Fn VS controlSupport

FDR, false discovery rate; FC, fold change; Role, the role of pathogenic on colorectal carcinogenesis and tumor progression; CR, Colorectum; Fn, Fusobacterium nucleatum

The expression of 8 DEmiRNAs in probiotic environment from TCGA and GEO database Control group, without bacteria; FDR, false discovery rate; FC, fold change; Role, the role of probiotics on colorectal carcinogenesis and tumor progression; LGG, Lactobacillus rhamnosus GG; B.caccae, Bacteroides caccae The expression of 4 DEmiRNAs in pathogenic environment from TCGA and GEO database FDR, false discovery rate; FC, fold change; Role, the role of pathogenic on colorectal carcinogenesis and tumor progression; CR, Colorectum; Fn, Fusobacterium nucleatum Overview of lncRNA-miRNA-mRNA competing endogenous RNA (ceRNA) network in probiotics-mediated CRC. Red represents upregulation, and green represents downregulation. LncRNAs, miRNAs, and mRNAs in the networks are represented as diamonds, round rectangles, and circles, respectively. Bar plots show the key RNAs that have the top interaction number in the whole network Overview of lncRNA-miRNA-mRNA competing endogenous RNA (ceRNA) network in pathobionts-mediated colorectal cancer. Red represents upregulation, and green represents downregulation. LncRNAs, miRNAs, and mRNAs in the networks are represented as diamonds, round rectangles, and circles, respectively. Bar plots show the key RNAs that have the top interaction number in the whole network

Functional annotation of the ceRNA network

DAVID was used to analyze the differentially expressed RNAs in the ceRNA network. Figure 4(a) and Table S2 show that CRC in the different microbiota environment is mainly enriched in the positive regulation of cellular metabolic processes, positive regulation of macromolecule metabolic processes, and positive regulation of metabolic processes. These findings suggest that microbiota mostly regulate metabolic processes to participate in CRC carcinogenesis and progression. Figure 4(b) and Table S3 show that the main pathways for microbiota-mediated CRC are the p53 signaling pathway and microRNAs in cancer. These data suggest that microbiota mostly involve the p53 signaling pathway and microRNAs in cancer to participate in CRC carcinogenesis and progression.
Figure 4.

GO functional analysis (a) and KEGG pathway (b) analysis of differentially expressed RNAs in microbiota-mediated colorectal cancer

GO functional analysis (a) and KEGG pathway (b) analysis of differentially expressed RNAs in microbiota-mediated colorectal cancer The Cox proportional hazard regression model was used to screen the prognostic signatures (Table 3). The univariate analysis revealed that ten candidate genes were identified as prognostic factors, including FRMD6-AS2, LINC00461, DIRC3, LIFR-AS1, NAALADL2-AS2, LINC00402, ADAMTS9-AS2, MRPL23-AS1, LHX1, and RBM20. Four candidate genes were still considered as independent prognostic factors after the multivariate analysis, including FRMD6-AS2, DIRC3, LIFR-AS1 and MRPL23-AS1.
Table 3.

Univariate and multivariate analysis

VariablesUnivariate analysis
Multivariate analysis
HR95% CIpvalueHR95% CIpvalue
Age1.021.00–1.050.021.141.09–1.190.00
Gender0.950.61–1.500.842.210.91–5.330.08
Stage2.121.64–2.730.001.840.90–3.760.10
AJCC-T2.511.60–3.950.001.010.68–3.310.31
AJCC-N2.101.60–2.740.002.481.15–5.350.02
AJCC-M1.921.48–2.500.003.081.49–6.360.00
FRMD6-AS21.041.00–1.070.031.351.15–1.580.00
LINC004611.041.02–1.050.000.920.82–1.040.17
DIRC31.021.00–1.030.011.211.08–1.370.00
LIFR-AS11.021.00–1.040.030.810.68–0.950.01
NAALADL2-AS21.021.01–1.030.001.180.74–1.870.50
LINC004021.011.00–1.030.031.080.99–1.170.10
ADAMTS9-AS21.011.00–1.010.010.960.90–1.010.13
MRPL23-AS11.011.00–1.010.041.031.01–1.060.00
LHX11.011.01–1.020.001.020.96–1.080.56
RBM201.001.00–1.010.011.000.99–1.010.94

AJCC, the classification system developed by the American Joint Committee on Cancer

Univariate and multivariate analysis AJCC, the classification system developed by the American Joint Committee on Cancer The Kaplan-Meier survival curves and log-rank tests were used to find survival-associated RNAs in the ceRNA network. In total, four lncRNAs were found to be associated with the overall survival (OS) of CRC patients, which are shown in Figure 5. None of the miRNAs and mRNAs were identified as survival-associated genes. Four lncRNAs were obtained from CRC in the probiotic environment, including LINC00355, KCNQ1OT1, LINC00491, and HOTAIR, as shown in Figure 5(a-d). Two lncRNAs were obtained from CRC in the pathogenic environment, including LINC00355 and KCNQ1OT1, as shown in Figure 5(a,b). CRC patients with the high-expression of these lncRNAs are report poor prognosis.
Figure 5.

Overall survival analysis of RNAs in the ceRNA network of microbiota-mediated colorectal cancer. (a), (b) Kaplan-Meier survival curves of prognostic DELs both in probiotics-mediated ceRNA network and pathobionts-mediated ceRNA network. (c), (d) Kaplan-Meier survival curves of prognostic DELs only in proniotics-mediated ceRNA network

Overall survival analysis of RNAs in the ceRNA network of microbiota-mediated colorectal cancer. (a), (b) Kaplan-Meier survival curves of prognostic DELs both in probiotics-mediated ceRNA network and pathobionts-mediated ceRNA network. (c), (d) Kaplan-Meier survival curves of prognostic DELs only in proniotics-mediated ceRNA network

PPI network construction, validation of hub genes, and screening of small molecule drugs

To identify the co-expression relationship of differentially expressed RNAs, a PPI network of microbiota-mediated CRC was constructed by the STRING database that included 40 nodes and 203 edges (Figure 6). There are thirty-four nodes with a degree of connectivity ≥1; therefore, these thirty-four genes were considered as hub genes. The hub genes were then validated through the GEPIA database. Finally, a signature of 12 hub genes was identified, including BBC3, BCL2, BCL2L1, BID, EFTUD2, FAM129A, PMAIP1, PRPF19, RPS2, RPS6, RPS9, and TP53 (Figure 7).
Figure 6.

Protein-protein interaction network of differentially expressed RNAs in microbiota-mediated colorectal cancer, the nodes represent proteins, and the edges demonstrate the predicted functional associations between them, line thickness indicates the strength of data support

Figure 7.

The expressions of 12 hub genes were determined using GEPIA. The expressions of genes are expressed as relative gene expression using transformed log2 (TPM+1) Value (Y-axis) of tumor (red bar) and normal (black bar) samples and displayed as a whisker plot. * p-value <0.05. GEIPIA, gene expression profiling interactive analysis

Protein-protein interaction network of differentially expressed RNAs in microbiota-mediated colorectal cancer, the nodes represent proteins, and the edges demonstrate the predicted functional associations between them, line thickness indicates the strength of data support The expressions of 12 hub genes were determined using GEPIA. The expressions of genes are expressed as relative gene expression using transformed log2 (TPM+1) Value (Y-axis) of tumor (red bar) and normal (black bar) samples and displayed as a whisker plot. * p-value <0.05. GEIPIA, gene expression profiling interactive analysis The validated hub genes were substituted into the CMap network. Among the ten most significantly correlated small molecule drugs, four were negatively scored, which indicated potential therapeutic effects for microbiota-mediated CRC (Table 4). Pentoxyverine, Rimexolone, and Doxylamine are the three most negatively-correlated molecules in microbiota-mediated CRC, while Netilmicin showed high enrichment correlated with microbiota-mediated CRC. Three-dimensional structure models of the top three candidate small molecule drugs found in the PubChem database are shown in Figure 8.
Table 4.

Results of connectivity map analysis

RankCmap nameMeannEnrichmentp
1Netilmicin0.59140.8980.00010
2Pentoxyverine−0.624−0.8540.00082
3Meclofenamic acid0.49950.7790.00116
4Timolol0.55940.8320.00119
5Ciclopirox0.58340.8130.00237
6Rimexolone−0.6014−0.8110.00251
7Zuclopenthixol0.56140.8030.00284
8Doxylamine−0.5375−0.7310.00284
9Prestwick-10820.60130.8850.00310
10Minaprine−0.5715−0.7260.00330

Cmap, Connectivity Map

Figure 8.

Three-dimensional diagram of the three most significant candidate drugs. (a) Pentoxyverine (b) Rimexolone (c) Doxylamine

Results of connectivity map analysis Cmap, Connectivity Map Three-dimensional diagram of the three most significant candidate drugs. (a) Pentoxyverine (b) Rimexolone (c) Doxylamine

Hub signature associated with immune infiltration level

The relationship between hub-validated signatures and immune infiltration levels in CRC were investigated through the TIMER database. The four most significant positive signatures are displayed in Figure 9. The most immune infiltration-relevant gene is FAM129A. The expression level of this gene shows significant positive correlations with infiltrating levels of B cells (r = 0.249, p = 3.89e-07), CD8 + T cells (r = 0.411, p = 5.24e-18), CD4 + T cells (r = 0.62, p = 4.69e-44), macrophages (r = 0.708, p = 1.22e-62), neutrophils (r = 0.678, p = 3.27e-55), and dendritic cells (r = 0.737, p = 3.63e-70), while it also has significant negative correlations with tumor purity. Similarly, other hub genes, e.g., BCL2, PMAIP1, and RPS6, have significant correlations with infiltrating levels of immune cells, which indicates that the microbiota plays an important role in immune infiltration in CRC.
Figure 9.

Integrative analysis between hub identified signature with humor-infiltrating immune cells

Integrative analysis between hub identified signature with humor-infiltrating immune cells

Discussion

The microbiota is regarded as a ‘neglected organ’, reflecting a biological ecosystem that is closely interconnected with the host [11,35]. A precise balance in the microbiota plays an important role for health status and the prevention of some chronic diseases [36]. When the balance is disrupted (dysbiosis), it can cause acute or chronic clinical disorders – for instance, antibiotic-associated diarrhea, ulcers [37], inflammatory bowel disease [38], irritable bowel syndrome [39], and even some malignancies [40]. The absence of probiotics and the overgrowth of pathobionts are the two main characteristics of dysbiosis. Previous studies have shown that probiotics are capable of binding to mutagenic amines, degrading nitrosamines, and reducing the production of carcinogens [41]. In contrast, pathobionts have been implicated in DNA damage and tumor progression [11]. Numerous studies have been conducted on microbiota-mediated carcinogenesis; however, the mechanism responsible for the microbiota-induced pathogenesis of CRC remains undefined. Therefore, identifying unique biomarkers and exploring the microbiota-mediated mechanisms are essential for conquering dysbiosis-induced CRC. In this study, we first constructed the ceRNA networks of microbiota-mediated CRC based on the TCGA and GEO database. The ceRNA networks provide a comprehensive overview for exploring the regulatory mechanism of ceRNA in microbiota-mediated colorectal cancer. We identified 1038 differentially expressed miRNAs in the probiotic environment and 137 differentially expressed miRNAs in the pathogenic environment, which were extracted and analyzed from the GEO database. Meanwhile, we found 1068 lncRNAs, 355 miRNAs, and 2109 mRNAs with differentially expressed profiles from the TCGA database. After assessing the overlap of miRNAs, eight miRNAs and three miRNAs were screened out in the probiotic-mediated CRC network, and pathogenic CRC network, respectively. Therefore, 75 lncRNAs, 8 miRNAs, and 9 mRNAs were obtained in the probiotic-meditated ceRNA network, while 49 lncRNAs, 4 miRNAs, and 3 mRNAs were identified in the pathogenic-mediated ceRNA network. Our study demonstrated that the probiotics can inhibit the expression of the oncogenes hsa-mir-153 and hsa-mir-429, and promote the expression of the tumor suppressors hsa-mir-140 and hsa-mir-132, while the pathobionts could promote the expression of the oncogenes hsa-mir-223, hsa-mir-96, and hsa-mir-106a. The results indicated that these miRNAs are crucial for microbiota-mediated colorectal carcinogenesis and progression. Of note, only one probiotic-mediated mRNA, PMAIP1, was identified as negatively regulating CRC, while one unique pathobiont-mediated mRNA, FAM129A, was identified as positively regulating CRC. Subsequently, the enriched biological functions of mRNA in the microbiota-mediated ceRNA were evaluated by the GO and KEGG pathway analyses. The GO analysis revealed that the biological functions of the microbiota-mediated ceRNA network are mainly associated with the positive regulation of cellular metabolic process, positive regulation of macromolecule metabolic process, and positive regulation of metabolic process. Thus, microbiota may modulate the expression of ceRNAs, and further participate in the metabolic process to induce tumorigenesis. The pathway analysis showed that two unique pathways (p53 signaling pathway and MicroRNAs in cancer) were enriched in the ceRNA networks. Several studies have reported similar results to our observations. For example, according to the study of Kado et al., both intestinal microflora and p53 contribute to the development of adenocarcinoma of the colon [42]. Also, it has been shown that the carcinogenesis of hepatocellular carcinoma (HCC) is also related to gut microbiota; the mechanism could be microRNAs in the cancer pathway [43]. These enrichment results support the hypothesis that the microbiota plays essential roles via these pathways in colorectal carcinoma. Through the overall survival analysis, four lncRNAs from the microbiota-mediated ceRNA network (LINC00355, KCNQ1OT1, LINC00491, and HOTAIR) were found to be associated with poor overall survival. Based on the previous studies, LINC00355 not only takes part in the regulation of the ceRNA network, but also contributes to the pathological staging in CRC [44]. It was reported that LINC00355 is also associated with survival in other solid tumors, such as prostate cancer [45] and bladder cancer [46]. LINC00491 can positively regulate SERPINE1 expression through binding miR-145 and promoting the proliferation, migration, and invasion of colon adenocarcinoma cells [47]. Many studies regarding KCNQ1OT1 and HOTAIR have been reported. It was stated that KCNQ1OT1 can enhance the chemoresistance of oxaliplatin/methotrexate and epithelial-mesenchymal transition (EMT) in colon cancer [48-50]. Other tumor-promoting effects were also demonstrated in lung cancer [51], tongue cancer [52], and hepatocellular carcinoma [53]. HOTAIR was reported as a negative prognostic factor, not only in primary tumors, but also in the blood of CRC patients [54]. HOTAIR is also associated with EMT and stemness maintenance of cancer cell lines [55]. Although many studies have been devoted to the prognostic effects of the above lncRNAs, there are no reports suggesting that these lncRNAs may act as microbiota-mediated survival biomarkers in CRC. Through univariate and multivariate Cox proportional model analysis, a four-gene signature (FRMD6-AS2, DIRC3, LIFR-AS1, and MRPL23-AS1) from the ceRNA networks was identified as independent prognostic factors. These genes have been verified in several carcinomas too, e.g. FRMD6-AS2 was found to increase the phosphorylation of LATS1 and YAP to promote the tumor growth, migration and invasion of endometrial cancer [56,57]. The DELs, DEMs, and DEGs in the microbiota-mediated ceRNA network were selected to construct a PPI network. Thirty-four hub genes were identified after the retrieval of the STRING databases. Twelve hub genes were validated through the GEPIA database. Among these twelve top hub genes, EFTUD2 ranked the highest. EFTUD2 is a component of the U5 snRNP in the spliceosome, and it was recently proven to promote colitis-associated tumorigenesis by mediating alternative splicing of components of the TLR4-NF-κB cascade [58]. In addition, several small molecules drugs with therapeutic effects against microbiota-mediated CRC were screened. The top three are pentoxyverine, rimexolone, and doxylamine. Pentoxyverine is regarded as an antitussive that is commonly used for coughs. However, it is also a selective agonist of the sigma-1 receptor, which exhibits anti-proliferative activity in melanoma cells [59]. Rimexolone is a highly lipophilic glucocorticoid receptor agonist, which suppresses the inflammatory response to various inciting agents of a mechanical, chemical and immunological nature. It is commonly used in eye inflammation and postoperative inflammation after cataract surgery [60]. Doxylamine is a histamine H1 antagonist that has demonstrable sedative properties. It is used in allergies, and as a hypnotic, antiemetic and antitussive. The above three drugs have potential as new drugs for microbiota-related CRC. Another important aspect of this study is that the hub identified the signature of microbiota-mediated CRC to be associated with different immune infiltration levels. Our results demonstrate that FAM129A is the most significant gene in the signature that induces immune infiltration. The expression level of FAM129A is positively correlated with the infiltration level of all selected immune cells, especially in macrophages and DCs. Other microbiota-mediated genes, e.g. BCL2, PMAIP1, and RPS6, also showed moderate to strong relationships between gene expression levels and immune infiltration levels in specific immune cells. These results could be indicative of a potential mechanism where microbiota regulates immune system functions in CRC. Also, the genes FAM129A, BCL2, PMAIP1, and RPS6 could plays significant roles in the recruitment and regulation of immune-infiltrating cells. There were two limitations to the current study. Firstly, only a small number of datasets with microbiota data were included in the GEO database, which means that more experiments regarding ceRNA regulatory networks in microbiota-mediated carcinogenesis and progression are warranted. Secondly, our biomarker reservoir was identified based on the online databases through bioinformatics methods, which have not been validated in experiments. Further studies, including cell experiments, are therefore needed.

Conclusions

In conclusion, this study constructed ceRNA networks involving both probiotic- and pathobiont-mediated colorectal carcinogenesis and progression by analyzing the relevant data obtained from the TCGA and GEO databases. It provides a comprehensive analysis of the ceRNA regulatory mechanism in CRC. Furthermore, novel DELs, DEMs, and DEGs could be candidate diagnostic and prognostic biomarkers, or serve as potential therapeutic targets. Click here for additional data file.
  60 in total

1.  Comprehensive analysis of lncRNA-associated ceRNA network in colorectal cancer.

Authors:  Wenliang Yuan; Xiaobo Li; Li Liu; Cai Wei; Dan Sun; Sihua Peng; Linhua Jiang
Journal:  Biochem Biophys Res Commun       Date:  2018-11-28       Impact factor: 3.575

Review 2.  Long noncoding RNA, polycomb, and the ghosts haunting INK4b-ARF-INK4a expression.

Authors:  Francesca Aguilo; Ming-Ming Zhou; Martin J Walsh
Journal:  Cancer Res       Date:  2011-08-09       Impact factor: 12.701

3.  Gut microbiome development along the colorectal adenoma-carcinoma sequence.

Authors:  Qiang Feng; Suisha Liang; Huijue Jia; Andreas Stadlmayr; Longqing Tang; Zhou Lan; Dongya Zhang; Huihua Xia; Xiaoying Xu; Zhuye Jie; Lili Su; Xiaoping Li; Xin Li; Junhua Li; Liang Xiao; Ursula Huber-Schönauer; David Niederseer; Xun Xu; Jumana Yousuf Al-Aama; Huanming Yang; Jian Wang; Karsten Kristiansen; Manimozhiyan Arumugam; Herbert Tilg; Christian Datz; Jun Wang
Journal:  Nat Commun       Date:  2015-03-11       Impact factor: 14.919

4.  KCNQ1OT1 facilitates progression of non-small-cell lung carcinoma via modulating miRNA-27b-3p/HSP90AA1 axis.

Authors:  Zhiwu Dong; Ping Yang; Xiaojian Qiu; Shuang Liang; Bing Guan; Haisheng Yang; Feifei Li; Li Sun; Huiling Liu; Guanghui Zou; Kewen Zhao
Journal:  J Cell Physiol       Date:  2018-11-23       Impact factor: 6.384

5.  Anticancer immunotherapy by CTLA-4 blockade relies on the gut microbiota.

Authors:  Marie Vétizou; Jonathan M Pitt; Romain Daillère; Patricia Lepage; Nadine Waldschmitt; Caroline Flament; Sylvie Rusakiewicz; Bertrand Routy; Maria P Roberti; Connie P M Duong; Vichnou Poirier-Colame; Antoine Roux; Sonia Becharef; Silvia Formenti; Encouse Golden; Sascha Cording; Gerard Eberl; Andreas Schlitzer; Florent Ginhoux; Sridhar Mani; Takahiro Yamazaki; Nicolas Jacquelot; David P Enot; Marion Bérard; Jérôme Nigou; Paule Opolon; Alexander Eggermont; Paul-Louis Woerther; Elisabeth Chachaty; Nathalie Chaput; Caroline Robert; Christina Mateus; Guido Kroemer; Didier Raoult; Ivo Gomperts Boneca; Franck Carbonnel; Mathias Chamaillard; Laurence Zitvogel
Journal:  Science       Date:  2015-11-05       Impact factor: 47.728

6.  ceRNA network construction and comparison of gastric cancer with or without Helicobacter pylori infection.

Authors:  Yanyan Liu; Jingyu Zhu; Xiaoli Ma; Shuyi Han; Dongjie Xiao; Yanfei Jia; Yunshan Wang
Journal:  J Cell Physiol       Date:  2018-10-28       Impact factor: 6.384

7.  GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses.

Authors:  Zefang Tang; Chenwei Li; Boxi Kang; Ge Gao; Cheng Li; Zemin Zhang
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

8.  Role of the long non-coding RNA PVT1 in the dysregulation of the ceRNA-ceRNA network in human breast cancer.

Authors:  Federica Conte; Giulia Fiscon; Matteo Chiara; Teresa Colombo; Lorenzo Farina; Paola Paci
Journal:  PLoS One       Date:  2017-02-10       Impact factor: 3.240

9.  lncRNA KCNQ1OT1 enhances the chemoresistance of oxaliplatin in colon cancer by targeting the miR-34a/ATG4B pathway.

Authors:  Yongchao Li; Changfeng Li; Dandan Li; Lei Yang; Jingpeng Jin; Bin Zhang
Journal:  Onco Targets Ther       Date:  2019-04-09       Impact factor: 4.147

10.  Bioinformatics analysis reveals the competing endogenous RNA (ceRNA) coexpression network in the tumor microenvironment and prognostic biomarkers in soft tissue sarcomas.

Authors:  Dandan Zou; Yang Wang; Meng Wang; Bo Zhao; Fei Hu; Yanguo Li; Bingming Zhang
Journal:  Bioengineered       Date:  2021-12       Impact factor: 3.269

View more
  7 in total

Review 1.  Potential of Mitochondrial Ribosomal Genes as Cancer Biomarkers Demonstrated by Bioinformatics Results.

Authors:  Shunchao Bao; Xinyu Wang; Mo Li; Zhao Gao; Dongdong Zheng; Dihan Shen; Linlin Liu
Journal:  Front Oncol       Date:  2022-05-26       Impact factor: 5.738

2.  Novel targets in rectal cancer by considering lncRNA-miRNA-mRNA network in response to Lactobacillus acidophilus consumption: a randomized clinical trial.

Authors:  Zohreh Khodaii; Mahboobeh Mehrabani Natanzi; Solmaz Khalighfard; Maziar Ghandian Zanjan; Maryam Gharghi; Vahid Khori; Taghi Amiriani; Monireh Rahimkhani; Ali Mohammad Alizadeh
Journal:  Sci Rep       Date:  2022-06-02       Impact factor: 4.996

3.  LncRNA00978 contributes to growth and metastasis of hepatocellular carcinoma cells via mediating microRNA-125b-5p/SOX12 pathway.

Authors:  Zhiqing Cheng; Limei Gong; Qinghe Cai
Journal:  Bioengineered       Date:  2022-04       Impact factor: 6.832

Review 4.  Long Noncoding RNA LIFR-AS1: A New Player in Human Cancers.

Authors:  Zhiqun Bai; Xuemei Wang; Zhen Zhang
Journal:  Biomed Res Int       Date:  2022-01-13       Impact factor: 3.411

Review 5.  Gut Dysbiosis and Intestinal Barrier Dysfunction: Potential Explanation for Early-Onset Colorectal Cancer.

Authors:  Siti Maryam Ahmad Kendong; Raja Affendi Raja Ali; Khairul Najmi Muhammad Nawawi; Hajar Fauzan Ahmad; Norfilza Mohd Mokhtar
Journal:  Front Cell Infect Microbiol       Date:  2021-12-13       Impact factor: 5.293

Review 6.  NIBAN1, Exploring its Roles in Cell Survival Under Stress Context.

Authors:  Paula Diana; Gianna Maria Griz Carvalheira
Journal:  Front Cell Dev Biol       Date:  2022-04-19

7.  Long non-coding RNA COL4A2-AS1 facilitates cell proliferation and glycolysis of colorectal cancer cells via miR-20b-5p/hypoxia inducible factor 1 alpha subunit axis.

Authors:  Zijun Yu; Yeming Wang; Jianwu Deng; Dong Liu; Lingling Zhang; Hua Shao; Zilu Wang; Wenjun Zhu; Cheng Zhao; Qungang Ke
Journal:  Bioengineered       Date:  2021-12       Impact factor: 3.269

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.