Cheng-Yun Li1, Wen-Wen Zhang1, Ji-Lian Xiang2, Xing-Hua Wang3, Jin Li1, Jun-Ling Wang1. 1. Department of Toxicology, School of Public Health, Lanzhou University, Lanzhou, Gansu 730000, China. 2. Department of Gastroenterology, Third People's Hospital of Gansu Province, Lanzhou, Gansu 730000, China. 3. Department of Gastrointestinal Surgery, Gansu Wuwei Tumor Hospital, Wuwei, Gansu 733000, China.
Abstract
BACKGROUND: MicroRNAs (miRNAs) have played important roles in the regulation of gene expression in many cancers, but their roles in esophageal squamous cell carcinoma (ESCC) are still unclear. The aim of this study was to determine the potential ESCC-specific key miRNAs from a large sample dataset in The Cancer Genome Atlas (TCGA). METHODS: Integrative bioinformatics analysis was used to identify key ESCC-specific miRNAs related to the ESCC patients' tumor histological grade and lymphatic metastasis from TCGA. Next, these key miRNA potential gene regulatory functions and relationships with ESCC patients' clinical characteristics and overall survival were analyzed. Finally, three key miRNAs were selected randomly and quantificational real-time polymerase chain reaction (qRT-PCR) was used to validate in 51 newly diagnosed ESCC patients' tissues samples (collected from Nov. 2017 to Feb. 2019, in Wuwei, China) whether the bioinformatics analyses results were reliable and valid. Two-tailed Student's t test, Pearson Chi-squared test and Kaplan-Meier survival analysis were used in this study. RESULTS: Thirty-five ESCC-specific miRNAs from TCGA database were investigated (fold-change > 2.0, P < 0.05), and 28 participated in the miRNAs-mRNAs co-expression network construction, while 17 were related with ESCC patients' tumor histological grade, TNM stage, and lymphatic metastasis (P < 0.05). Meanwhile, six miRNAs (including miR-200b-3p, miR-31-5p, miR-15b-5p, miR-141-3p, miR-135b-5p, and miR-195-5p) were correlated with overall survival of ESCC patients (log-rank, P < 0.05). MiR-135b-5p, miR-15b-5p, and miR-195-5p were selected for verification of the expression levels in 51 ESCC patients' tissue samples by using qRT-PCR. We found that the fold-changes between qRT-PCR and TCGA were completely consistent. The results also suggested that miR-135b-5p, miR-15b-5p, and miR-195-5p were significantly correlated with tumor differentiation degrees (P < 0.05), miR-195-5p was significantly correlated with tumor TNM stage (P < 0.05), and miR-135b-5p was significantly correlated with lymph-node metastasis (P < 0.05). MiR-135b-5p, miR-15b-5p, and miR-195-5p expression levels, ESCC patient clinical features association analysis results and the aforementioned TCGA bioinformatics analyses were similar. CONCLUSION: This study identified key ESCC-related miRNAs. The key miRNAs are worthy of further investigation as potential novel biomarkers for diagnosis, classification, and prognosis of ESCC.
BACKGROUND: MicroRNAs (miRNAs) have played important roles in the regulation of gene expression in many cancers, but their roles in esophageal squamous cell carcinoma (ESCC) are still unclear. The aim of this study was to determine the potential ESCC-specific key miRNAs from a large sample dataset in The Cancer Genome Atlas (TCGA). METHODS: Integrative bioinformatics analysis was used to identify key ESCC-specific miRNAs related to the ESCCpatients' tumor histological grade and lymphatic metastasis from TCGA. Next, these key miRNA potential gene regulatory functions and relationships with ESCCpatients' clinical characteristics and overall survival were analyzed. Finally, three key miRNAs were selected randomly and quantificational real-time polymerase chain reaction (qRT-PCR) was used to validate in 51 newly diagnosed ESCCpatients' tissues samples (collected from Nov. 2017 to Feb. 2019, in Wuwei, China) whether the bioinformatics analyses results were reliable and valid. Two-tailed Student's t test, Pearson Chi-squared test and Kaplan-Meier survival analysis were used in this study. RESULTS: Thirty-five ESCC-specific miRNAs from TCGA database were investigated (fold-change > 2.0, P < 0.05), and 28 participated in the miRNAs-mRNAs co-expression network construction, while 17 were related with ESCCpatients' tumor histological grade, TNM stage, and lymphatic metastasis (P < 0.05). Meanwhile, six miRNAs (including miR-200b-3p, miR-31-5p, miR-15b-5p, miR-141-3p, miR-135b-5p, and miR-195-5p) were correlated with overall survival of ESCCpatients (log-rank, P < 0.05). MiR-135b-5p, miR-15b-5p, and miR-195-5p were selected for verification of the expression levels in 51 ESCCpatients' tissue samples by using qRT-PCR. We found that the fold-changes between qRT-PCR and TCGA were completely consistent. The results also suggested that miR-135b-5p, miR-15b-5p, and miR-195-5p were significantly correlated with tumor differentiation degrees (P < 0.05), miR-195-5p was significantly correlated with tumorTNM stage (P < 0.05), and miR-135b-5p was significantly correlated with lymph-node metastasis (P < 0.05). MiR-135b-5p, miR-15b-5p, and miR-195-5p expression levels, ESCCpatient clinical features association analysis results and the aforementioned TCGA bioinformatics analyses were similar. CONCLUSION: This study identified key ESCC-related miRNAs. The key miRNAs are worthy of further investigation as potential novel biomarkers for diagnosis, classification, and prognosis of ESCC.
Esophageal squamous cell carcinoma (ESCC) is a common fatal malignant disease in humans, which causes more than 400,000 deaths per year.[ In spite of the advances in multidisciplinary diagnosis and treatment, there are still important topics that need to be explored for the early diagnosis and prognosis biomarkers of ESCC.[ The ESCC detection methods are based on the endoscopy examination, while this method was often affected by doctors’ individual experience and technology, with a high rate of missed diagnosis. Meanwhile, endoscopic diagnoses of ESCC cannot find the early stage of lymph node metastasis status in esophageal carcinoma.[ In addition, conventional non-invasive serological biomarkers, such as carbohydrate antigen 19-9 (CA19-9) and carcinoembryonic antigen (CEA), are lack of specificity in the early diagnosis of ESCC.[ Therefore, the significance of exploration of innovative and favorable biomarkers for ESCC diagnosis and prognosis should be emphasized.MicroRNAs (miRNAs, 18–22 nucleotides) are small, non-coding RNAs that can bind to the messenger RNAs and negatively regulate the gene expression.[ Studies have revealed that miRNAs function in many key biological roles in various disease progressions, including cancers. Therefore, further insights into the important roles of miRNAs in diseases, especially in cancers, have shown miRNAs as attractive tools and targets for novel diagnoses, prognoses, and therapeutic approaches. ESCC is a multistep-process disease, which involves multiple effects of gene-environmental factor interactions. The miRNAs play important regulatory roles in epigenetics and are closely related with many diseases progressions.[ Further understanding of the ESCC tissue alterations and miRNA expression profiles may help us to identify better biomarkers for ESCC diagnosis, classification, and prognosis.Further evidence suggested that aberrantly expressed miRNAs in ESCC tissues have a clinical influence on disease diagnosis and prognosis.[ Recently, miRNA-associated biomarkers have been reported for the diagnosis and prognoses of ESCC.[ However, the small sample sizes used for microarray detection often present a bias toward the identification of ESCC-associated miRNAs due to lack of RNA sequencing data, and therefore often generate errors. Thus far, The Cancer Genome Atlas (TCGA) (https://tcga-data.nci.nih.gov/tcga/) database has provided a resource for large sample size genome sequencing data from 33 humancancer types including more than 30,000 patients.[ In the present study we downloaded and analyzed the ESCCpatients’ RNA sequencing expression profiles and clinical pathology features from TCGA database using bioinformatics techniques. This way will improve the method to discover potential miRNA biomarkers for the diagnosis of ESCC, as well as its classification and prediction of prognosis.
Methods
Ethical approval
Informed consent forms and clinical information were obtained for all patients by the investigator and from medical records, respectively. The procedures used in this study were approved by the Wuwei Tumor Hospital of Gansu (Wuwei, China) and conformed to the Declaration of Helsinki and current legislation.
Data collection
The ESCCpatienttumor tissues and normal esophageal epithelial tissue miRNA sequencing data set and patient personal information and clinical pathological features are included from the TCGA database (supplementary Table S1). RNA sequencing data were obtained from Illumina HiSeq platforms (https://www.cancer.gov/about-nci/organization/ccg/research/structural-genomics/tcga/using-tcga/types). ESCCpatient exclusion criteria included: (1) the admitting diagnosis was not ESCC; (2) suffering from two or more types of cancers including ESCC; (3) clinical information or RNA sequencing data were deficient; (4) patients treated with radiation and chemotherapy before specimen collection, and (5) overall survival time more than 5 years. In the present study, we obtained ESCCpatients’ tissues samples including normal esophageal mucosa tissues 11 cases, pathologic Stage I 27 cases, Stage II 74 cases and Stage III-IV 60 cases. Before further analyses and processing, RNA-sequencing data normalization was performed in full compliance with the publication guidelines of TCGA; the approval of ethics committee was not required.
Study design and data preprocessing
Hierarchical cluster analyses were performed using R package heat maps. Then, according to the z-score values to deduce the expression levels of samples RNA sequencing data, we analyzed the gene pair Euclidean distances to detect gene clusters. Subsequently, the significantly different miRNA and mRNA expression profiles in ESCC were analyzed separately, and we determined the significantly different genes of the normal vs. cancerous tissues of ESCCpatients. We compared the differentially expressed miRNAs and mRNA of four stage levels, including ESCC (G I stage) non-lymphatic metastases vs. non-tumor esophageal tissues, ESCC (G I stage) lymphatic metastases vs. non-tumor esophageal tissues, ESCC (G II stage) lymphatic metastases vs. non-tumor esophageal tissues, and ESCC (G III-IV stage) lymphatic metastases vs. non-tumor esophageal tissues, respectively. The intersections of the miRNAs and mRNAs were selected in the above four stage levels for further analyses using bioinformatics methods. Figure 1 is the flow diagram of this study design.
Figure 1
Flow diagram of the study. Data preparing, processing, and bioinformatics analysis flowchart.
Flow diagram of the study. Data preparing, processing, and bioinformatics analysis flowchart.
Co-expression network of miRNAs and mRNAs
For the current study, according to the findings that miRNAs can bind to the messenger RNAs and negatively regulate gene expression, a miRNA-mRNA network was built. Significantly differentially expressed intersecting miRNA and mRNA were used to build a co-expression network, which showed fold change >2.0 and P < 0.05. Target genes of miRNAs were predicted by miRanda (http://www.microrna.org/microrna/home.do) and Targetscan (http://www.targetscan.org/). The combined miRNA predicted target gene information and significantly differentially expressed intersecting mRNAs of TCGA ESCCpatients were used to build the miRNA-mRNA network.The miRNAs-mRNAs co-expression network was constructed using significantly differential gene expression profiles using R package software (The University of Auckland, Auckland, New Zealand). Pearson correlation matrices models were used to find the potential relevance of all pair-wise genes. Subsequently, based on the weighted adjacency matrix, we measured the network connectivity of a gene with other gene network generations, and built the miRNAs-mRNAs co-expression network by using Cytoscape 3.0 software (National Institute of General Medical Sciences, MD, USA).
Analysis of the association between key miRNAs and clinical status
According to the co-expression of miRNAs and mRNAs, the key miRNAs involved in co-expression networks were selected as our targets to determine if they positively correlated with ESCC. Subsequently, we further analyzed the potential associations between co-expression network key miRNAs and TCGA ESCCpatients’ clinical features, including sex, TNM stage, tumor grade, lymphatic metastases status, and pathological stage by using multiple linear regression analyses.
Kaplan-Meier survival analyses
To analyze the correlation between the changes of the selected key miRNA expression levels and ESCCpatients’ prognostic survival, correlations were determined by Gene Expression Profiling Interactive Analysis (GEPIA) (http://gepia.cancer-pku.cn/). Publicly available data and GEPIA tools were prepared to calculate the Kaplan-Meier survival analysis parameters using TCGA ESCCpatients’ datasets. All the RNA sequencing data probe sets or records were averaged by the quantile normalized data. Finally, the survival distributions equality of different TCGA ESCCpatients and the key miRNA expression level changes were analyzed using the Kaplan-Meier, log-rank, and the hazard ratio (HR) tests.
Preparation for human ESCC samples
Next, 51 ESCCpatients’ tumor tissues samples and paired non-tumor esophagus tissues (aged 40–69 years) were collected from Gansu Wuwei Tumor Hospital, China, for RT-qPCR validation. All of these patients were diagnosed with ESCC according to their histopathology. Samples were collected and stored in RNAlater (Ambion, Foster City, CA, USA) at −80°C. Collection and use of ESCCpatients’ tumor tissue samples were approved by the ethics committee of Gansu Wuwei Tumor Hospital.
Quantitative real time PCR (qRT-PCR) validation of bioinformatics data
Total RNA of ESCCpatients’ tissues samples were isolated by TRIzol® reagent (Invitrogen, Carlsbad, CA, USA). Total RNA quantity was evaluated by a NanoDrop® ND-1000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA). A Reverse Transcription Kit (Promega, Madison, WI, USA) and GoTaq® qPCR Master Mix of Power SYBR® Green (Promega) were used to synthesized cDNA and for qRT-PCR detection. Finally, qRT-PCR reactions were performed using the Step One PlusTM PCR System (Applied Biosystems, Foster City, CA, USA).
Statistical analyses
Data were analyzed using the SPSS Statistics 21.0 (International Business Machines Corporation, Armonk, NY, USA) and expressed as mean ± standard deviation (SD). All analyses were performed three times and represent data from three individual experiments. Two-tailed Student's t test was used to significance of differences between subgroups. Pearson Chi-squared test was used to find the association between the expression of miRNAs and ESCCpatients’ clinicopathological characteristics. Kaplan-Meier survival analysis was used to investigate the correlation between the changes of miRNAs expression levels and patients’ prognostic overall survival times. Statistical significance was set at P < 0.05. The qRT-PCR results were calculated and evaluated by the 2−ΔΔCt method.
Results
Differentially expressed miRNAs screening in ESCC patients
We used ESCCpatients’ RNA-seq datasets collected from 161 cancerous and 11 normal specimens for further analyses. There were 232 miRNAs that were significantly differentially expressed between ESCCpatients’ tumor vs. normal esophageal tissues samples in the Level 3 ESCC RNA-Sequencing (RNA-Seq) data from TCGA database. Next, we integrated and analyzed the relevance between the 232 miRNA expression levels and ESCCpatients’ differential clinical stage and pathologic grade. There were 103 miRNAs that showed significant differential changes between G I stage (non-lymphatic metastasis) ESCCtumors and normal esophageal tissues samples. In addition, 119 miRNAs were significantly differentially changed between G I stage (lymphatic metastasis) ESCCtumors and normal esophageal tissues samples; 125 miRNAs were found to be significantly differentially changed between G II stage (lymphatic metastasis) ESCCtumors and normal esophageal tissue samples; and 150 miRNAs were found to be significantly differentially changed between G III-IV stage (lymphatic metastasis) ESCCtumors and normal esophageal tissues samples. Subsequently, we used Venn diagrams (http://bioinfogp.cnb.csic.es/tools/venny/index.html) to analyze the overlapping subclasses of miRNAs, and found there were 35 key intersecting miRNAs [Figure 2]. These 35 intersecting miRNAs included 25 miRNAs that were overexpressed and 10 miRNAs that were downregulated (supplementary Table S2).
Figure 2
Venn diagram analysis of differentially expressed miRNAs between G I nLM/Normal, G I LM/Normal, G II LM/Normal, G III-IV LM/Normal. (LM represents lymphatic metastasis; nLM represent non-lymphatic metastasis; Normal represents normal esophageal tissues).
Venn diagram analysis of differentially expressed miRNAs between G I nLM/Normal, G I LM/Normal, G II LM/Normal, G III-IV LM/Normal. (LM represents lymphatic metastasis; nLM represent non-lymphatic metastasis; Normal represents normal esophageal tissues).
MiRNAs-mRNAs co-expression network construction and potential regulatory function analyses
We chose the abovementioned 35 intersecting miRNAs from 232 ESCC associated miRNAs by using a Venn diagram overlapping subclass analysis to construct the miRNAs-mRNAs network. Next, to build the miRNA-mRNA co-expression network, we also predicted the mRNAs targeted by the miRNAs. According to these 35 intersecting miRNAs (supplementary Table S2), we predicted that these miRNAs targeted mRNAs by using the miRanda and Targetscan databases. Based on the bioinformatics analysis, we found that there were 376 intersecting mRNAs and 2235 significantly differentially expressed mRNAs in ESCCcancer tissues from TCGA database. Finally, based on the 35 intersecting miRNAs and 376 intersecting mRNAs, we constructed a miRNAs-mRNAs co-expression network. The relationships between miRNAs and mRNAs were analyzed by their negative regulation mechanisms and the co-expression network was drawn using Cytoscape 3.0. There were 28 targeted miRNAs that potentially regulated 72 mRNAs and were involved in the miRNA-mRNA co-expression network [Figure 3]. To further analyze these 72 negatively regulated mRNAs in the co-expression network, we found that some played important roles in cancer development, such as E2F2, FGF2, COL4A3, ERBB4, FGFR1, RELN, and TCF3.
Figure 3
The miRNAs-mRNAs competing endogenous RNAs network. Red balls represent upregulated miRNAs, blue balls, downregulated miRNAs; Red triangles represent upregulated mRNAs, blue triangles, downregulated mRNAs.
The miRNAs-mRNAs competing endogenous RNAs network. Red balls represent upregulated miRNAs, blue balls, downregulated miRNAs; Red triangles represent upregulated mRNAs, blue triangles, downregulated mRNAs.To obtain further insight into the potential regulatory functions of the 28 miRNAs on the 72 mRNAs involved in miRNAs-mRNAs networks, the up- and down-regulated mRNAs were analyzed using the DAVID database. Gene Ontology (GO) analyses revealed upregulated mRNAs enriched in ‘regulation of cellular process, regulation of biological process and protein binding’ and downregulated mRNAs transcripts enriched in ‘single-organism cellular process, single-organism process, and response to stimulus’ [Figure 4A]. These significantly enriched GO functions provided a better understanding of the roles of mRNAs in regulatory functions. In addition, pathway analysis was performed for all 72 mRNAs, and the results suggested that the enriched pathways targeted by upregulated mRNAs transcripts were ‘microRNAs in cancer, humanT-cell leukaemia virus type I (HTLV-I) infection, cell cycle and hepatitis B.’ The enriched pathways of downregulated transcripts were ‘insulin secretion, glutamatergic synapse, proteoglycans in cancer, and vascular smooth muscle contraction’ [Figure 4B].
Figure 4
Top 20 enrichments of Gene Ontology (GO) terms for differentially expressed mRNAs and pathways for differentially expressed intersection mRNAs. (A) Top 20 enrichments of GO terms for differentially expressed mRNAs (the bar plot shows the enrichment scores of the significant enrichment GO terms). (B) Top 20 enrichments of pathways for differentially expressed intersection mRNAs (the bar plot shows the enrichment scores of the significant enrichment pathways).
Top 20 enrichments of Gene Ontology (GO) terms for differentially expressed mRNAs and pathways for differentially expressed intersection mRNAs. (A) Top 20 enrichments of GO terms for differentially expressed mRNAs (the bar plot shows the enrichment scores of the significant enrichment GO terms). (B) Top 20 enrichments of pathways for differentially expressed intersection mRNAs (the bar plot shows the enrichment scores of the significant enrichment pathways).
Correlations between miRNA signatures and clinical characteristics
We selected the 28 miRNAs, which were involved in miRNA-mRNA co-expression networks, and further analyzed the correlation between these miRNAs and TCGA database of 161 ESCCpatients’ clinical pathology features. The ESCC clinical/demographic features included patients’ sex, race, tumor grade, TNM stage, and lymphatic metastasis taken from TCGA database. Multiple linear regression analysis results suggested that there were 17 miRNAs differentially expressed with different clinical pathology features (P < 0.05). We showed that miR-32-5p, miR-330-5p, miR-135b-5p, miR-195-5p, and miR-145-5p were associated with ESCCpatients’ tumor grade, miR-16-5p, miR-32-5p, miR-93-5p, miR-185-5p, miR-15b-5p, miR-135b-5p, miR133a-3p, miR-28-5p, and miR-195-5p were associated with TNM stages, and miR-32-5p, miR-93-5p, miR-15b-5p, miR-135b-5p, miR-330-5p, miR-106b-5p, miR-182-5p, miR-20b-5p, miR-141-3p, miR-28-5p, miR-195-5p, and miR-320a were associated with lymphatic metastasis. Meanwhile, miR-16-5p and miR-183-5p were found associated with racial/ethnic background [Table 1].
Table 1
Correlations between ESCC specific intersection miRNAs and clinical features.
Correlations between ESCC specific intersection miRNAs and clinical features.
Prognostic analysis of miRNA expression levels and ESCC patient survival
A Kaplan-Meier survival analysis was used to investigate the relationships between the miRNAs-mRNAs co-expression network 28 key miRNAs and ESCCpatients’ survival time. The 28 key miRNA expression levels and the ESCCpatient overall survival information in TCGA database were calculated using univariate Cox proportional hazard regression model analyses. We found six miRNAs closely related with ESCCpatient overall survival time (log-rank, P < 0.05). Among these six miRNAs, there were five miRNAs (miR-200b-3p, miR-31-5p, miR-15b-5p, miR-141-3p, and miR-135b-5p) that were negatively related with ESCCpatient overall survival time (P < 0.05), and miR-195-5p was positively correlated with ESCCpatient overall survival time (P < 0.05) [Figure 5].
Figure 5
Kaplan-Meier survival curves for six miRNAs associated with ESCC patients overall survival time (Horizontal axis overall survival time: days, Vertical axis survival function). hsa: homo sapiens; OS: overall survival.
Kaplan-Meier survival curves for six miRNAs associated with ESCCpatients overall survival time (Horizontal axis overall survival time: days, Vertical axis survival function). hsa: homo sapiens; OS: overall survival.
Verification of selected miRNAs expression levels in ESCC tissues by qRT-PCR
Based on the above bioinformatics analyses, three miRNAs (miR-135b-5p, miR-15b-5p, and miR-195-5p) were randomly selected. The three key miRNA expression levels in newly diagnosed 51 ESCCpatients’ tumor samples and their self-paired normal esophagus epithelial tissues samples were compared to previous TCGA bioinformatics analysis results for reliability and validity. The expression levels of miR-135b-5p and miR-15b-5p were consistently upregulated in 51 ESCCpatienttumor tissue samples compared with normal esophageal epithelial tissues (P < 0.05); however, miR-195-5p expression levels were consistently downregulated (P < 0.05) [Table 2]. Subsequently, we also compared the increased or decreased fold-changes between these three miRNAs verified by qRT-PCR and TCGA ESCC RNA fold-changes. The fold-changes measured by qRT-PCR of the 51 collected ESCCpatients’ tissue samples and TCGA ESCC RNA sequencing results were consistent (supplementary Table S2 and Table 2).
Table 2
Relative expression of miRNAs in 51 pairs of ESCC tumor and non-tumor esophagus tissues (mean ± SD).
Relative expression of miRNAs in 51 pairs of ESCCtumor and non-tumor esophagus tissues (mean ± SD).
Relationships between the three selected miRNAs and ESCC clinical pathological features
We found that miR-135b-5p, miR-15b-5p, and miR-195-5p expression levels were significantly related with the 51 ESCCpatientstumor differentiation degrees (P < 0.05) [Table 3]; miR-195-5p was significantly related with tumorTNM stage (P < 0.05), and miR-135b-5p was significantly related with lymph-node metastasis (P < 0.05). The differential expression of miR-15b-5p was significantly associated with the 51 ESCCpatients’ sex (P < 0.05). These three miRNAs from ESCCpatients’ correlated with the above TCGA bioinformatics comprehensive analysis [Table 1] and were in good agreement, suggesting that TCGA bioinformatics analysis was reliable.
Table 3
Association between the expression of miR-135b-5p, miR-15b-5p, miR-195-5p and clinicopathological characteristics of ESCC patients.
Association between the expression of miR-135b-5p, miR-15b-5p, miR-195-5p and clinicopathological characteristics of ESCCpatients.
Discussion
ESCC is the main type of esophageal cancer and is a common cause of cancer-related death worldwide.[ ESCC development and progression is a complex process, which includes a variety of obvious changes in gene expression levels and subsequent physiological changes. MiRNAs are small, non-coding regulatory RNAs that inhibit target mRNAs.[ Recent advances in miRNA expression profiling have suggested new directions to study the pathological changes of ESCC, and also indicate that miRNAs have great potential as novel biomarkers in ESCC diagnosis and prognosis.[ Recent studies mainly concentrated on small sample size RNA sequencing or microarray data detection and, therefore, may not accurately reflect the dysregulated miRNAs that link to tumorigenesis and progression of ESCC.[ Therefore, the ESCC related significant differences of miRNAs should be identified in a large sample size to enhance accuracy and reliability of predictive value.In this study, we analyzed the dysregulated miRNAs in ESCC from TCGA database. According to the ESCCpatient tissue sample miRNA sequencing data sets in TCGA, we further explored the significant differences of miRNAs related to patients’ clinical features and overall survival time by using integrated bioinformatics analysis. Three miRNAs were selected randomly and we detected their expression levels in 51 newly collected ESCCpatients’ tumor samples and their self-paired normal esophageal epithelial tissue samples by qRT-PCR. We further analyzed the correlations between the three miRNAs and ESCCpatients’ clinical pathological characteristics to confirm the accuracy and reliability of the bioinformatics analysis results.According to the integrated bioinformatics overlapping subclasses analysis results we found 35 dysregulated miRNAs in ESCCtumor tissues and significant correlation between ESCC histological grades and lymphatic metastasis status. Among these 35 dysregulated miRNAs, some have been detected that showed significant differences expressed in esophagus cancer, including upregulated miR-205, which may act as an oncogene in ESCC.[ Alder et al[ reported that miR-31 and miR-21 were dysregulated in ESCC and their expression levels were associated with zinc deficiency promoted esophageal cancer. Gu et al[ investigated the miRNA diagnostic values in ESCC pathogenesis and prognosis, and found that in ESCC upregulation of miR-93, miR-106, and miR-205 were closely related with ESCC pathogenesis and prognosis. In addition, upregulated miR-20b-5p, miR-93-5p, miR-16-5p, miR-330-5p, miR-106b-5p, and miR-484 were reported in esophageal cancer and were involved in regulating the progression of this disease.[ The downregulation of miR-133a expression showed changes from Barrett esophagus to esophageal cancer.[ Derouet et al reported that miR-145 expression effected the progression of esophageal cancer by enhancing cell invasion and apoptosis.[ MiR-143-3p, miR-125a-5p, and miR-101-3p levels also were reported to be significantly different in esophageal cancer tissues and cell lines.[ For the remaining 16 dysregulated miRNAs, including 10 upregulated and six downregulated miRNAs, their functions in the development and progression of ESCC have not been reported.Based on the theory that mRNA transcripts bind by single-stranded miRNAs,[ we constructed a miRNAs-mRNAs network according to their negative regulatory relationship. Among these ESCC related 35 intersecting miRNAs, there were 28 key miRNAs involved in the miRNAs-mRNAs co-expression network. The miRNAs-mRNAs co-expression network revealed potential regulatory relationships between miRNAs and mRNAs in ESCC. In the miRNAs-mRNAs co-expression network, we also found some of these 28 key miRNAs were reported as potential diagnostic and prognostic biomarkers in ESCC, including miR-101-3p, miR-125a-5p, miR-16-5p, and miR-20b-5p.[ To further analyze these negatively regulated 72 mRNAs in the co-expression network, we showed that some played crucial roles in the development and progression of ESCC, such as E2F2, FGF2, and TCF3.[ Subsequently, we analyzed these 28 key miRNAs in the miRNAs-mRNAs co-expression network indirectly by GO enrichment analyses and they were involved in cell signaling pathway regulation. GO analysis results suggested that the enriched GOs mainly targeted cell molecules and biological processes. The pathway analysis also revealed that some pathways were related with cancer, such as the PI3K-AKT signaling pathway and MAPK signaling pathway.[ Ishibashi et al also reported that the overexpressed miR-141-3p targeted PHLPP-2 gene expression and affected the activity of the PI3K/AKT pathway in ESCC.[ Therefore, our bioinformatics analysis results suggested that these 28 key miRNAs may play crucial roles in the progression of ESCC and cancer pathways.To investigate the correlation between the 28 key miRNA expression levels and the 161 ESCCpatients’ clinical pathology features, we identified 17 miRNAs that were related with 161 ESCCpatients’ clinical pathology features, including miR-32-5p, miR-16-5p, miR-32-5p, and miR-183-5p. These miRNAs mainly related with ESCCpatients’ tumor grade, TNM stages, and lymphatic metastasis status. Among these 17 miRNAs, miR-16-5p, miR-330-5p, miR-145-5p, miR-93-5p, miR-93-5p, miR133a-3p, miR-28-5p, miR-15b-5p, miR-20b-5p, miR-141-3p, and miR-320a were previously reported to be important indicators in lymphatic metastasis, invasion, and as diagnostic biomarkers of ESCC.[ However, the functions of other miRNAs have not been correlated with ESCC clinical pathology features. We further analyzed the relationships between the 28 key miRNAs and ESCCpatients’ overall survival time. There were six miRNAs that correlated with ESCCpatients’ overall survival in TCGA. Among these six miRNAs, only miR-15b-5p has been reported with a correlation to ESCC survival,[ and other miRNAs have not been studied. Our bioinformatics analyses revealed the potential of using miRNAs as biomarkers for diagnosis, classification, and prognosis of ESCC.Finally, miR-135b-5p, miR-15b-5p, and miR-195-5p were selected and validated using TCGA bioinformatics analysis results from 51 newly collected ESCCpatients’ tissue samples. The correlation analyses between these three miRNA expression levels and the ESCCpatients’ clinical pathology features also were investigated. The results suggested that the ESCCpatients’ clinical pathology correlated with TCGA bioinformatics analyses results.In summary, the present study successfully identified the correlation of ESCC-specific miRNAs from a large-scale sample in TCGA database by using integrated bioinformatics analyses. We investigated the ESCC related specific miRNAs in different clinical pathology features and patients’ overall survival time. Finally, we validate the reliability and validity of TCGA database ESCCpatients’ RNA sequencing integrated bioinformatics results by qRT-PCR. We found that miRNAs (miR-16-5p, miR-183-5p, miR-32-5p, miR-330-5p, miR-135b-5p, miR-195-5p, miR-145-5p, miR-93-5p, miR-185-5p, miR133a-3p, miR-28-5p, miR-15b-5p, miR-106b-5p, miR-182-5p, miR-20b-5p, miR-141-3p, and miR-320a) and (miR-200b-3p, miR-31-5p, miR-15b-5p, miR-141-3p, miR-135b-5p and miR-195-5p) were related with ESCCpatients’ clinical pathology features and patients’ overall survival time, respectively. Taken together, above TCGA database ESCCpatients’ clinical pathology features and overall survival time related miRNAs are worthy of further study about their application functions as biomarkers indicators in diagnosis, clinical pathology classification, and prognosis estimation of ESCCpatients.
Acknowledgements
We thank Dong-Lin Cheng who supported the bioinformatics analysis technical assistance in the project.
Funding
This study was supported by grants from the Fundamental Research Funds for the Central Universities (No. lzujbky-2018-13), the National Natural Science Foundation of China (No. 81803188), and the Gansu Province Science and Technology Project (No. 1606RJYA270).
Authors: Xifeng Wu; Jaffer A Ajani; Jian Gu; David W Chang; Weiqi Tan; Michelle A T Hildebrandt; Maosheng Huang; Kenneth K Wang; Ernest Hawk Journal: Cancer Prev Res (Phila) Date: 2013-03