Literature DB >> 36051876

Survival estimation in patients with stomach and esophageal carcinoma using miRNA expression profiles.

Srinivasulu Yerukala Sathipati1, Ming-Ju Tsai2,3, Tonia Carter1, Patrick Allaire1, Sanjay K Shukla1, Afshin Beheshti4,5, Shinn-Ying Ho6,7,8.   

Abstract

Identifying a miRNA signature associated with survival will open a new window for developing miRNA-targeted treatment strategies in stomach and esophageal cancers (STEC). Here, using data from The Cancer Genome Atlas on 516 patients with STEC, we developed a Genetic Algorithm-based Survival Estimation method, GASE, to identify a miRNA signature that could estimate survival in patients with STEC. GASE identified 27 miRNAs as a survival miRNA signature and estimated the survival time with a mean squared correlation coefficient of 0.80 ± 0.01 and a mean absolute error of 0.44 ± 0.25 years between actual and estimated survival times, and showed a good estimation capability on an independent test cohort. The miRNAs of the signature were prioritized and analyzed to explore their roles in STEC. The diagnostic ability of the identified miRNA signature was analyzed, and identified some critical miRNAs in STEC. Further, miRNA-gene target enrichment analysis revealed the involvement of these miRNAs in various pathways, including the somatotrophic axis in mammals that involves the growth hormone and transforming growth factor beta signaling pathways, and gene ontology annotations. The identified miRNA signature provides evidence for survival-related miRNAs and their involvement in STEC, which would aid in developing miRNA-target based therapeutics.
© 2022 The Author(s).

Entities:  

Keywords:  Machine learning; Stomach and esophageal carcinoma; Survival estimation; miRNA signature

Year:  2022        PMID: 36051876      PMCID: PMC9421182          DOI: 10.1016/j.csbj.2022.08.025

Source DB:  PubMed          Journal:  Comput Struct Biotechnol J        ISSN: 2001-0370            Impact factor:   6.155


Introduction

Stomach and esophageal carcinomas (STEC) are among the most prevalent malignant diseases causing thousands of deaths globally. Worldwide, stomach cancer ranks sixth in cancer incidence, with 1,089,103 new cases, and third in cancer morality, with 768,793 deaths, while esophageal cancer ranks tenth in cancer incidence, with 604,100 new cases, and sixth in cancer morality, with 544,076 deaths, based on estimates for the year 2020 [1]. STEC ranks higher in mortality than incidence because these cancers are often first diagnosed at an advanced stage. In the United States, diagnosis occurs at a localized, regional, and distant stage in 28 %, 32 %, and 40 %, respectively, of stomach cancer cases, and in 25 %, 29 %, and 31 %, respectively of esophageal cancer cases [2], [3]. For localized, regional, and metastatic disease, five-year survival is 64 %, 28.2 %, and 5.3 %, respectively, for stomach cancer, and 46.7 %, 25.1 %, and 4.8 %, respectively, for esophageal cancer [2], [3]. Treatment for STEC is selected based on disease stage [4], [5]. Surgery can be curative but is offered mainly in early disease stages. Chemotherapy and chemoradiotherapy provide an added survival benefit to surgery in early-stage disease and are offered without surgery in later disease stages. Targeted therapies (e.g., Trastuzumab, an inhibitor of human epidermal growth factor receptor 2) improve survival in STEC and are increasingly being used in STEC treatment [6], and immunotherapy and other emerging therapies continue to be evaluated for improvement in STEC survival [4], [5]. Biomarkers associated with STEC survival are potential targets for designing new STEC treatments to improve patient survival [7], [8]. MicroRNAs (miRNAs) function as oncogenes or tumor suppressor genes in STEC [6], [7] and have been investigated as biomarkers of STEC diagnosis and prognosis [9], [10]. Roles for miRNAs in STEC progression and survival have been described in several reports. For example, low levels of miR148a, a miRNA that suppresses cell invasion and migration, are associated with advanced clinical stage and poor prognosis in stomach cancer [11]. MiR-616-3p promotes angiogenesis and metastasis and is correlated with poor prognosis in stomach cancer [12]. Elevated miR-21 expression is linked to lymph node metastasis [13] and poor prognosis [14] in esophageal cancer. MiR-375 targets proteins involved in cancer cell proliferation and invasion [15], and its downregulation is associated with advanced cancer staging and poor prognosis in esophageal squamous cell carcinoma [16]. Aberrant miRNA expression has also been identified in STEC. Hwang et al. identified miRNAs, including miR-601, miR-107, miR-18a, miR-370, miR-300 and miR-96 that were significantly expressed in early gastric cancers when compared to normal samples [17]. A serum biomarker miRNA panel consisting of 12 miRNAs was developed for risk assessment in patients with gastric cancer [18]. Furthermore, several dysregulated miRNAs have been found in esophageal tumors that regulate carcinogenesis [19], [20]. A quantitative RT-qPCR study on patients with esophageal carcinoma revealed three miRNAs, including miR-34a-5p, miR-148a-3p and miR-181a-5p that were associated with the cancer progression [21]. In most studies, associations between miRNAs and STEC survival have been based on results from a single study sample assessed using the log-rank test to compare Kaplan-Meier survival curves or Cox proportional hazards regression analysis [22], [23], [24], [25]. A few other studies have employed discovery and validation stages in their design to increase the strength of the evidence supporting associations between miRNAs and STEC survival. These include studies that have identified differentially expressed miRNAs in STEC in the discovery stage and tested for association between the miRNAs and survival in an independent STEC study sample in the validation stage [26], [27], [28], [29], [30], [31]. Machine learning methods are also being applied to identify miRNAs associated with STEC survival. In a study of esophageal squamous cell carcinoma, a recursive feature elimination-support vector machine algorithm along with LASSO Cox proportional hazards regression was used to identify miRNAs associated with survival and build a prognostic model in a training sample, and the prognostic model was shown to correlate with survival in an independent, test sample [32]. While these previous reports indicate that miRNAs have potential clinical value as biomarkers of prognosis in STEC, they have not addressed whether miRNAs can predict STEC survival time in individual patients. To design a personalized survival prediction model, it is necessary to identify biomarkers that show a robust association with survival in STEC patients. Accordingly, this study aimed to develop a genetic algorithm (GA)-based survival estimation method (GASE) to identify a survival-associated miRNA signature and estimate survival time in patients with STEC. A genetic algorithm (GA)-based survival estimation method (GASE) is proposed for estimating the survival time in STEC patients using miRNA expression profiles. GASE was developed using support vector regression (SVR) that incorporates an optimal feature selection algorithm inheritable bi-objective combinatorial genetic algorithm (IBCGA) [33]. The identified miRNA signature was analyzed further to explore miRNA association with STEC. The system overview of GASE is shown in the graphical abstract.

Material and methods

The miRNA expression profiles of patients with STEC were retrieved from The Cancer Genome Atlas (TCGA) database. These data were generated using an Illumina Hiseq 2000 sequencing platform. The number of patients with STEC in the initial dataset was 628. After excluding the patients without survival information and those whose survival time was less than 30 days, the final dataset consisted of 123 patients with miRNA expression profiles and clinical data, including days to death. Each miRNA expression profile consisting of 500 miRNAs was used for the survival estimation procedure. For the independent validation, we used a cohort of 393 patients who were alive with STEC at last follow-up in the TCGA.

Survival estimation method GASE

The GASE’s two primary objectives were to estimate the survival time and simultaneously identify the miRNA signature associated with survival in patients with STEC. GASE was developed using SVR and an optimal feature selection algorithm IBCGA. The optimization technique implemented in GASE was adopted from previous studies [34], [35], [36]. SVM is a supervised machine learning method, which has demonstrated good prediction capability in solving classification and regression problems in various biomedical fields, especially in cancer genomics [37]. SVR uses a nonlinear transformation to find the relation between input and output variables by generating a hyperplane that optimally fits in the high dimensional space and carries out the regression function [38]. The tuning of the parameters C, γ, and ν determine the performance of SVR; hence parameter tuning plays a vital role in the SVR modeling process. The minimization of the loss function can be optimized using the following objective function for the given input data points.where ||w|| is the magnitude of the vector to the surface, C is a regularization parameter, ξ and ξ are slack variables, ξ ≥ 0, ξ ≥ 0, and i = 1,2,…N. The optimal parameters of GASE were tuned based on an intelligent evolutionary algorithm (IEA) [39]. In the optimization process, IBCGA [33] was used to identify a small set of miRNAs while maximizing the fitness function in terms of squared correlation coefficient. GASE prediction performance was evaluated using two metrics, squared correlation coefficient and mean absolute error. IBGCA effectively solves bi-objective combinatorial problems where a small set of informative features will be selected from a large number of candidate features. The applications of IBCGA in identifying biomarkers in cancer research have been demonstrated in previous studies [34], [35], [36], [40], [41]. In the optimal feature selection process, all the candidate features were encoded into binary variables, including the parameters C, γ, and ν of the SVR. The detailed steps involved in IBCGA can be found in the supplementary methods. After identifying the miRNA signature, main effect difference (MED) [42] analysis was used to prioritize the miRNAs of the signature based on their contribution to the prediction performance.

Feature appearance score

To ensure robustness, we performed 50 independent runs of GASE and selected one feature set with the highest appearance score for the analysis. The feature appearance score (FAS) indicates the frequency of the features that appeared in the 50 independent runs. A feature set with a more significant appearance score suggests that the feature frequency in that particular set is higher when compared to other features across the independent runs. There are S features in the t-th signature. The frequency score for each feature m presented in the miRNA signatures can be calculated as follows.where m is the miRNA of the t-th signature.

LASSO and elastic net

To evaluate the estimation ability of GASE, we compared the prediction performance with some standard regression methods, including ridge [43], Lasso [44] and elastic net [45]. We used the miRNA expression profiles and survival time of 123 patients with STEC as input. The minimum λ was selected after 100 independent runs of LASSO and elastic net using 10-CV.

Strong evidence on miRNA-gene target interaction

To identify the target genes of the selected miRNAs, we used the miRTarBase (9.0 beta) database [46] to extract the experimentally verified microRNA-target interactions (MTIs) with strong evidence, which are validated by reporter assay, Western blot, and qPCR.

Gene set enrichment test

Gene-set libraries are used to organize accumulated knowledge about the function of groups of genes. We used Enrichr [47], [48], which is a web-based application that includes the latest gene-set libraries, to perform gene-set enrichment analysis. We evaluated the ability of Enrichr to rank terms from gene-set libraries by combining the p-value computed using Fisher’s exact test with the z-score of the deviation from the expected rank by multiplying these two numbers as follows:where z = z-score and p = p-value. This study used six Gene-set libraries, including 1) WikiPathway Human 2021 [49], 2) Kyoto Encyclopedia of Genes and Genomes (KEGG), 3) MSigDB Hallmark [50], 4) Gene Ontology Molecular Function 2021 [51], 5) Gene Ontology Biological Process, and 6) Gene Ontology Cellular Component.

Results

GASE prediction performance

We used a survival estimation method, GASE, to identify a miRNA signature and estimate the survival time in patients with STEC. One hundred and twenty-three patients with miRNA expression profiles were retrieved from the TCGA database. GASE identified 27 miRNAs as a survival miRNA signature and estimated the survival time with a mean squared correlation coefficient (R2) of 0.80 ± 0.01 and a mean absolute error (MAE) of 0.44 ± 0.25 years between actual and estimated survival times. A robust miRNA signature was selected by measuring the frequency appearance score (FAS) using 50 independent runs of GASE. A miRNA signature with the highest FAS indicates higher frequencies of miRNAs in the signature across the independent runs of GASE. The mean FAS obtained for the independent runs was 15.55 ± 1.45, while the highest FAS was 18.85 (shown in Supplementary Fig. S1 and Supplementary Table S1). The feature set with the highest FAS was selected for the analysis. This feature set obtained a R2 of 0.80 and a MAE of 0.43 years between actual and estimated survival times, and selected 27 miRNAs as a signature to estimate survival time in patients with STEC.

Prediction performance comparison and validation

Next, we compared GASE with some standard machine learning methods on their performance to predict survival times. The machine learning methods used in the comparison included ridge regression, least absolute shrinkage and selection operator (Lasso) and elastic net. Ridge regression obtained a R2 of 0.77 and a MAE of 0.54 years between actual and estimated survival times. Lasso obtained a R2 of 0.51 and a MAE of 0.69 years between actual and estimated survival times, and elastic net obtained a R2 of 0.50 and a MAE of 0.71 years between actual and estimated survival times, respectively. In comparison, GASE obtained a highest R2 of 0.83 and a MAE of 0.41 years between actual and estimated survival times (Table 1). The results indicated that the performance of GASE was better than that of the standard machine learning methods. The correlation plots of GASE and the other machine learning methods are shown in Supplementary Fig. S2A-D.
Table 1

Prediction performance of GASE.

MethodR2MAE (years)Features selected
Ridge regression0.770.54485
LASSO0.510.6928
Elastic net0.500.7130
GASE-FAS0.800.4327
GASE-Best0.830.4132
GASE-Mean0.80 ± 0.010.44 ± 0.2533.44 ± 3.59
Prediction performance of GASE. Next, the estimation ability of GASE was validated using a validation dataset consisting of 393 patients with STEC along with their follow-up times. The follow-up times of these patients were in the range of 0.3–56 months. We attempted to estimate the survival times of these patients using the GASE prediction model. The mean follow-up times observed in patients with STEC was 8.09 ± 12.09 months. The mean predicted survival time of these patients was 17.74 ± 10.50 months. GASE achieved an accuracy of 80.41 % for estimating the survival times of patients whose estimated survival times were higher than the follow-up times (mean follow-up time 4.0 ± 5.9 months). The mean estimated survival time of the 316 patients was 19.10 ± 10.28 months, and a mean prediction error of 12.15 months was obtained for the remaining patients. The results could be interpreted as follows: an estimated survival time that was higher than the patient’s follow-up time was considered as a correct prediction, whereas an estimated survival time that was lower than the follow-up time was a considered a prediction error. The follow-up and estimated survival times of these patients are shown in Fig. 1.
Fig. 1

The GASE prediction performance on an independent test cohort of 393 patients with follow-up times.

The GASE prediction performance on an independent test cohort of 393 patients with follow-up times.

Ranking of miRNA signature

The miRNAs of the identified miRNA signature were ranked based on their contribution towards estimating the survival time using main effect difference (MED) [42] analysis. A higher MED score represents greater contribution towards the prediction of survival time. A miRNA with a higher MED score indicates superior prediction ability towards the survival time estimation, whereas a lower-scoring miRNA indicates a smaller contribution to survival time estimation. The top 10 ranked miRNAs according to the MED analysis, include hsa-miR-760, hsa-miR-767-5p, hsa-miR-1301-3p, hsa-miR-891a-5p, hsa-miR-532-5p, hsa-miR-29a-5p, hsa-miR-16-5p, hsa-miR-130a-5p, hsa-miR-329-3p, and hsa-miR-496 (Table 2). The prioritization of miRNAs based on their contribution to the survival estimation is shown in Fig. 2.
Table 2

Ranking of miRNA signature and corresponding MED scores.

RankmiRNAMIMAT-IDMED
1hsa-miR-760MIMAT00049571.728135
2hsa-miR-767-5pMIMAT00038821.480966
3hsa-miR-1301-3pMIMAT00057971.344602
4hsa-miR-891a-5pMIMAT00049021.14225
5hsa-miR-532-5pMIMAT00028881.139153
6hsa-miR-29a-5pMIMAT00045030.887408
7hsa-miR-16-5pMIMAT00000690.88658
8hsa-miR-130a-5pMIMAT00045930.863724
9hsa-miR-329-3pMIMAT00016290.844311
10hsa-miR-496MIMAT00028180.818043
11hsa-miR-20a-3pMIMAT00044930.724058
12hsa-miR-125a-5pMIMAT00004430.63757
13hsa-miR-181b-5pMIMAT00002570.590379
14hsa-miR-675-3pMIMAT00067900.578151
15hsa-miR-9-5pMIMAT00004410.484588
16hsa-miR-664a-5pMIMAT00059480.425219
17hsa-miR-93-5pMIMAT00000930.364274
18hsa-miR-30e-5pMIMAT00006920.355408
19hsa-miR-376c-3pMIMAT00007200.345478
20hsa-miR-326MIMAT00007560.312151
21hsa-miR-193a-5pMIMAT00046140.275742
22hsa-miR-532-3pMIMAT00047800.268942
23hsa-miR-625-3pMIMAT00048080.259763
24hsa-miR-106a-5pMIMAT00001030.213424
25hsa-let-7 g-5pMIMAT00004140.152833
26hsa-let-7f-5pMIMAT00000670.04358
27hsa-miR-193b-5pMIMAT00047670.010963
Fig. 2

Chord diagram showing the prioritization of miRNAs of the signature based on their survival estimation ability in stomach and esophageal carcinoma. The size of the line is proportional to the percent contribution towards the survival estimation.

Ranking of miRNA signature and corresponding MED scores. Chord diagram showing the prioritization of miRNAs of the signature based on their survival estimation ability in stomach and esophageal carcinoma. The size of the line is proportional to the percent contribution towards the survival estimation.

Diagnosis prediction

The diagnostic ability of the identified miRNA signature was measured by distinguishing healthy and STEC patients using CancerMiRNome database [52]. The individual miRNAs that compose the miRNA signature had AUCs in a range of 0.49–0.94 for distinguishing healthy from STEC patients, as shown in Table 3. Among the signature miRNAs, 13 miRNAs, including hsa-miR-93-5p, hsa-miR-1 81b-5p, hsa-miR-125a-5p, hsa-miR-1301-3p, hsa-miR-30e-5p, hsa-miR-767-5p, hsa-miR-16-5p, hsa-miR-675-3p, hsa-miR-326, hsa-miR-760, hsa-miR-20a-3p, hsa-miR-664a-5p, and hsa-miR-130a-5p were good diagnostic predictors of esophageal carcinoma (ESCA) (AUC ≥ 0.70), as shown in Fig. 3. Ten miRNAs, including hsa-miR-30e-5p, hsa-miR-1301-3p, hsa-miR-125a-5p, hsa-miR-93-5p, hsa-miR-326, hsa-miR-532-5p, hsa-miR-9-5p, hsa-miR-181b-5p, hsa-miR-193a-5p, and hsa-let-7 g-5p were good diagnostic predictors of stomach adenocarcinoma (STAD) (AUC ≥ 0.7), as shown in Fig. 4.
Table 3

Diagnosis prediction of patients with STEC using the miRNA signature.

miRNAsESCA-AUCSTAD-AUC
hsa-miR-7600.730.60
hsa-miR-767-5p0.780.63
hsa-miR-1301-3p0.820.82
hsa-miR-891a-5p0.520.53
hsa-miR-532-5p0.590.78
hsa-miR-29a-5p0.570.64
hsa-miR-16-5p0.770.49
hsa-miR-130a-5p0.700.56
hsa-miR-329-3p0.550.53
hsa-miR-4960.600.59
hsa-miR-20a-3p0.730.62
hsa-miR-125a-5p0.840.81
hsa-miR-181b-5p0.870.77
hsa-miR-675-3p0.740.54
hsa-miR-9-5p0.490.78
hsa-miR-664a-5p0.730.69
hsa-miR-93-5p0.940.81
hsa-miR-30e-5p0.820.85
hsa-miR-376c-3p0.570.66
hsa-miR-3260.740.81
hsa-miR-193a-5p0.590.73
hsa-miR-532-3p0.520.68
hsa-miR-625-3p0.680.5
hsa-miR-106a-5p0.620.51
hsa-let-7 g-5p0.640.7
hsa-let-7f-5p0.500.62
hsa-miR-193b-5p0.580.58

Abbreviation: ESCA-Esophageal carcinoma, STAD-Stomach adenocarcinoma, AUC-Area under the receiver operating curve.

Fig. 3

Diagnosis prediction ability of miRNAs was evaluated in ESCA using ROC curves.

Fig. 4

Diagnosis prediction ability of miRNAs was evaluated in STAD using ROC curves.

Diagnosis prediction of patients with STEC using the miRNA signature. Abbreviation: ESCA-Esophageal carcinoma, STAD-Stomach adenocarcinoma, AUC-Area under the receiver operating curve. Diagnosis prediction ability of miRNAs was evaluated in ESCA using ROC curves. Diagnosis prediction ability of miRNAs was evaluated in STAD using ROC curves.

Expression differences of the miRNA signature

Expression difference analysis was performed to measure the significance in the expression levels of the identified miRNA signature between normal and tumor tissues of ESCA and STAD patients using the CancerMiRNome database [52]. There were 14 miRNAs, including hsa-miR-625-3p, hsa-miR-664a-5p, hsa-miR-326, hsa-miR-130a-5p, hsa-miR-20a-3p, hsa-miR-675-3p, hsa-miR-760, hsa-miR-16-5p, hsa-miR-767-5p, hsa-miR-1301-3p, hsa-miR-125a-5p, hsa-miR-181b-5p, hsa-miR-93-5p, and hsa-miR-30e-5p which showed a significant difference (p < 0.05) between normal and ESCA samples (Table 4). There were 19 miRNAs, including hsa-miR-664a-5p, hsa-miR-767-5p, hsa-let-7 g-5p, hsa-let-7f-5p, hsa-miR-376c-3p, hsa-miR-29a-5p, hsa-miR-760, hsa-miR-1301-3p, hsa-miR-532-5p, hsa-miR-20a-3p, hsa-miR-125a-5p, hsa-miR-181b-5p, hsa-miR-9-5p, hsa-miR-93-5p, hsa-miR-30e-5p, hsa-miR-326, hsa-miR-193a-5p, hsa-miR-532-3p, and hsa-miR-193b-5p, that had significantly different expression between normal and STAD patients (Table 4). The top 10 ranked miRNAs and their expression differences between healthy and ESCA and STAD patients are shown in Fig. 5, Fig. 6, respectively.
Table 4

Expression differences of the miRNA signature between normal and tumor tissues.

miRNA signatureNormal vs ESCANormal vs STAD
p-valuep-value
hsa-miR-7600.0030.0297
hsa-miR-767-5p0.00040.0003
hsa-miR-1301-3p0.0001<0.0001
hsa-miR-891a-5p0.87330.6737
hsa-miR-532-5p0.3987<0.0001
hsa-miR-29a-5p0.34810.008
hsa-miR-16-5p0.00050.0911
hsa-miR-130a-5p0.01150.1496
hsa-miR-329-3p0.45760.6845
hsa-miR-4960.18570.1031
hsa-miR-20a-3p0.005<0.0001
hsa-miR-125a-5p<0.0001<0.0001
hsa-miR-181b-5p<0.0001<0.0001
hsa-miR-675-3p0.00330.1292
hsa-miR-9-5p0.9717<0.0001
hsa-miR-664a-5p0.01770.0002
hsa-miR-93-5p<0.0001<0.0001
hsa-miR-30e-5p<0.0001<0.0001
hsa-miR-376c-3p0.53890.0033
hsa-miR-3260.0117<0.0001
hsa-miR-193a-5p0.3494<0.0001
hsa-miR-532-3p0.8464<0.0001
hsa-miR-625-3p0.0230.9569
hsa-miR-106a-5p0.17830.9017
hsa-let-7 g-5p0.38720.0003
hsa-let-7f-5p0.79680.0026
hsa-miR-193b-5p0.343<0.0001

Abbreviation: ESCA-Esophageal carcinoma, STAD-Stomach adenocarcinoma.

Fig. 5

Comparison of expression of the top 10 ranked miRNAs between normal and ESCA samples using boxplot representation (* indicates p < 0.05).

Fig. 6

Comparison of expression of the top 10 ranked miRNAs between normal and STAD samples using boxplot representation. (* indicates p < 0.05).

Expression differences of the miRNA signature between normal and tumor tissues. Abbreviation: ESCA-Esophageal carcinoma, STAD-Stomach adenocarcinoma. Comparison of expression of the top 10 ranked miRNAs between normal and ESCA samples using boxplot representation (* indicates p < 0.05). Comparison of expression of the top 10 ranked miRNAs between normal and STAD samples using boxplot representation. (* indicates p < 0.05).

MiRNA-gene target enrichment analysis

There were 558 miRNA target interactions (MTI) with strong evidence, which included 32 miRNAs and 352 target genes from miRTarBase (Supplementary Table S2). We performed gene-set enrichment analysis using three pathway libraries: WikiPathway, KEGG, and MSigDB Hallmark, shown in Fig. 7. The highly enriched pathways in WikiPathway, KEGG, and MSigDB Hallmark were the somatotrophic axis and its relationship to dietary restriction and aging (WP4186) (adjusted p-value: 1.34E-10, Odds ratio: 117888, combined score: 2,862,498), pancreatic cancer (adjusted p-value:1.54E-34, Odds ratio:44.55, combined score: 3655.72), and apoptosis (adjusted p-value:5.17E-20, Odds ratio: 12.68, combined score: 563.12), respectively, shown in Supplementary Tables S3-S5. Additionally, the miRNA signature-gene interaction network was built using miRTarBase [46], TarBase V8. [53] and miRecords [54]. There were 28,057 edges associated with 10,525 genes. We reduced the low priority edges using the shortest path network measures [55]. The final network, consisting of 832 edges 93 targeted genes, is shown in Supplementary Fig. S3.
Fig. 7

The pathways enrichment analysis of miRNA signature targeted genes in three categories, (A) wiki pathways, (B) KEGG pathways, and (C) MSigDB hallmark.

The pathways enrichment analysis of miRNA signature targeted genes in three categories, (A) wiki pathways, (B) KEGG pathways, and (C) MSigDB hallmark. The Gene Ontology (GO) annotations of the target genes were in three categories: biological process, molecular function, and cellular component. The highly enriched pathways for biological process, molecular function, and cellular component were positive regulation of smooth muscle cell apoptosis process (GO:0034393), I-SMAD binding (GO:0070411), and serine/threonine protein kinase complex (GO:1902554), respectively, as shown in Supplementary Figs. S4-S6 and Supplementary Table S6.

MiRNAs in cancers

The roles of the top 10 ranked miRNAs in various diseases and cancers were examined using the Human microRNA Disease Database (HMDD v3.2) [56], miRTarbase, and by reviewing the scientific literature. The information from these resources indicate that the top 10 ranked miRNAs are involved in STEC. A quantitative real-time PCR analysis reported that hsa-miR-760 is significantly downregulated in ESCA tissues and cell lines, suggesting that this miRNA could be used as a prognostic indicator [57]. Significant differential expression of hsa-miR-769-5p was observed in ESCA tissue when compared to the normal tissues [58]. Over-expression of hsa-miR-1301-3p induces cell proliferation and tumorigenesis in gastric cancer tissues [59]. Wu et al. reported the differential expression of hsa-miR-1301-3p in ESCA, suggesting that this miRNA could be used as a prognostic biomarker for ESCA [60]. Zhang and colleagues reported the downregulation of hsa-miR-532-5p in gastric cancer cells, and its expression is associated with poorer survival in patients with gastric cancer [61]. Tokumaru and colleagues demonstrated the association of hsa-miR-29a with overall survival in patients with gastric cancer, and lower expression of hsa-miR-29a worsens the overall survival in patients with gastric cancer [62]. Hsa-miR-16-5p has been utilized as a prospective biomarker for prognosis prediction in patients with gastric cancer and ESCA [63], [64]. Hsa-miR-130a-5p affects cell growth, migration and invasion by targeting cannabinoid receptor 1 in gastric cancer cells [65]; it also deregulates PTEN and controls malignant cell survival and tumor growth in multiple cancers [66]. Hsa-miR-329-3p acts as a tumor suppressor by targeting T lymphoma invasion and metastasis in gastric cancer cells and could be utilized as potential therapeutic target [67]. Hsa-miR-496 is downregulated in gastric cancer cell lines, and it inhibits cell proliferation via targeting Lyn kinase in gastric cancer cell lines [68]. Among the top 10 ranked miRNAs, the roles of two miRNAs, hsa-miR-769-5p and hsa-miR-891a-5p, have not been reported previously in either STAD or ESCA. Additionally, a miRNA-disease network was constructed for the miRNA signature using miRNet 2.0 [55]. The miRNAs of the signature were observed to be involved in several diseases. In the miRNA-disease association network, there were 12 nodes (miRNAs) with 132 edges associated with 85 diseases, shown in Supplementary Fig. S7.

Discussion

MiRNAs provide a way to explore disease mechanisms in various cancers, including STEC. The clinical applications of miRNAs in cancer rely on identifying miRNA signatures as potential biomarkers and developing miRNA-target based therapeutics. Accordingly, we developed a survival time estimation method, GASE, to identify a miRNA signature that was correlated with STEC patient survival. Computational methods for feature selection often suffer from issues related to data quality and high dimensionality, especially when dealing with biomedical data. To address the challenges to identifying the right biomarker, we used an optimal feature selection algorithm, IBCGA, which is good at identifying s small number of important features from a large number of candidate features. The optimization method was previously utilized to estimate the survival time in various cancers [34], [35], [36]. In this study, we exclusively focused on identifying a miRNA signature in patients with STEC. The proposed method, GASE, identified 27 miRNAs as a survival miRNA signature and performed better than standard machine learning methods in estimating survival time. Our evaluation of the diagnostic ability of the identified miRNAs revealed that 13 miRNAs were good diagnostic predictors (AUC ≥ 0.7) in ESCA and 10 miRNAs in STAD. The differential expression analysis between tumor and normal samples from patients with STEC revealed that several miRNAs had significantly different expression between tumor and normal samples. Further, previous reports provide evidence supporting the importance of the top 10 ranked miRNAs of the signature in STEC. The miRNA-gene target interaction analysis showed that the target genes were highly enriched in the somatotrophic axis and its relationship to dietary restriction and the aging (WP4186) pathway. The somatotrophic axis in mammals involves signaling by growth hormone (GH), which is produced by the anterior pituitary, and its secondary mediator, insulin-like growth factor 1 (IGF-1). In a previous study, growth hormone–releasing hormone and its receptor (GHRH-R) were found primarily in the anterior pituitary gland, gastric cancers, other solid tumors, and lymphomas. Increased levels of GHRH-R in tumor samples from patients with gastric cancer are associated with poor outcomes [69]. Another important enriched pathway of the miRNA signature was transforming growth factor beta (TGF-β) signaling pathway. TGF-β is a cytokine that participates in both physiological and pathological processes including tumorigenesis [70]. During tumor progression, TGF-β signaling regulates the immune/inflammatory response and the tumor microenvironment. It also regulates tumor growth, epithelial-mesenchymal transition (EMT), and cancer cell stemness depending on tumor stage and cellular context [71]. EMT is also an enriched pathway from MSigDB Hallmark (adjusted p-value: 8.24E-19, Odds ratio: 11.13, combined score: 506.81), which is consistent with this biological mechanism. Abnormal TGF-β signaling has been associated with progression of gastrointestinal cancer [72], which includes esophageal, gastric, liver, colorectal, and pancreatic carcinomas that, collectively, are major causes of cancer-related deaths worldwide [73]. Several TGF-β-based therapeutics have been developed for the treatment of gastrointestinal cancers and have displayed efficacy in clinical trials [74], [75]. Additional support for the role of TGF-β signaling in STEC was obtained from the GO annotation analysis, which showed that I-SMAD binding (GO:0070411) was enriched in the GO molecular function category (adjusted p-value: 1.03E-06). Nuclear accumulation of active SMAD complexes is crucial for the transduction of TGF-β superfamily signals from transmembrane receptors to the nucleus. The top hits for gene target enrichment analysis also indicated that the miRNA signature was related to miRNAs involved in DNA damage response, epidermal growth factor receptor tyrosine kinase inhibitor resistance, apoptosis, Wnt/beta-catenin signaling, and angiogenesis. DNA damage response pathways are known to be related to therapy resistance in STEC [76], [77], and resistance to epidermal growth factor receptor tyrosine kinase inhibitors are relevant to survival in STEC, consistent with the use of epidermal growth factor receptor tyrosine kinase inhibitors as targeted therapy in STEC [78], [79]. The Wnt/beta-catenin signaling pathway has been implicated in cancer progression in STEC [80], and the dysregulation of apoptosis and angiogenesis are known to promote tumor growth [81], [82]. This suggests the miRNAs in the signature and the putative gene targets of these miRNAs are possible molecular targets for exploitation in the pursuit to create new therapies for STEC. In addition to being associated with survival, the miRNAs in the signature could discriminate between healthy and STEC patients, and were differentially expressed between the healthy and tumor tissues of patients with STEC. This suggests that the capability of these miRNAs to function as prognostic or diagnostic biomarkers. Further investigation is needed to determine the utility of the miRNA signature as a prognostic biomarker for monitoring response to therapy or predicting survival after therapy in STEC patients and as a biomarker for early STEC diagnosis. Other questions for study are whether the miRNA signature can perform as a biomarker in STEC of different types and stages, and whether the miRNA signature can be detected in blood at a level of accuracy comparable to that in tumor tissue (to allow for the possibility of performing liquid biopsies for biomarker detection). In conclusion, a better understanding of the miRNA signature in survival predictions will aid in developing treatment strategies for STEC. We anticipate that the miRNA signature identified here could help in understanding the roles of miRNAs in STEC and developing miRNA-based cancer therapeutics.

Funding

This work was supported in part by the Marshfield Clinic Research Institute, Marshfield, WI. The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author contributions

S.Y.S. designed the system, carried out the detail study and supervised the study. S.Y.S, M.T, T.C, P.A, S.K.S, A.B and S.Y.H, participated in data analysis, manuscript preparation and discussed the results. All authors have read and approved the final manuscript.

Availability of data and materials

All the data used in this analysis can be found on the TCGA data portal [https://portal.gdc.cancer.gov/].

Ethics approval and consent to participate

Not applicable.

Consent to publish

Not applicable.

CRediT authorship contribution statement

Srinivasulu Yerukala Sathipati: Conceptualization, Data curation, Writing – original draft, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Supervision. Ming-Ju Tsai: Validation, Visualization, Formal analysis. Tonia Carter: Formal analysis, Data curation, Writing - review & editing. Patrick Allaire: Formal analysis. Sanjay K Shukla: Formal analysis. Afshin Beheshti: Formal analysis, Writing - review & editing. Shinn-Ying Ho: Formal analysis.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  77 in total

1.  Trastuzumab in combination with chemotherapy versus chemotherapy alone for treatment of HER2-positive advanced gastric or gastro-oesophageal junction cancer (ToGA): a phase 3, open-label, randomised controlled trial.

Authors:  Yung-Jue Bang; Eric Van Cutsem; Andrea Feyereislova; Hyun C Chung; Lin Shen; Akira Sawaki; Florian Lordick; Atsushi Ohtsu; Yasushi Omuro; Taroh Satoh; Giuseppe Aprile; Evgeny Kulikov; Julie Hill; Michaela Lehle; Josef Rüschoff; Yoon-Koo Kang
Journal:  Lancet       Date:  2010-08-19       Impact factor: 79.321

Review 2.  Epidermal growth factor receptor (EGFR)-targeted therapies in esophagogastric cancer.

Authors:  Sabarish Ayyappan; Dhivya Prabhakar; Neelesh Sharma
Journal:  Anticancer Res       Date:  2013-10       Impact factor: 2.480

3.  Esophageal Cancer.

Authors:  Matthew W Short; Kristina G Burgers; Vincent T Fry
Journal:  Am Fam Physician       Date:  2017-01-01       Impact factor: 3.292

4.  MicroRNA-575 regulates development of gastric cancer by targeting PTEN.

Authors:  Ya-Nan Wang; Fang Xu; Pei Zhang; Pan Wang; Ya-Ning Wei; Chen Wu; Shu-Jie Cheng
Journal:  Biomed Pharmacother       Date:  2019-03-04       Impact factor: 6.529

Review 5.  MicroRNA in the pathogenesis and prognosis of esophageal cancer.

Authors:  Jian Gu; Yan Wang; Xifeng Wu
Journal:  Curr Pharm Des       Date:  2013       Impact factor: 3.116

Review 6.  Epidemiology of esophageal cancer: update in global trends, etiology and risk factors.

Authors:  Dustin J Uhlenhopp; Eric Omar Then; Tagore Sunkara; Vinaya Gaduputi
Journal:  Clin J Gastroenterol       Date:  2020-09-23

7.  Enrichr: a comprehensive gene set enrichment analysis web server 2016 update.

Authors:  Maxim V Kuleshov; Matthew R Jones; Andrew D Rouillard; Nicolas F Fernandez; Qiaonan Duan; Zichen Wang; Simon Koplev; Sherry L Jenkins; Kathleen M Jagodnik; Alexander Lachmann; Michael G McDermott; Caroline D Monteiro; Gregory W Gundersen; Avi Ma'ayan
Journal:  Nucleic Acids Res       Date:  2016-05-03       Impact factor: 16.971

Review 8.  Contextual determinants of TGFβ action in development, immunity and cancer.

Authors:  Charles J David; Joan Massagué
Journal:  Nat Rev Mol Cell Biol       Date:  2018-07       Impact factor: 94.444

9.  MicroRNA Profiling in Oesophageal Adenocarcinoma Cell Lines and Patient Serum Samples Reveals a Role for miR-451a in Radiation Resistance.

Authors:  Frederike Butz; Ann-Kathrin Eichelmann; George C Mayne; Tingting Wang; Isabell Bastian; Karen Chiam; Shashikanth Marri; Pamela J Sykes; Bas P Wijnhoven; Eelke Toxopeus; Michael Z Michael; Christos S Karapetis; Richard Hummel; David I Watson; Damian J Hussey
Journal:  Int J Mol Sci       Date:  2020-11-24       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.