Literature DB >> 32175327

Systematical Analysis of the Cancer Genome Atlas Database Reveals EMCN/MUC15 Combination as a Prognostic Signature for Gastric Cancer.

Wentao Dai1,2,3, Jixiang Liu1,3, Bingya Liu2, Quanxue Li1,3,4, Qingqing Sang2, Yuan-Yuan Li1,2,3.   

Abstract

Digestive cancers-including gastric cancer (GC), colorectal cancer, hepatocellular carcinoma, esophageal cancer, and pancreatic cancer-accounted for 26% of cancer cases and 35% of cancer deaths worldwide in 2018. It is crucial and urgent to develop biomarkers for the diagnosis, prognosis, and therapeutic benefits of digestive cancers, especially for GC, since the incidence of GC is lower only than lung cancer in China, is hard to detect at an early stage, and is associated with poor prognosis. Mucins, glycoproteins encoded by MUC family genes, act as a part of a physical barrier in the digestive tract and participate in various signaling pathways. Some mucins have been used or proposed as biomarkers for carcinomas, such as MUC16 (CA125) and MUC4. However, there are no systematic investigations on the association of MUC family members with diagnoses and clinical outcomes even though relevant data have been largely accumulated in the past decade. By analyzing transcriptomic and clinical data of digestive cancer samples from TCGA involving colon adenocarcinoma (COAD), esophageal carcinoma (ESCA), liver hepatocellular carcinoma (LIHC), stomach adenocarcinoma (STAD), and pancreatic adenocarcinoma (PAAD), it was found that expressions levels of MUC15, MUC13, and MUC21 were individually associated with survival for digestive cancers, and high expressions of EMCN (MUC14) and MUC15 were correlated with poor survival for STAD. Cox regression analysis indicated the predictive power of an EMCN/MUC15 combination for overall survival (OS) of GC patients, which was validated on an independent dataset from GEO. EMCN/MUC15 correlated genes were identified to be enriched in cancer-related processes, such as vasculature development, mitosis, and immunity. Therefore, we propose that an EMCN/MUC15 combination could be a potential prognostic signature for gastric cancer.
Copyright © 2020 Dai, Liu, Liu, Li, Sang and Li.

Entities:  

Keywords:  EMCN; MUC family; MUC15; gastric cancer; prognostic

Year:  2020        PMID: 32175327      PMCID: PMC7055423          DOI: 10.3389/fmolb.2020.00019

Source DB:  PubMed          Journal:  Front Mol Biosci        ISSN: 2296-889X


Introduction

Digestive cancers are a group of cancers that occur in the digestive tract, and include gastric cancer (GC), colorectal cancer, hepatocellular carcinoma, esophageal cancer, and pancreatic cancer. Digestive cancers accounted for around 26% of cancer cases and 35% of cancer deaths in the world in 2018 (Bray et al., 2018). Among them, the morbidity and mortality of GC in Eastern Asia is much higher than the worldwide average level. In China, the incidence of GC is only lower than lung cancer, and the mortality is third to lung cancer and liver cancer (Chen et al., 2014). Most patients suffering from early stage GC are asymptomatic and always develop distant metastasis at the time of diagnosis (Van Cutsem et al., 2016; Bray et al., 2018). Surgery is the main treatment for GC. Adjuvant or neoadjuvant therapy combined with surgery is commonly used to treat advanced GC, while targeted drugs for advanced GC, such as the HER2 (also known as ERBB2) antibody trastuzumab, and the VEGFR-2 antibody ramucirumab, are still in clinical trials (Van Cutsem et al., 2016). Therefore, developing biomarkers for the diagnosis, prognosis, and therapeutic response of digestive cancers, especially of GC, is necessary and urgent for reducing the mortality rate. Mucins represent a group of glycoproteins encoded by MUC family genes. These high-molecular weight and filamentous glycoproteins could be classified into secreted mucins and membrane-bound mucins. In the digestive tract, secreted mucins form a mucus layer and act as part of a physical defensive barrier against external aggressive forces (Dekker et al., 2002; Dhanisha et al., 2018); membrane-bound mucins possess membrane specific domains which enable their diverse roles in signaling pathways (Dekker et al., 2002; Dhanisha et al., 2018). Not surprisingly, dysfunction of mucins in their fundamental roles is implicated in disease development at mucosal surfaces (Corfield, 2015; Dhanisha et al., 2018), and some mucins have been reported to display diagnostic or prognostic significance in different types of cancer. For example, MUC16, also known as CA125, is a widely used biomarker for the diagnosis of ovarian cancer (Yonezawa et al., 2011; Jonckheere and Van Seuningen, 2018) and was also found to be over-expressed in several other human malignancies, including pancreas, breast, and lung (Aithal et al., 2018). MUC4 promotes carcinogenetic progression and has been proposed as a promising biomarker for pancreatic, ovarian, esophagus, and lung cancers (Kaur et al., 2013; Jonckheere and Van Seuningen, 2018). MUC15 overexpression is significantly correlated with several types of cancers, including colon cancer, hepatocellular carcinoma, and thyroid cancer (Huang et al., 2009; Nam et al., 2011; Wang et al., 2013; Choi et al., 2018). Moreover, MUC4/MUC16/MUC20 high-expression signature was very recently reported to be correlated with poor overall survival (OS) in several types of digestive cancers including pancreatic, colon, and GCs (Jonckheere and Van Seuningen, 2018). However, there are no systematic investigations, so far, on the association of MUC family members with diagnosis, prognosis, and/or therapeutic benefits, even though the Cancer Genome Atlas (TCGA) project is producing massive genomic, transcriptomic, proteomic, and clinical data involving more than 11,000 patients of 33 different types of tumors (Weinstein et al., 2013), and meanwhile, a number of web tools, such as GEPIA (Tang et al., 2017) and cBioPortal for Cancer Genomics (Cerami et al., 2012; Gao et al., 2013), have been developed that enable users to easily and effectively mine TCGA data. In the present study, by analyzing digestive cancer samples from TCGA involving colon adenocarcinoma (COAD), esophageal carcinoma (ESCA), liver hepatocellular carcinoma (LIHC), stomach adenocarcinoma (STAD), and pancreatic adenocarcinoma (PAAD), we found that expression levels of MUC15, MUC13, and MUC21 were individually associated with survival for all these digestive cancers, and high expressions of EMCN (MUC14) and MUC15 were correlated with poor survival for STAD. Cox regression analysis showed that EMCN/MUC15 combination still exhibited a significant correlation with the OS of GC patients. The prognostic prediction power of signature EMCN/MUC15 was further validated on an independent GC dataset, GSE84437. EMCN/MUC15 top 50 correlated genes were identified to be enriched in cancer-related processes, including vasculature development, mitosis, immunity, and so on. Taken together, we propose EMCN/MUC15 combination as a potential prognostic signature for GC.

Materials and Methods

Datasets

Datasets were collected from TCGA[1] and GEO[2] (Barrett et al., 2012). Specifically, gene expression data (TPM, Transcripts Per Kilobase Million) and clinical data for digestive cancers including COAD, ESCA, LIHC, STAD, and PAAD, were analyzed with the online webserver GEPIA 1.0 (Tang et al., 2017). Among them, MUC family mRNA expression data (mRNA expression z-scores, which is based on RNASeqV2 processed and normalized using RSEM) and clinical profiles involving 407 STAD samples were extracted by using an online web tool cBioPortal for Cancer Genomics (Cerami et al., 2012; Gao et al., 2013). Additionally, GSE84437 were extracted from the GEO database, which involves mRNA microarray data and clinical profiles of 433 GC samples.

Survival Analysis

Kaplan–Meier (KM) survival analysis for digestive cancer samples as a whole was carried out by using the webserver GEPIA 1.0 (Tang et al., 2017), and for GC samples (TCGA-STAD from cBioPortal and GSE84437 R package survival[3] was used. KM analysis was based on individual gene expression value and survival data. By using the median expression value of a query gene in a certain sample group as a cutoff, the samples were split into high and low expression groups with the expression level of the query gene not less than and less than the cutoff. The Cox proportional hazard model was built by using R package survival, fitted with two genes’ expression values for OS or disease free survival (DFS). Similar to the individual gene analysis, the median value of weighted expression value (shortened as WEV) of a gene combination in a certain cohort were used as a group cutoff, where WEV was calculated as the sum of cox-regression coefficient weighted expression value of each gene involved in the combination. Log rank p-values, cox proportional hazard ratios (HRs), and HR p-values were calculated to compare the survival between two groups split by the median value of gene expression or WEV. A p-value of less than 0.05 and HR greater than 1.05 or less than 0.95 suggest statistical significance of the survival difference between high and low groups, which indicates the corresponding gene or gene combination has a prognostic potential.

Gene Co-expression Analysis and Enrichment Analysis

Gene co-expression analysis was carried out using webserver cBioPortal, and the top 25 positively correlated and top 25 negatively correlated genes were selected according to Spearman correlation coefficients, which were taken together and simplified as “top 50 correlated genes” in our results. Here, correlated genes met two criteria: the absolute value of Spearman correlation coefficient is greater than 0.25, and the p-value is less than 0.01. Gene set enrichment analysis (GSEA) was performed by using R package clusterProfiler (Yu et al., 2012). The pathways enriched for GO (Gene Ontology) (Ashburner et al., 2000; The Gene Ontology Consortium, 2019) were plotted based on the negative logarithm of p-value.

Results

MUC15, 13, and 21 Display Prognostic Potential for Digestive Cancer on TCGA

Aiming to assess the prognostic potentials of every MUC gene, KM survival analysis was applied to TCGA digestive cancer samples as a whole involving COAD, ESCA, LIHC, STAD, and PAAD by using the webserver GEPIA 1.0 (Tang et al., 2017). Among the 14 MUC family members with expression data available, the expression levels of MUC1, MUC5AC, MUC6, OVGP1 (MUC9), MUC13, EMCN (MUC14), MUC15, MUC16, MUC17, and MUC21 individually exhibited significant correlations with OS, with HR p-values less than 0.05 and HR greater than 1.05 or less than 0.95; similarly, MUC2, MUC3A, MUC12, MUC13, MUC15, MUC17, MUC20, and MUC21 were significantly correlated with DFS (Table 1 and Supplementary Figure S1). MUC13, MUC15, MUC17, and MUC21 were significant for both OS and DFS, among which MUC15 performed best for OS correlation and the second best for DFS correlation. In comparison, MUC13 displayed the best performance in DFS analysis, while ranked relatively lower (9th) in OS analysis; MUC21 ranked 3rd for OS, and 8th for DFS (Table 1 and Supplementary Figure S1). These indicate that MUC15 represents a promising candidate for developing strategies for prognosis prediction for digestive cancers.
TABLE 1

Survival analysis of TCGA digestive cancer samples for prognostic potentials of MUC family genes.

GeneHR p-value for OSOS p-value rankHR p-value for DFSDFS p-value rank
MUC11.3E−0560.2311
MUC20.69145.2E−083
MUC3A0.49121.7E−064
MUC5AC7.9E−0650.7414
MUC62.7E−0730.4112
OVGP1 (MUC9)0.002170.09410
MUC120.58130.000125
MUC130.03292.1E−081
EMCN (MUC14)0.044100.7113
MUC151.7E−0913.6E−082
MUC166.4E−0920.0599
MUC170.005380.000516
MUC200.36110.017
MUC219.8E−0740.018
Survival analysis of TCGA digestive cancer samples for prognostic potentials of MUC family genes.

MUC14 (EMCN) and 15 Display Prognostic Potential for Gastric Cancer on TCGA-STAD

To investigate the prognostic potentials of MUC family genes for STAD, we performed KM survival analysis exclusively on STAD samples from TCGA with R package survival. It was found that the expression levels of EMCN (MUC14) and MUC15 individually showed significant correlations with both OS and DFS, and MCAM (MUC18) was significant only with OS (Table 2). KM survival plots, together with log rank p-values, cox proportional HRs, and HR p-values summarized in Figure 1 indicated that EMCN performed better than MUC15 in both OS and DFS analyses. Overall, EMCN and MUC15 could be potential biomarkers for STAD prognosis.
TABLE 2

Survival analysis of TCGA STAD samples for prognostic potentials of MUC family genes.

GeneHR p-value for OSHR p-value for DFS
MUC10.6540.591
MUC20.1290.364
MUC40.90.203
MUC5B0.4410.753
MUC60.670.0854
OVGP1 (MUC9)0.6620.925
MUC120.9570.637
MUC130.05110.234
EMCN (MUC14)0.001540.00737
MUC150.01850.0141
MUC160.8250.0975
MUC170.1450.406
MCAM (MUC18)0.01670.323
MUC200.8910.62
MUC210.2240.745
FIGURE 1

Survival analysis of TCGA STAD samples for prognostic potentials of EMCN (MUC14) and MUC15. (A) Overall Survival (OS) of EMCN. (B) Disease Free Survival (DFS) of EMCN. (C) Overall Survival of MUC15. (D) Disease Free Survival of MUC15. Log rank p-values, hazard ratios (HRs) and hazard ratio p-values were calculated. The 95% confidence intervals for survival time were shown in as dotted lines in the Kaplan–Meier (KM) survival plot.

Survival analysis of TCGA STAD samples for prognostic potentials of MUC family genes. Survival analysis of TCGA STAD samples for prognostic potentials of EMCN (MUC14) and MUC15. (A) Overall Survival (OS) of EMCN. (B) Disease Free Survival (DFS) of EMCN. (C) Overall Survival of MUC15. (D) Disease Free Survival of MUC15. Log rank p-values, hazard ratios (HRs) and hazard ratio p-values were calculated. The 95% confidence intervals for survival time were shown in as dotted lines in the Kaplan–Meier (KM) survival plot.

EMCN/MUC15 Combination Could Serve as Prognostic Signature for Gastric Cancer

So far we have observed that high expressions of both EMCN and MUC15 were associated with poor prognosis in GC, and that EMCN and MUC15 displayed the strongest correlation to survival for GC and digestive cancers, respectively (Table 2 and Figure 1). Thus, we set out to investigate whether EMCN/MUC15 combination could be a prognostic signature for GC. Cox proportional hazards regression analysis was performed based on the two genes’ expression values and OS data derived from TCGA STAD dataset. As expected, the expression of EMCN/MUC15 combination exhibited significant correlation with OS, with log rank p-value of 0.00299 and HR p-value of 0.00301 (Figure 2A).
FIGURE 2

Overall survival analyses of gastric cancer (GC) samples from TCGA STAD (A) and GSE84437 (B) for predictive power of EMCN/MUC15 signature. Log rank p-values, hazard ratios (HRs) and hazard ratio p-values were calculated. The 95% confidence intervals for survival time were shown as dotted lines in the Kaplan–Meier survival plot.

Overall survival analyses of gastric cancer (GC) samples from TCGA STAD (A) and GSE84437 (B) for predictive power of EMCN/MUC15 signature. Log rank p-values, hazard ratios (HRs) and hazard ratio p-values were calculated. The 95% confidence intervals for survival time were shown as dotted lines in the Kaplan–Meier survival plot. We then separately tested the prognostic prediction power of EMCN, MUC15 and their combination on an independent dataset, GSE84437, which involved 433 GC samples. Again, significant results of EMCN/MUC15 combination (HR = 1.33) were obtained with log rank p-value being 0.0419 and HR p-value being 0.0413 (Figure 2B); while one single gene, EMCN (HR p-value of 0.0807, HR = 1.27) or MUC15 (HR p-value of 0.156, HR = 0.82), had no significant prognostic prediction power, as shown in Supplementary Figure S2. We therefore proposed that EMCN/MUC15 combination could be a potential prognostic signature for GC.

EMCN/MUC15 Correlated Genes Are Functionally Enriched in Cancer Related Processes

By using webserver cBioPortal, the top 50 EMCN- (Table 3) or MUC15- (Table 4) correlated genes were identified based on mRNA expression data of TCGA STAD samples, including the top 25 positively correlated genes and top 25 negatively correlated genes. It is noticeable that there is no intersection between the two top 50 gene lists at all and no co-expression between EMCN and MUC15 (Spearman’s Correlation of 0.0264 with p-value of 0.592) either, implying the functional complementarity between EMCN and MUC15 and thus the rationality of the combination of the two genes in predicting prognosis for GC.
TABLE 3

Top 50 genes correlated with EMCN based on TCGA STAD dataset.

Correlated geneCytobandSpearman correlationp-value
CYYR121q21.30.9314142.19E−183
MYCT16q25.20.9290441.90E−180
ERG21q22.20.8941793.19E−146
DIPK2BXp11.30.8875254.57E−141
ADGRL41p31.10.8867571.71E−140
CD341q32.20.8803836.99E−136
TEK9p21.20.8733974.03E−131
PECAM117q23.30.8716395.73E−130
S1PR11p21.20.8702244.72E−129
LDB24p15.320.8600928.59E−123
RHOJ14q23.20.8599131.10E−122
CLEC14A14q21.10.8542012.25E−119
GNG117q21.30.8530271.03E−118
EBF15q33.30.8462865.16E−115
MMRN210q23.20.8460057.29E−115
CLEC1A12p13.20.8434161.71E−113
CALCRL2q32.10.8415941.53E−112
LRRC705q12.10.840158.47E−112
MEF2C5q14.30.8393542.16E−111
ARHGEF1517p13.10.8360659.86E−110
CDH516q210.8284834.80E−106
PALMD1p21.20.8282835.97E−106
SHE1q21.30.8267923.01E−105
SPARCL14q22.10.8231211.52E−103
JAM221q21.30.8214428.85E−103
RAD54L1p34.1–0.539261.10E−32
CDCA511q13.1–0.536122.96E−32
PKP311p15.5–0.531081.41E−31
CDCA81p34.3–0.53031.79E−31
ZWINT10q21.1–0.528173.44E−31
KIF2C1p34.1–0.523391.46E−30
HJURP2q37.1–0.519824.21E−30
MCM23q21.3–0.518296.63E−30
CDT116q24.3–0.513692.54E−29
MYO1917q12–0.510586.24E−29
TONSL8q24.3–0.506841.82E−28
CCNA24q27–0.50562.58E−28
NCAPH2q11.2–0.50187.48E−28
POC1A3p21.2–0.501657.81E−28
NELFA4p16.3–0.501168.95E−28
UBE2T1q32.1–0.500261.15E−27
POLD27p13–0.499971.25E−27
DTL1q32.3–0.499671.35E−27
PTBP119p13.3–0.499591.38E−27
CNOT112q11.2–0.498711.76E−27
STIP111q13.1–0.497182.69E−27
MAP76q23.3–0.496313.41E−27
ESPL112q13.13–0.495913.81E−27
TBRG47p13–0.495484.29E−27
CDC25A3p21.31–0.494745.24E−27
TABLE 4

Top 50 genes correlated with MUC15 based on TCGA STAD dataset.

Correlated geneCytobandSpearman correlationp-value
ANO311p14.3-p14.20.5588791.82E−35
FSTL45q31.10.49593.82E−27
TMPRSS1311q23.30.4696093.76E−24
ZNF75017q25.30.4648981.21E−23
LGALS719q13.20.4544281.54E−22
NCCRP119q13.20.4523692.52E−22
PCLO7q21.110.4490545.50E−22
GABRA3Xq280.4467119.51E−22
DLX317q21.330.4436371.94E−21
LIN28B6q16.3-q210.4402434.21E−21
ADGRV15q14.30.4390285.55E−21
USH1G17q25.10.4366419.52E−21
C12ORF5612q14.20.4298494.32E−20
RSPO420p130.4288195.41E−20
SPAG171p120.4259921.00E−19
MARK11q410.4243531.43E−19
HTR2CXq230.4230441.90E−19
CT45A5Xq26.30.4207123.13E−19
PRPF40B12q13.120.4199943.64E−19
C3ORF673p14.20.4193764.16E−19
RIPPLY321q22.130.4174376.27E−19
CNGB38q21.30.4173986.32E−19
ATP6V0A47q340.4134521.45E−18
LINC009648q24.130.4125481.74E−18
VGLL1Xq26.30.4094633.30E−18
MCUB4q25–0.359853.92E−14
FAS10q23.31–0.327797.51E−12
IRF15q31.1–0.327328.08E−12
ZIC213q32.3–0.314025.99E−11
CDC42SE25q31.1–0.312437.55E−11
HK35q35.2–0.301983.37E−10
NUB17q36.1–0.300074.41E−10
GBP41p22.2–0.297336.45E−10
BBC319q13.32–0.297226.55E−10
AIM21q23.1-q23.2–0.297076.68E−10
NLRC516q13–0.296697.04E−10
MAX14q23.3–0.296427.30E−10
MTHFD114q23.3–0.294379.67E−10
AGAP212q14.1–0.290961.54E−09
IFNG12q15–0.290681.59E−09
RASSF13p21.31–0.287872.32E−09
GZMA5q11.2–0.286962.62E−09
CCL417q12–0.285153.32E−09
MAT2B5q34–0.282314.82E−09
FCGR3A1q23.3–0.282264.85E−09
THG1L5q33.3–0.282074.97E−09
TK216q21–0.282025.01E−09
PRKXXp22.33–0.277728.71E−09
JAK29p24.1–0.277528.94E−09
EEF219p13.3–0.27749.07E−09
Top 50 genes correlated with EMCN based on TCGA STAD dataset. Top 50 genes correlated with MUC15 based on TCGA STAD dataset. We then performed functional enrichment analysis with the two top 50 correlated genes as a whole. GSEA identified a total of 22 GO terms (Figure 3 and Supplementary Table S1). Among them, the most significant pathways were associated with vasculature development, such as glomerulus vasculature development and renal system vasculature development. Some enriched pathways are associated with mitosis, such as mitotic sister chromatid segregation and mitotic metaphase plate congression. Some pathways were associated with immunity, such as inflammatory cell apoptotic process and response to interferon-gamma. The other enriched pathways were involved in DNA binding, cell cycle phase transition, cell polarity, phosphatase activity, and side of plasma membrane (Figure 3 and Supplementary Table S1). These indicate that genes correlated with EMCN and MUC15 in GC tend to be enriched in cancer related processes, such as vasculature development, mitosis, and immunity.
FIGURE 3

Go terms derived from gene set enrichment analysis (GSEA) for top 50 EMCN- and MUC15-correlated genes. The pathways are ranked by -log p-value. The 95% confidence intervals for survival time were shown as dotted lines in the Kaplan–Meier survival plot.

Go terms derived from gene set enrichment analysis (GSEA) for top 50 EMCN- and MUC15-correlated genes. The pathways are ranked by -log p-value. The 95% confidence intervals for survival time were shown as dotted lines in the Kaplan–Meier survival plot.

Discussion

In the present study, by systematically analyzing mRNA expression and clinical data of TCGA digestive cancer samples and GEO GC samples, we propose MUC15 as a promising candidate for prognosis prediction of digestive cancers, and EMCN/MUC15 combination as a potential prognostic signature for GC. Gene signature identification is essentially a process of dimension reduction of high dimensional data. On one hand, a signature involving less features or genes obviously has more practicality; on the other hand, a signature is also expected to have sufficient interpretability, although it is far from achieved. In this sense, a good signature is supposed to consist of orthogonal or mutually exclusive features which are able to hold a testable hypothesis from a systematic viewpoint while also sustaining the robustness and reliability of the signature. However, most current efforts in this field focus on reducing dimension over enhancing explanatory power of the signature. In our work, although EMCN and MUC15 coding genes belong to the same gene family, it is noted that there is no expression correlation between the two genes and no intersection between their top 50 correlated genes, implying the orthogonality and functional complementarity between EMCN and MUC15. As we expected, the combination of EMCN/MUC15 shows more robust prognostic power than the individual genes in GC according to the testing result implemented on an independent dataset GSE84437. These observations not only support the rationality of the combination of the two genes in predicting prognosis, but also indicate the explanatory power of EMCN/MUC15 signature, which is supposed to play an important role in the robustness improvement. EMCN, i.e. MUC14, encodes a membrane-bound protein, endothelial sialomucin or mucin-like sialo glycoprotein, which was reported to inhibit cell and extracellular matrix interaction, interfere with leukocyte-endothelial cell adhesion, and even promote the peritoneal metastasis process of GC cells (Liu et al., 2001; Zahr et al., 2016; Dhanisha et al., 2018; Bao et al., 2019). Among the 22 enriched functions for top 50 EMCN-correlated genes and top 50 MUC15-correlated genes, the most significant one is glomerulus vasculature development that is associated with four EMCN/MUC15 correlated genes including CD34, TEK, PECAM1, and IFNG (Tables 3, 4 and Supplementary Table S1). After carefully checking functional annotations of the four genes, we focused on two cancer relevant genes, CD34 and PECAM1. Both genes are significantly coexpressed with EMCN with correlation coefficients of 0.880 and 0.871, respectively (Table 3). CD34, a marker of vascular endothelial cells, is capable of supporting cell adhesion by increasing surface expression (Nielsen and McNagny, 2008). PECAM1, also known as CD31, encodes platelet endothelial cell adhesion molecule 1 that is necessary for leukocyte transendothelial migration (TEM) (Dasgupta et al., 2009). It is noteworthy that EMCN/COL4A5/CCL11 combination was very recently reported as a prognostic signature for diffuse type GC (Bao et al., 2019). In our study, among MUC family members, EMCN exhibits the strongest correlation with survival for GC. Taken together, EMCN may play crucial roles in tumorigenesis and progression of GC via cell adhesion and TEM of lymphocytes. MUC15 also encodes a membrane-bound protein, which could promote cell proliferation, cell-extracellular matrix adhesion, colony forming ability, and invasion in colon cancer cells (Huang et al., 2009). Its overexpression is significantly correlated with diverse cancers (Pallesen et al., 2002; Shyu et al., 2007; Huang et al., 2009; Nam et al., 2011; Wang et al., 2013; Choi et al., 2018). However, it was also found that the expression of MUC15 decreased in hepatocellular carcinoma cells and negatively regulated metastasis of hepatocellular carcinoma (Wang et al., 2013). This suggests that MUC15 may perform diverse functions in tumorigenesis and progression. In our study, MUC15 displays the strongest correlation among the MUC family with survival for digestive cancers and MUC15 overexpression seems to be a promising candidate for a prognosis biomarker of digestive cancers. Combined with EMCN, the two genes provide a potential prognostic signature for GC and show more robustness in the prognostic prediction power than individual genes. As far as we know, the association of MUC15 with GC is rarely reported. In summary, we propose EMCN/MUC15 combination as a prognostic signature with mechanistic interpretability. It not only possesses prognostic capability for GC, but also offers clues for further exploring systematic mechanisms of carcinogenesis of GC and other digestive cancers.

Data Availability Statement

Publicly available datasets were analyzed in this study. This data can be found here: TCGA-STAD, TCGA-COAD, TCGA-ESCA, TCGA-LIHC, TCGA-PAAD, and GSE84437.

Author Contributions

Y-YL and WD designed the study. WD and JL implemented the data analysis. BL, QL, and QS provided the valuable suggestions. JL and WD drafted the manuscript. Y-YL revised the manuscript and coordinated the study. All authors read and approved the final manuscript.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  10 in total

1.  PBX1, EMCN and ERG are associated with the sub-clusters and the prognosis of VHL mutant clear cell renal cell carcinoma.

Authors:  Haiwei Wang; Xinrui Wang; Liangpu Xu; Ji Zhang
Journal:  Sci Rep       Date:  2022-05-27       Impact factor: 4.996

2.  Unsupervised Hierarchical Clustering of Pancreatic Adenocarcinoma Dataset from TCGA Defines a Mucin Expression Profile that Impacts Overall Survival.

Authors:  Nicolas Jonckheere; Julie Auwercx; Elsa Hadj Bachir; Lucie Coppin; Nihad Boukrout; Audrey Vincent; Bernadette Neve; Mathieu Gautier; Victor Treviño; Isabelle Van Seuningen
Journal:  Cancers (Basel)       Date:  2020-11-09       Impact factor: 6.639

3.  Quantitative assessment of the diagnostic role of mucin family members in pancreatic cancer: a meta-analysis.

Authors:  Shunda Wang; Lei You; Menghua Dai; Yupei Zhao
Journal:  Ann Transl Med       Date:  2021-02

4.  Downregulation of MUC15 by miR-183-5p.1 promotes liver tumor-initiating cells properties and tumorigenesis via regulating c-MET/PI3K/AKT/SOX2 axis.

Authors:  Tao Han; Hao Zheng; Jin Zhang; Pinghua Yang; Hengyu Li; Zhangjun Cheng; Daimin Xiang; Ruoyu Wang
Journal:  Cell Death Dis       Date:  2022-03-02       Impact factor: 8.469

5.  MUC14-Related ncRNA-mRNA Network in Breast Cancer.

Authors:  Shuqian Wang; Jing Jin; Jing Chen; Weiyang Lou
Journal:  Genes (Basel)       Date:  2021-10-23       Impact factor: 4.096

6.  Expression of Mucin Family Proteins in Non-Small-Cell Lung Cancer and its Role in Evaluation of Prognosis.

Authors:  Jing Tu; Min Tang; Guoqing Li; Liang Chen; Yubo Wang; Yong Huang
Journal:  J Oncol       Date:  2022-08-26       Impact factor: 4.501

7.  The mucin protein MUCL1 regulates melanogenesis and melanoma genes in a manner dependent on threonine content.

Authors:  J Kim; H Choi
Journal:  Br J Dermatol       Date:  2021-11-24       Impact factor: 11.113

8.  Loss of endothelial EMCN drives tumor lung metastasis through the premetastatic niche.

Authors:  Guoxin Zhang; Mengyuan Li; Dandan Zhou; Xingjiu Yang; Wenlong Zhang; Ran Gao
Journal:  J Transl Med       Date:  2022-10-02       Impact factor: 8.440

9.  Construction of mRNA prognosis signature associated with differentially expressed genes in early stage of stomach adenocarcinomas based on TCGA and GEO datasets.

Authors:  Fuquan Jiang; Haiguan Lin; Hongfeng Yan; Xiaomin Sun; Jianwu Yang; Manku Dong
Journal:  Eur J Med Res       Date:  2022-10-17       Impact factor: 4.981

10.  High Expression of MUC15 Is Correlated with Poor Prognosis of Pancreatic Cancer and Promotes Migration, Invasion, and Chemo-Resistance In Vitro.

Authors:  Shunda Wang; Junjie Li; Lei You; Menghua Dai; Yupei Zhao
Journal:  Med Sci Monit       Date:  2020-10-14
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.