| Literature DB >> 32952640 |
Fei-Yu Chen1, Cheng Zhou1, Xiang-Yu Zhang1, Kai-Qian Zhou1, Yuan-Fei Peng1, Lei Yu1, Jia Fan1,2,3,4, Jian Zhou1,2,3,4, Jie Hu1, Zheng Wang1.
Abstract
Intrahepatic cholangiocarcinoma (ICC) is a cancer type with high malignancy and a current lack of biomarkers to predict recurrence. In the present study, to identify potential biomarkers, five ICC datasets from the Gene Expression Omnibus database were analyzed to construct initial datasets by using a robust rank aggregation approach. A total of 19 upregulated genes were identified in the initial datasets. The genes identified were then further analysed using data from The Cancer Genome Atlas. Only mucin 1 (MUC1) exhibited significance regarding differential expression and survival prediction. Finally, the expression levels of MUC1 were assessed using reverse transcription-quantitative PCR in 61 pairs of ICC tumor and matched non-cancerous samples. The expression of MUC1 was significantly elevated in ICC tissues compared with that in matched non-cancerous counterparts (P=0.001). Patients with high MUC1 expression levels had significantly shorter overall survival (OS, P=0.009) and recurrence-free survival (RFS, P=0.012). MUC1 was identified as an independent prognostic factor for OS [hazard ratio (HR)=2.364, 95%CI: 1.214-4.485; P=0.023] and RFS (HR=2.552 95%CI: 1.294-5.032; P=0.007) in the multivariate analysis. Using receiver operating characteristic analysis, a co-index including MUC1 had a high accuracy for predicting survival [MUC1 combined with serum levels of CEA and cancer antigen 19-9, and lymph node metastasis, area under curve (AUC)=0.746, 95%CI: 0.620-0.872] and recurrence (MUC1 combined with bile duct invasion and lymph node metastasis, AUC=0.729, 95%CI: 0.605-854). In conclusion, MUC1 is highly expressed in ICC tissue and is a potential prognostic biomarker and therapeutic target for ICC. Copyright: © Chen et al.Entities:
Keywords: Bioinformatics analysis; gene dysregulation; intrahepatic cholangiocarcinoma; mucin 1
Year: 2020 PMID: 32952640 PMCID: PMC7480127 DOI: 10.3892/etm.2020.9178
Source DB: PubMed Journal: Exp Ther Med ISSN: 1792-0981 Impact factor: 2.447
Features of the Gene Expression Omnibus datasets included in the present analysis.
| Series accession | First author (year) | Region | Array type | Tumor samples (n) | Type (number) of control samples | Upregulated genes (n) | Downregulated genes (n) | (Refs.) |
|---|---|---|---|---|---|---|---|---|
| GSE89749 | Jusakul (2017) | Asia | Illumina HumanHT-12 V4.0 expression beadchip | 118 | Non-tumor bile duct from the same patients as the tumor tissue (n=2) | 18 | 767 | ( |
| GSE76297 | Chaisaingmongkol (2017) | North America | [HTA-2_0] Affymetrix Human Transcriptome Array 2.0 [transcript (gene) version] | 91 | Matched peritumoral liver tissue (n=91) | 46 | 242 | ( |
| GSE57555 | Murakami (2015) | Asia | Agilent-039494 SurePrint G3 Human GE v2 8x60K Microarray 039381 | 16 | Matched peritumoral liver tissue (n=16) | 14 | 17 | ( |
| GSE32879 | Oishi (2012) | North America | [HuGene-1_0-st] Affymetrix Human Gene 1.0 ST Array [transcript (gene) version] | 16 | Paired non-tumor tissue from the same patients as the tumor tissue (n=7) | 159 | 534 | ( |
| GSE26566 | Andersen (2012) | Europe | Illumina human Ref-8 v2.0 expression beadchip | 104 | Matched peritumoral liver tissue (59) | 4,889 | 3,801 | ( |
Figure 1MUC1 is upregulated in ICC tissue, as validated using databases. (A) The 19 upregulated genes and the 19 most significant downregulated genes linked to cholangiocarcinoma identified using robust rank aggregation. Numbers in the colored fields indicate the log2-transformed fold change. 0.00 or white boxes indicates that the gene was absent from the respective dataset. (B) Upregulated genes linked to cholangiocarcinoma identified using the edgeR package (TCGA dataset). The enlarged gene symbols adjacent to the heatmap are the 13 intersected genes. (C) 13 genes were identified to be upregulated in the RRA integrated datasets and the TCGA data. (D and E) The expression levels of (D) the 13 genes and (E) MUC1 were significantly higher in ICC tissue, validated using TCGA data. (F) Among the 13 genes, only MUC1 predicted OS. RRA-upsig refers to the significantly upregulated genes identified using RRA; TCGA-upsig refers to significantly upregulated genes among the TCGA data. ***P<0.001; ****P<0.0001. MUC1, mucin 1; TCGA, The Cancer Genome Atlas; ICC, intrahepatic cholangiocarcinoma; OS, overall survival; RRA, robust rank aggregation; PDZK1IP1, PDZK1 Interacting Protein 1; MMP7, matrix metalloproteinase 7; CEACAM6, carcinoembryonic antigen related adhesion molecules 6; MUC1, mucin 1; CEACAM7, carcinoembryonic antigen related adhesion molecules 7; KRT19, keratin 19; LAMC2, laminin subunit gamma-2; SPP1, secreted phosphoprotein 1; RAB25, Ras Genes from Brain Protein 25; UBD, ubiquitin D; DCDC2, doublecortin domain containing 2; EPCAM, epithelial cell adhesion molecule; MMP11, matrix metalloproteinase 11.
The 19 upregulated genes and the 19 most significant downregulated genes associated with cholangiocarcinoma, identified using robust rank aggregation.
| A, Upregulated | |||
|---|---|---|---|
| Gene symbol | P-value | Adjusted P-value | logFC |
| COL1A1 | 4.14x10-11 | 2.22x10-6 | 3.147 |
| PDZK1IP1 | 1.08x10-9 | 5.82x10-5 | 2.093 |
| MMP7 | 2.16x10-9 | 1.16x10-4 | 2.864 |
| SPP1 | 2.99x10-9 | 1.61x10-4 | 3.904 |
| CEACAM6 | 5.02x10-9 | 2.70x10-4 | 2.195 |
| UBD | 1.26x10-8 | 6.79x10-4 | 2.610 |
| RAB25 | 1.50x10-8 | 8.06x10-4 | 2.208 |
| MUC1 | 2.06x10-8 | 1.11x10-3 | 2.284 |
| CEACAM7 | 2.74x10-8 | 1.47x10-3 | 2.174 |
| SULF1 | 3.78x10-8 | 2.03x10-3 | 2.511 |
| KRT7 | 4.01x10-8 | 2.16x10-3 | 2.040 |
| KRT19 | 4.26x10-8 | 2.29x10-3 | 2.819 |
| MMP11 | 4.89x10-8 | 2.63x10-3 | 2.526 |
| FAP | 5.34x10-8 | 2.87x10-3 | 2.099 |
| DCDC2 | 5.95x10-8 | 3.20x10-3 | 3.545 |
| EPCAM | 6.92x10-8 | 3.72x10-3 | 2.401 |
| COL1A2 | 1.36x10-7 | 7.34x10-3 | 2.544 |
| VCAN | 1.56x10-7 | 8.37x10-3 | 2.353 |
| LAMC2 | 1.83x10-7 | 9.86x10-3 | 2.228 |
| B, Downregulated | |||
| Gene symbol | P-value | Adjusted P-value | logFC |
| TAT | 2.57x10-12 | 1.38x10-7 | -3.201 |
| MT1M | 1.18x10-11 | 6.32x10-7 | -3.387 |
| MT1G | 2.20x10-11 | 1.18x10-6 | -3.279 |
| FBP1 | 7.71x10-11 | 4.15x10-6 | -2.892 |
| BHMT | 8.03x10-11 | 4.32x10-6 | -3.980 |
| METTL7A | 8.36x10-11 | 4.49x10-6 | -2.438 |
| ADH1C | 8.58x10-11 | 4.62x10-6 | -3.404 |
| MT1H | 8.81x10-11 | 4.74x10-6 | -2.657 |
| ALDOB | 1.03x10-10 | 5.54x10-6 | -3.811 |
| MTTP | 1.42x10-10 | 7.62x10-6 | -3.601 |
| MT1X | 2.89x10-10 | 1.56x10-5 | -2.816 |
| APOC3 | 3.31x10-10 | 1.78x10-5 | -3.859 |
| SEPP1 | 4.31x10-10 | 2.32x10-5 | -2.490 |
| MT1E | 6.40x10-10 | 3.44x10-5 | -2.646 |
| OTC | 7.06x10-10 | 3.79x10-5 | -3.225 |
| ADH4 | 7.06x10-10 | 3.80x10-5 | -4.379 |
| APOF | 7.06x10-10 | 3.80x10-5 | -3.828 |
| FOLH1 | 7.43x10-10 | 3.99x10-5 | -2.194 |
| GSTA2 | 7.95x10-10 | 4.28x10-5 | -3.152 |
P-values were adjusted using Bonferroni correction. LogFC, log2-transformed fold change. COL1A1, collagen type I alpha 1 chain; PDZK1IP1, PDZK1 Interacting Protein 1; MMP7, matrix metalloproteinase 7; SPP1, secreted phosphoprotein 1; CEACAM6, carcinoembryonic antigen related adhesion molecules 6; UBD, ubiquitin D; RAB25, Ras Genes from Brain Protein 25; MUC1, mucin 1; CEACAM7, carcinoembryonic antigen related adhesion molecules 7; SULF1, Sulfatase 1; KRT7, keratin 7; KRT19, keratin 19; MMP11, matrix metalloproteinase 11; FAP, Fibroblast Activation Protein Alpha; DCDC2, doublecortin domain containing 2; EPCAM, epithelial cell adhesion molecule; COL1A2, collagen type I alpha 2 chain; VCAN, versican; LAMC2, laminin subunit gamma-2. TAT, tyrosine aminotransferase; MT1M, metallothionein 1M; MT1G, metallothionein 1G; FBP1, fructose-bisphosphatase 1; BHMT, betaine-homocysteine S-methyltransferase; METTL7A, methyltransferase like 7A; ADH1C, alcohol dehydrogenase 1C (Class I), gamma polypeptide; MT1H, metallothionein 1H; ALDOB, aldolase, fructose-bisphosphate B; MTTP, microsomal triglyceride transfer protein; MT1X, metallothionein 1X; APOC3, apolipoprotein C3; SEPP1, selenop; MT1E, metallothionein 1E; OTC, ornithine carbamoyltransferase; ADH4, alcohol dehydrogenase 4 (Class II), Pi Polypeptide; APOF, apolipoprotein F; FOLH1, folate hydrolase 1; GSTA2, glutathione S-transferase alpha 2.
Clinical characteristics of enrolled patients.
| Parameter | All patients (n=61) | MUC1low (n=30) | MUC1high (n=31) | P-value[ |
|---|---|---|---|---|
| Age (years) | 59 (39-77) | 57 (39-72) | 59 (40-77) | 0.86 |
| Gender | 0.21 | |||
| Male | 40 (65.57) | 22 (73.33) | 18 (58.06) | |
| Female | 21 (34.43) | 8 (26.67) | 13 (41.94) | |
| Number of tumors | 0.85 | |||
| Single | 42 (68.85) | 24 (80.00) | 18 (58.06) | |
| Multiple | 19 (31.15) | 6 (20.00) | 13 (41.94) | |
| Tumor size (cm) | 6 (2-14) | 6 (2-11.5) | 6 (2.5-14) | 0.79 |
| Vascular invasion | 0.28 | |||
| Yes | 16 (26.23) | 6 (20.00) | 10 (32.26) | |
| No | 45 (73.77) | 24 (80.00) | 21 (67.74) | |
| Bile duct invasion | 0.08 | |||
| Yes | 6 (9.84) | 5 (16.67) | 1 (3.23) | |
| No | 55 (90.16) | 25 (83.33) | 30 (96.77) | |
| Nerve invasion | 0.53 | |||
| Yes | 3 (4.92) | 2 (6.67) | 1 (3.23) | |
| No | 58 (95.08) | 28 (93.33) | 30 (96.77) | |
| T stage | 0.16 | |||
| T1a | 14 (22.95) | 9 (30.00) | 5 (16.13) | |
| T1b | 17 (27.87) | 8 (26.67) | 9 (29.03) | |
| T2 | 26 (42.62) | 13 (43.33) | 13 (41.94) | |
| T3 | 0 (0.00) | 0 (0.00) | 0 (0.00) | |
| T4 | 4 (6.56) | 0 (0.00) | 4 (12.90) | |
| LN metastasis | 0.53 | |||
| Yes | 3 (4.92) | 2 (6.67) | 1 (3.23) | |
| No | 58 (95.08) | 28 (93.33) | 30 (96.77) | |
| Surrounding tissue invasion | 0.04 | |||
| Yes | 4 (6.56) | 0 (0.00) | 4 (12.90) | |
| No | 57 (93.44) | 30 (100.00) | 27 (87.10) | |
| Differentiation | 0.11 | |||
| Well | 50 (81.97) | 27 (90.00) | 23 (74.19) | |
| Poor | 11 (18.03) | 3 (10.00) | 8 (25.81) | |
| Type of resection | 0.29 | |||
| Less than hemihepatectomy | 46 (75.41) | 21 (70.00) | 25 (80.65) | |
| Hemihepatectomy | 13 (21.31) | 7 (23.33) | 6 (19.35) | |
| Extended hemihepatectomy | 2 (3.28) | 2 (6.67) | 0 (0.00) | |
| Post-operative treatment | 0.311 | |||
| Yes | 9 (14.75) | 3 (10.00) | 6 (19.35) | |
| No | 42 (68.85) | 27 (90.00) | 25 (80.65) |
Values are expressed as the median (range) or n (%). Vascular invasion, bile duct invasion and nerve invasion were identified pathologically. Post-operative treatment includes transarterial chemoembolization or radiofrequency ablation. LN, lymph node; MUC1, mucin 1.
aP-values represent comparison between MUC1low group and MUC1high group.
Figure 2Clinical validation of MUC1 expression and its prognostic significance. (A) The levels of MUC1 were significantly greater in the ICC tumor tissues compared with the matched paratumor tissues. (B) MUC1 was also significantly higher in tumors with surrounding tissue invasion, compared with the tumors that did not invade. All data were presented as the mean + standard deviation. (C and D) MUC1 was a prognostic predictor for (C) OS and (D) RFS. MUC1low refers to the group with low expression of MUC1 in ICC tissue and MUC1high to the group with high expression of MUC1 in ICC tissue. MUC1, mucin 1; ICC, intrahepatic cholangiocarcinoma; OS, overall survival; RFS, recurrence-free survival.
Univariate and multivariate analysis of prognostic factors for overall survival.
| Univariate | Multivariate | |||
|---|---|---|---|---|
| Prognostic factor | HR (95%CI) | P-value | HR (95%CI) | P-value |
| MUC1 (≥0.056 vs. <0.056, ratio to GAPDH) | 2.233 (1.214-4.485) | 0.011 | 2.364 (1.127-4.960) | 0.023 |
| CEA (≥20 ng/ml vs. <20 ng/ml) | 2.197 (1.106-4.365) | 0.025 | 2.315 (1.081-4.958) | 0.031 |
| CA-19-9 (≥37 ng/ml vs. <37 ng/ml) | 2.675 (1.368-5.231) | 0.004 | 2.303 (1.109-4.780) | 0.025 |
| Differentiation (poor vs. well) | 2.284 (1.102-4.736) | 0.026 | 1.102 (0.485-2.502) | 0.816 |
| Tumor diameter (>5 cm vs. ≤5 cm) | 2.236 (1.110-4.503) | 0.024 | 1.461 (1.127-4.960) | 0.320 |
| LN metastasis (yes vs. no) | 4.440 (1.916-10.288) | 0.001 | 2.846 (1.259-6.436) | 0.012 |
| T stage (T2-4 vs. T1) | 1.435 (0.995-2.070) | 0.053 | ||
| Surrounding tissue invasion (yes vs. no) | 2.612 (0.797-8.557) | 0.113 | ||
| Tumor number (solitary vs. multiple) | 1.603 (0.838-3.064) | 0.154 | ||
| Nerve invasion (yes vs. no) | 1.117 (0.268-4.645) | 0.879 | ||
| Vascular invasion (yes vs. no) | 0.798 (0.378-1.681) | 0.552 | ||
| Bile duct invasion (presence vs. absence) | 1.995 (0.762-5.018) | 0.163 | ||
LN, lymph node; HR, hazard ratio; CA, cancer antigen.
Univariate and multivariate analysis of prognostic factors for recurrence-free survival.
| Univariate | Multivariate | |||
|---|---|---|---|---|
| Prognostic factor | HR (95%CI) | P-value | HR (95%CI) | P-value |
| MUC1 (≥0.056 vs. <0.056, ratio to GAPDH) | 2.187 (1.171-4.083) | 0.014 | 2.552 (1.294-5.032) | 0.007 |
| Bile duct invasion (yes vs. No) | 2.445 (1.022-5.851) | 0.045 | 3.979 (1.449-10.928) | 0.007 |
| Tumor diameter (>5 cm vs. ≤5 cm) | 1.977 (1.026-3.810) | 0.042 | 1.130 (0.509-2.510) | 0.764 |
| T stage (T2-4 vs. T1) | 1.468 (1.039-2.073) | 0.029 | 1.403 (0.931-2.115) | 0.106 |
| LN metastasis (yes vs. no) | 2.449 (1.256-4.776) | 0.009 | 2.053 (1.042-4.046) | 0.038 |
| Surrounding tissue invasion (yes vs. no) | 2.045 (0.628-6.656) | 0.235 | ||
| CEA (≥20 ng/ml vs. <20 ng/ml) | 1.108 (10.567-2.167) | 0.764 | ||
| CA-19-9 (≥37 ng/ml vs. <37 ng/ml) | 0.828 (0.452-1.516) | 0.540 | ||
| Differentiation (poor vs. well) | 1.930 (0.940-3.960 | 0.073 | ||
| Tumor number (solitary vs. multiple) | 1.504 (0.798-2.836) | 0.207 | ||
| Nerve invasion (yes vs. no) | 2.217 (0.679-7.239) | 0.187 | ||
| Vascular invasion (yes vs. no) | 0.971 (0.488-1.933) | 0.933 | ||
LN, lymph node; HR, hazard ratio; CA, cancer antigen.
Figure 3ROC curves for OS and RFS. (A) ROC curve for OS. (B) ROC curve for RFS. LNM, lymph node metastasis; BI, bile duct invasion; CA19-9, cancer antigen 19-9; ROC, receiver operating characteristic; MUC1, mucin 1; OS, overall survival; RFS, recurrence-free survival.
Receiver operating characteristic analysis for OS and RFS.
| A, OS | |
|---|---|
| Parameter | AUC (95%CI) |
| MUC1 | 0.623 (0.476-0.769) |
| CEA | 0.509 (0.357-0.660) |
| CA19-9 | 0.734 (0.600-0.867) |
| LNM | 0.538 (0.388-0.689) |
| Combined Co-index 1 | 0.746 (0.620-0.872) |
| B, RFS | |
| Parameter | AUC (95%CI) |
| MUC1 | 0.622 (0.480-0.763) |
| BI | 0.571 (0.423-0.720) |
| LNM | 0.536 (0.382-0.689) |
| Combined Co-index 2 | 0.729 (0.605-0.854) |
The status of OS and RFS used to calculate the AUC after the very last follow-up appointment (2016.06.30). LNM refers to the presence of lymph node metastasis; BI refers to the presence of bile duct invasion. Combined co-index 1, MUC1 + CEA+ CA19-9 + LNM; Combined Co-index 2, MUC1 + BI + LNM. MUC1, mucin 1; OS, overall survival; RFS, recurrence-free survival; CA, cancer antigen.