| Literature DB >> 33178362 |
Xiangchou Yang1, Liping Chen2, Yuting Mao3, Zijing Hu4, Muqing He1.
Abstract
The role of an extracellular matrix- (ECM-) receptor interaction signature has not been fully clarified in gastric cancer. This study performed comprehensive analyses on the differentially expressed ECM-related genes, clinicopathologic features, and prognostic application in gastric cancer. The differentially expressed genes between tumorous and matched normal tissues in The Cancer Genome Atlas (TCGA) and validation cohorts were identified by a paired t-test. Consensus clusters were built to find the correlation between clinicopathologic features and subclusters. Then, the least absolute shrinkage and selection operator (lasso) method was used to construct a risk score model. Correlation analyses were made to reveal the relation between risk score-stratified subgroups and clinicopathologic features or significant signatures. In TCGA (26 pairs) and validation cohort (134 pairs), 25 ECM-related genes were significantly highly expressed and 11 genes were downexpressed in gastric cancer. ECM-based subclusters were slightly related to clinicopathologic features. We constructed a risk score model = 0.081∗log2 (CD36) + 0.043∗log2 (COL5A2) + 0.001∗log2 (ITGB5) + 0.039∗log2 (SDC2) + 0.135∗log2 (SV2B) + 0.012∗log2 (THBS1) + 0.068∗log2 (VTN) + 0.023∗log2 (VWF). The risk score model could well predict the outcome of patients with gastric cancer in both training (n = 351, HR: 1.807, 95% CI: 1.292-2.528, P = 0.00046) and validation (n = 300, HR: 1.866, 95% CI: 1.347-2.584, P = 0.00014) cohorts. Besides, risk score-based subgroups were associated with angiogenesis, cell adhesion molecules, complement and coagulation cascades, TGF-beta signaling, and mismatch repair-relevant signatures (P < 0.0001). By univariate (1.845, 95% CI: 1.382-2.462, P < 0.001) and multivariate (1.756, 95% CI: 1.284-2.402, P < 0.001) analyses, we regarded the risk score as an independent risk factor in gastric cancer. Our findings revealed that ECM compositions became accomplices in the tumorigenesis, progression, and poor survival of gastric cancer.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33178362 PMCID: PMC7647771 DOI: 10.1155/2020/8816070
Source DB: PubMed Journal: Dis Markers ISSN: 0278-0240 Impact factor: 3.434
Genes of researched signatures.
| Gene signature | Gene | Source |
|---|---|---|
| ECM-receptor interaction | GP1BA, COL6A2, COL6A3, GP1BB, COL5A2, COL6A1, LAMA1, VWF, HSPG2, TNN, FN1, ITGA9, GP9, COMP, IBSP, CD36, CHAD, GP5, VTN, THBS4, ITGA4, ITGA3, ITGA2B, ITGA7, ITGA5, COL5A1, COL4A6, ITGA11, SV2C, COL2A1, COL3A1, COL4A1, AGRN, COL4A2, COL4A4, ITGB3, ITGB4, RELN, ITGB5, ITGB6, ITGB7, LAMC2, ITGAV, ITGB1, LAMB2, SPP1, LAMB3, LAMC1, COL1A1, LAMA4, LAMA5, LAMB1, COL1A2, ITGA10, GP6, ITGA8, LAMB4, TNR, CD47, SV2A, CD44, DAG1, TNXB, LAMA3, LAMA2, SDC3, ITGB8, ITGA6, ITGA2, ITGA1, SV2B, TNC, COL11A1, LAMC3, COL11A2, HMMR, SDC2, SDC4, COL5A3, THBS3, COL6A6, THBS2, SDC1, THBS1 | KEGG hsa04512 |
| Cell adhesion molecules (CAMs) | CDH5, JAM3, CDH3, NLGN3, CDH4, CD80, NLGN1, CD86, CD28, CD274, PDCD1LG2, ITGA9, ITGAL, NRCAM, ITGAM, CD34, CD276, ICOSLG, CADM3, ITGA4, ICOS, SIGLEC1, CADM1, HLA-G, CLDN20, PECAM1, CD22, ITGB7, SELL, VCAM1, ITGAV, SELP, SPN, ITGB1, SELPLG, ITGB2, CDH2, JAM2, CTLA4, HLA-DRB4, CDH15, CLDN18, CD4, HLA-DRB5, CNTN1, NLGN2, HLA-DRB3, NRXN3, ALCAM, SELE, CD8A, CD8B, CD6, NLGN4X, CLDN17, L1CAM, ITGB8, MAG, VCAN, HLA-F, NFASC, HLA-E, NRXN1, HLA-DPA1, HLA-DPB1, CLDN9, GLG1, HLA-DQA1, HLA-DQA2, HLA-DQB1, NRXN2, CD2, CLDN16, CLDN23, MADCAM1, SDC2, SDC4, CLDN14, CD40, SDC1, OCLN, PVR, HLA-DRB1, NECTIN2, CDH1, HLA-DRA, NECTIN1, HLA-DOA, HLA-DOB, CLDN10, CNTNAP1, ICAM2, ICAM3, CLDN8, CLDN2, CLDN6, CLDN5, CLDN1, ICAM1, NEO1, HLA-C, HLA-B, ESAM, CD40LG, PTPRM, HLA-DMB, HLA-DMA, HLA-A, F11R, PDCD1, CLDN19, PTPRF, CLDN15, CD226, CD99, CLDN22, CNTN2, ITGA6, CNTNAP2, MPZ, MPZL1, PTPRC, NECTIN3, ITGA8, NCAM2, NCAM1, CD58, NEGR1, CLDN11, SDC3, CLDN3, CLDN7, CLDN4 | KEGG hsa04514 |
| Complement and coagulation cascades | F2, F2R, VWF, KNG1, FGB, PLAT, SERPIND1, MBL2, F3, F5, SERPINA1, PLAUR, F7, PLAU, F10, F9, C1S, TFPI, F8, KLKB1, CR2, MASP1, C9, A2M, CR1, F12, F13A1, F11, CFI, SERPING1, MASP2, THBD, C1QC, C8A, F13B, C7, C8G, C8B, CD59, SERPINA5, FGG, CD55, C6, C5AR1, C5, BDKRB1, CFD, C1QA, C1R, C4BPB, C4BPA, C4B, C4A, BDKRB2, CFH, CFB, CPB2, CD46, PROS1, SERPINF2, PROC, C3, C1QB, C3AR1, FGA, SERPINE1, PLG, C2, SERPINC1 | KEGG hsa04610 |
| TGF-beta signaling pathway | TFDP1, NOG, TNF, GDF7, INHBB, INHBC, COMP, INHBA, THBS4, RHOA, CREBBP, ROCK1, INHBE, THBS2, DCN, ID1, ID2, RPS6KB1, RPS6KB2, CUL1, ID4, SMAD3, MAPK3, RBL2, SMAD4, RBL1, NODAL, THBS1, THBS3, SP1, SMAD1, MYC, SMAD2, MAPK1, SMURF2, SMURF1, EP300, BMP8A, GDF5, SKP1, CHRD, ZFYVE16, BMP6, BMP5, E2F4, TGFB2, TGFB1, IFNG, CDKN2B, PPP2CB, PPP2CA, PPP2R1A, ID3, SMAD5, RBX1, FST, PITX2, ZFYVE9, BMP7, PPP2R1B, TGFBR2, AMHR2, LTBP1, LEFTY1, AMH, TGFBR1, SMAD9, LEFTY2, SMAD7, ROCK2, BMP8B, ACVR1C, TGFB3, SMAD6, BMPR2, GDF6, BMPR1A, BMPR1B, ACVRL1, ACVR2B, ACVR2A, ACVR1, BMP4, E2F5, BMP2 | KEGG hsa04530 |
| Base excision repair | NEIL2, MPG, SMUG1, XRCC1, POLE4, HMGB1, POLE3, POLD4, MBD4, OGG1, UNG, POLD3, MUTYH, PARP1, LIG1, PCNA, NEIL1, POLE2, PARP4, PARP3, PARP2, POLB, APEX1, POLL, POLD1, POLD2, POLE, NEIL3, FEN1, TDG, APEX2, LIG3, HMGB1P1, NTHL1, HMGB1P40 | KEGG hsa03410 |
| DNA replication | DNA2, POLE4, POLE3, PRIM1, PRIM2, POLD4, RFC4, RFC5, RPA1, POLA1, RPA3, POLD3, LIG1, SSBP1, FEN1, RNASEH2B, RPA2, PCNA, RPA4, RNASEH1, RNASEH2C, MCM4, POLE2, MCM3, MCM6, MCM5, POLA2, FEN1, MCM2, MCM7, POLD1, POLD2, RNASEH2A, POLE, RFC1, RFC3, RFC2 | KEGG hsa03030 |
| Nucleotide excision repair | MNAT1, POLE4, ERCC4, POLE3, ERCC3, ERCC6, ERCC5, GTF2H5, POLD4, ERCC2, RFC4, CETN2, RFC5, GTF2H3, RPA1, RAD23B, RBX1, DDB2, RPA3, POLD3, RPA2, RAD23A, PCNA, RPA4, DDB1, POLE2, ERCC1, POLD1, GTF2H4, POLD2, POLE, RFC1, RFC3, RFC2, XPC, XPA, GTF2H2, GTF2H1, CDK7, LIG1, CUL4A, CUL4B, ERCC8,CCNH | KEGG hsa03420 |
| Mismatch repair | MLH3, POLD1, MLH1, POLD2, RFC1, MSH2, RFC3, RFC2, MSH3, POLD4, PMS2, RFC4, LIG1, SSBP1, RPA4, EXO1, RFC5, RPA1, MSH6, RPA3, POLD3, RPA2, PCNA | KEGG hsa03430 |
Figure 1Differential expression of ECM-receptor interaction-related genes between tumor and matched normal tissues in TCGA and validation cohorts of gastric cancer. (a) The heat map showed differential expression of all ECM-receptor interaction-related genes of 26 pairs of tumorous and matched normal tissues of gastric cancer in TCGA. (b) The heat map validated 45 ECM-receptor interaction-related genes that were identified in the validation cohort (GSE29272) with 134 pairs of tumorous and matched normal tissues. TCGA: The Cancer Genome Atlas; N: adjacent tissue to cancer; T: tumorous tissue. ∗P < 0.05, ∗∗P < 0.01, ∗∗∗P < 0.001, and ∗∗∗∗P < 0.0001.
Figure 2Identification of subclusters stratified by ECM-receptor interaction-related genes and correlation between subclusters and clinicopathologic features. (a) Identification of the consensus matrix of TCGA cohort for k = 2. (b) Principal component analysis of subclusters. (c) Survival curve of subclusters stratified by ECM-receptor interaction-related genes. (d, e) Correlation analyses between tumor characteristics or mutations and the subclusters. ∗P < 0.05, ∗∗P < 0.01.
Univariate analysis of the hazard ratio with 95% confidence interval of each gene.
| Gene | Hazard ratio | HR.95%L | HR.95%H |
| Gene | Hazard ratio | HR.95%L | HR.95%H |
|
|---|---|---|---|---|---|---|---|---|---|
| AGRN | 0.980 | 0.814 | 1.181 | 0.8352 | ITGA8 | 1.101 | 0.939 | 1.292 | 0.2352 |
| CD36 | 1.344 | 1.133 | 1.595 | 0.0007 | ITGA9 | 1.158 | 0.985 | 1.362 | 0.0753 |
| CD44 | 1.134 | 0.982 | 1.310 | 0.0864 | ITGAV | 1.405 | 1.112 | 1.777 | 0.0044 |
| CD47 | 0.824 | 0.626 | 1.086 | 0.1689 | ITGB1 | 1.192 | 0.941 | 1.510 | 0.1450 |
| CHAD | 0.997 | 0.785 | 1.267 | 0.9793 | ITGB3 | 1.286 | 0.991 | 1.669 | 0.0587 |
| COL11A1 | 1.115 | 0.987 | 1.260 | 0.0795 | ITGB4 | 0.870 | 0.762 | 0.995 | 0.0418 |
| COL11A2 | 0.843 | 0.679 | 1.046 | 0.1212 | ITGB5 | 1.329 | 1.030 | 1.715 | 0.0287 |
| COL1A1 | 1.138 | 1.017 | 1.273 | 0.0243 | ITGB6 | 1.102 | 0.971 | 1.251 | 0.1315 |
| COL1A2 | 1.168 | 1.033 | 1.322 | 0.0135 | ITGB7 | 1.040 | 0.810 | 1.334 | 0.7596 |
| COL2A1 | 1.089 | 0.960 | 1.234 | 0.1850 | ITGB8 | 0.995 | 0.813 | 1.219 | 0.9635 |
| COL3A1 | 1.173 | 1.041 | 1.321 | 0.0087 | LAMA1 | 1.044 | 0.830 | 1.312 | 0.7142 |
| COL4A1 | 1.251 | 1.042 | 1.502 | 0.0164 | LAMA2 | 1.288 | 1.085 | 1.529 | 0.0038 |
| COL4A2 | 1.196 | 1.006 | 1.421 | 0.0427 | LAMA3 | 1.033 | 0.902 | 1.183 | 0.6366 |
| COL4A4 | 1.118 | 0.911 | 1.373 | 0.2860 | LAMA4 | 1.298 | 1.080 | 1.560 | 0.0054 |
| COL4A6 | 1.134 | 0.916 | 1.404 | 0.2471 | LAMA5 | 0.937 | 0.790 | 1.112 | 0.4558 |
| COL5A1 | 1.163 | 1.013 | 1.335 | 0.0319 | LAMB1 | 1.244 | 1.014 | 1.526 | 0.0363 |
| COL5A2 | 1.233 | 1.060 | 1.433 | 0.0065 | LAMB2 | 0.999 | 0.814 | 1.227 | 0.9948 |
| COL5A3 | 1.104 | 0.909 | 1.340 | 0.3173 | LAMB3 | 0.987 | 0.859 | 1.133 | 0.8490 |
| COL6A1 | 1.135 | 0.972 | 1.326 | 0.1091 | LAMB4 | 2.320 | 0.762 | 7.066 | 0.1386 |
| COL6A2 | 1.170 | 1.014 | 1.349 | 0.0312 | LAMC1 | 1.277 | 1.064 | 1.532 | 0.0086 |
| COL6A3 | 1.162 | 1.013 | 1.333 | 0.0319 | LAMC2 | 1.056 | 0.936 | 1.191 | 0.3788 |
| COL6A6 | 1.361 | 0.661 | 2.802 | 0.4030 | LAMC3 | 0.954 | 0.741 | 1.226 | 0.7113 |
| COMP | 1.040 | 0.949 | 1.139 | 0.3989 | RELN | 1.183 | 0.976 | 1.435 | 0.0874 |
| DAG1 | 0.985 | 0.787 | 1.234 | 0.8958 | SDC1 | 0.951 | 0.823 | 1.099 | 0.4968 |
| FN1 | 1.144 | 1.028 | 1.273 | 0.0139 | SDC2 | 1.381 | 1.144 | 1.667 | 0.0008 |
| GP1BA | 1.026 | 0.772 | 1.364 | 0.8576 | SDC3 | 0.904 | 0.740 | 1.103 | 0.3203 |
| GP5 | 1.652 | 0.676 | 4.035 | 0.2708 | SDC4 | 1.068 | 0.903 | 1.265 | 0.4417 |
| GP6 | 1.187 | 0.603 | 2.338 | 0.6198 | SPP1 | 1.047 | 0.962 | 1.139 | 0.2848 |
| GP9 | 1.862 | 0.666 | 5.210 | 0.2362 | SV2A | 1.205 | 0.967 | 1.502 | 0.0962 |
| HMMR | 0.917 | 0.756 | 1.113 | 0.3811 | SV2B | 2.033 | 1.269 | 3.257 | 0.0032 |
| HSPG2 | 1.087 | 0.929 | 1.272 | 0.2990 | SV2C | 1.144 | 0.433 | 3.027 | 0.7859 |
| IBSP | 1.182 | 0.957 | 1.461 | 0.1212 | THBS1 | 1.210 | 1.069 | 1.369 | 0.0025 |
| ITGA1 | 1.193 | 1.001 | 1.421 | 0.0484 | THBS2 | 1.138 | 1.023 | 1.265 | 0.0171 |
| ITGA10 | 1.439 | 0.958 | 2.161 | 0.0792 | THBS3 | 1.220 | 0.936 | 1.590 | 0.1417 |
| ITGA11 | 1.163 | 0.994 | 1.361 | 0.0602 | THBS4 | 1.058 | 0.979 | 1.144 | 0.1515 |
| ITGA2 | 0.989 | 0.837 | 1.168 | 0.8945 | TNC | 1.082 | 0.978 | 1.197 | 0.1245 |
| ITGA2B | 1.801 | 0.666 | 4.876 | 0.2466 | TNN | 1.252 | 1.020 | 1.536 | 0.0314 |
| ITGA3 | 1.069 | 0.900 | 1.270 | 0.4475 | TNR | 1.436 | 0.616 | 3.349 | 0.4024 |
| ITGA4 | 1.175 | 0.953 | 1.449 | 0.1318 | TNXB | 1.102 | 0.964 | 1.261 | 0.1549 |
| ITGA5 | 1.129 | 0.977 | 1.306 | 0.1006 | VTN | 1.134 | 1.041 | 1.235 | 0.0040 |
| ITGA6 | 0.876 | 0.729 | 1.052 | 0.1561 | VWF | 1.285 | 1.085 | 1.521 | 0.0036 |
| ITGA7 | 1.041 | 0.901 | 1.202 | 0.5864 |
Figure 3Construction of the lasso regression model with ECM-receptor interaction-related genes and enrichment analyses. (a) The relation between partial likelihood deviances and number of genes involved in the risk model. (b) The solution paths of the risk model. (c) The coefficients of each gene involved in the risk model. (d–h) Top five enriched gene sets between high- and low-risk groups identified by the risk model. (i) The GO analysis between high- and low-risk groups identified by the risk model. (j) Top genes refer to the top BP. GO: BP: biological process; MF: molecular function; CC: cellular component; logFC: log2(fold change).
Figure 4Survival analyses and distribution of the risk model in the training and validation cohorts. (a) Survival curve of the high- and low-risk groups identified by the risk model in TCGA cohort. (b) The distribution of survival month, risk score, and gene expression in TCGA cohort. (c) Survival curve of the high- and low-risk groups identified by the risk model in the validation cohort (GSE62254). (d) The distribution of survival month, risk score, and gene expressions in the validation cohort (GSE62254).
Figure 5Relevant signatures and clinicopathologic features of the risk groups. (a) The violin plot showed high- and low-risk groups identified by different signatures. Within each group, the middle line represents the mean value of signature genes, and the bottom and top lines represent the 25th and 75th percentiles, respectively. (b) Correlation analyses between tumor characteristics and the risk groups. ∗∗P < 0.01.
Figure 6The forest plot of clinicopathologic features and the risk score. (a) Univariate analysis of clinicopathologic features and the risk score in TCGA cohort. (b) Multivariate analysis of clinicopathologic features and the risk score in TCGA cohort. Hazard ratios are shown with 95% confidence interval.