Literature DB >> 35117889

COL4A family: potential prognostic biomarkers and therapeutic targets for gastric cancer.

Xi Zeng1,2, Hao-Ying Wang1,2, Yu-Ping Wang1,2, Su-Yang Bai1,2, Ke Pu1,2, Ya Zheng1,2, Qing-Hong Guo1,2, Quan-Lin Guan3, Rui Ji1,2, Yong-Ning Zhou1,2.   

Abstract

BACKGROUND: The type IV collagen alpha chain (COL4A) family is a major component of the basement membrane (BM) that has recently been found to be involved in tumor angiogenesis and progression. However, the expression levels and the exact roles of distinct COL4A family members in gastric cancer (GC) have not been completely understood.
METHODS: Here, the expression levels of COL4As in GC and normal gastric tissues were calculated by using TCGA datasets and the predicted prognostic values by the GEPIA tool. Furthermore, the cBioPortal and Metascape tools were integrated to analyze the genetic alterations, correlations and potential functions of COL4As, and their frequently altered neighboring genes in GC.
RESULTS: Notably, the expression levels of COL4A1/2/4 in GC were higher to those in normal gastric tissues, while the expression levels of COL4A3/5/6 were lower in GC than normal. Survival analysis revealed that lower expression levels of COL4A1/5 led to higher overall survival (OS) rate. Multivariate analysis using the Cox proportional-hazards model indicated that age, gender, pathological grade, metastasis and COL4A5 expression, are independent prognostic factors for OS. However, TNM stage, lymph node metastasis, Lauren's classification, COL4A1-4 and COL4A6 were associated with poor OS but not independent prognostic factors. Function-enriched analysis of COL4As and their frequently altered neighboring genes was involved in tumor proliferation and metastasis in GC.
CONCLUSIONS: These results implied that COL4A1/2 were potential therapeutic targets for GC. COL4A3/4/6 might have an impact on gastric carcinogenesis and subsequent progression, whereas COL4A5 was an independent prognostic marker for GC. 2020 Translational Cancer Research. All rights reserved.

Entities:  

Keywords:  COL4As; Kaplan-Meier plot; bioinformatics analysis; gastric cancer (GC); prognostic value

Year:  2020        PMID: 35117889      PMCID: PMC8799138          DOI: 10.21037/tcr-20-517

Source DB:  PubMed          Journal:  Transl Cancer Res        ISSN: 2218-676X            Impact factor:   1.241


Introduction

Gastric cancer (GC) is the fourth most common malignancy and remains the second leading cause of cancer-related deaths (1). GC is a multifactorial disease, including environmental and genetic factors (2,3). Despite considerable advancements in prevention, diagnosis and treatment, the disease is still a great threat to human health with a five-year overall survival (OS) rate of less than 30% (4,5). Therefore, potential targets for treatments and new biomarkers for the prognosis of GC should be identified. Conventional biomarkers (CEA, CA19-9, AFP and CA125) have been applied in diagnosis and prediction of prognosis for GC in clinical practice. Then the first molecular biomarker, HER2, is also available to improve recurrence and the efficacy of treatments. The fibroblast growth factor receptor 2 (FGFR2), vascular endothelial growth factor, E-cadherin, and TP53, etc. are recognized as metastasis related genes and could be biomarkers for recurrence forecast and metastasis assessment in GC patients (6). Besides, Immune checkpoint receptors ligands (PD-L1/2) and microsatellite-high (MSI-High) may serve as prognostic biomarkers for treatment response for GC (7). Recently, with the progression of liquid biopsy, more and more researches focus on using body fluids to detect GC biomarkers. Circulating tumor cells (CTCs), circulating cell-free DNA (cfDNA) such as EBV DNA, microRNAs such as miR-21 and miR-23a, long noncoding RNAs such as ncRuPAR, GACAT1, and GACAT2, and exosomes may provide prognostic and predictive markers for GC (8,9). With the development of knowledge of novel approach, such as TCGA Research Network, high specific and sensitive markers will continue to be tested. The basement membrane (BM) acts as a physical barrier for prohibiting invasion and metastasis of tumors. The type IV collagen alpha chain (COL4A) family is a major component of BM that may be involved in tumor angiogenesis and progression. COL4A family constitutes of six genetically different alpha chains, α1(IV) to α6(IV), also known as COL4A1 to COL4A6 (10). COL4A1 and COL4A2 are ubiquitous, whereas COL4A3 to COL4A6 are tissue-specific. Six COL4A proteins have been found in mammalian cells, numbered according to the abundance and tissue distribution (11). COL4A1 and COL4A2 are major types and COL4A3 to COL4A6 are minor types. Mutations of COL4As genes have been confirmed to result in defective BM synthesis diseases like Goodpasture Syndrome, Alport Syndrome, and thin BM nephropathy (12-14). However, abnormal expressions of COL4As proteins have been reported involved in not only proliferation and malignant transformation but also migration and invasion of cancers by several studies (15-20). It has been observed that up-regulated COL4A1 was closely associated with tumor growth and metastasis in papillary thyroid carcinoma (16) and promoted the proliferation of the invasive ductal carcinoma cells in breast (15). Inhibition of miR-29c could upregulation the expression of COL4A1 and increase proliferation of endometrial cancer cells (21). The suppression of COL4A2 could also significantly inhibit the migration and proliferation of triple-negative breast cancer cells (19). Compared with patients with extrahepatic bile duct carcinoma of positive COL4A2 and COL4A6, loss of COL4A2 and COL4A6 had significantly poorer prognosis (22). Nie et al. (20) reported that aberrant expression of COL4A3 might play a role in the malignant transformation of gastric epithelial cells, which is a key step in the progression of gastric carcinogenesis. COL4A4 has been observed to be downregulated in esophageal cancer (17). COL4A5 may promote lung cancer progression through discoidin domain receptor-1 (23). Ikeda et al. (18) showed that COL4A5 and COL4A6 were under-expressed in colorectal cancer as compared to normal colorectal tissues and that might remodel the epithelial BM during cancer cell invasion. Baba et al. (24) demonstrated that the expressions of COL4A5 and COL4A6 were closely related to the grade of histological atypia and tumor cell growth activity in gastric intramucosal carcinoma. Thus, it can be inferred that COL4As family are closely related to the progression of many kinds of cancers, including gastrointestinal cancers. COL4A2 and COL4A6 may be prognostic biomarkers in extrahepatic bile duct carcinoma whereas COL4A5 and COL4A6 may be prognostic biomarkers in gastric intramucosal carcinoma. Although COL4As family are considered to be GC-related factors (20,25), the underlying mechanisms by which the COL4A factors are activated or suppressed, and their separate function in GC have not been elucidated so far. In this study, the relationship between COL4A factors and GC was further explored. With the development of microarray technology, RNA and DNA research has been revolutionized as an essential method of biological and biomedical studies (26). The expression levels and alterations of different COL4A factors in GC patients were analyzed to identify their expression patterns, the potential functions and distinct prognostic values in GC based on the thousands of gene expression or copy number variation analysis published online.

Methods

ONCOMINE analysis

ONCOMINE gene expression array datasets (www.oncomine.org), an online cancer microarray database, was used to analyze the mRNA expression levels of COL4As in different cancers. The mRNA expressions of COL4As in clinical cancer specimens were compared with that in normal controls, using a Students’ t-test to generate a P value. The cut-off of P value and fold change was defined as 1E-4 and 2, respectively.

Gene Expression Profiling Interactive Analysis (GEPIA) dataset

GEPIA (http://gepia.cancer-pku.cn/) is a developed interactive web server for analyzing the mRNA expression data. It consists of 9,736 tumors and 8,587 normal samples from the TCGA and the GTEx projects, using a standard processing pipeline. GEPIA provides customizable functions such as tumor/normal differential expression analysis, profiling according to the cancer types or pathological stages, patient survival analysis, similar gene detection, correlation analysis and dimensionality reduction analysis (27).

XENA analysis

The clinical and pathological data of 380 cases of GC patients and 37 adjacent normal tissues as well as the relative expression levels of COL4As were downloaded from TCGA 2015 RNA sequencing database from UCSC Xena (http://xena.ucsc.edu/). The mRNA expressions of COL4As in clinical cancer specimens were compared with that in normal controls. The clinical and pathological data were compared between high expression of COL4As in GC patients and low expression of that in GC patients. Statistical analyses were carried out by using SPSS 22.0 (IBM, SPSS, Chicago, IL, USA) and GraphPad Prism 7. Student’s t-test or Chi-square test was used to assess the statistical significance for comparisons of two groups (28). P value <0.05 was considered as significant.

OncoLnc tool

OncoLnc (http://www.oncolnc.org) is a tool that contains survival data for 8,647 patients from 21 cancer studies performed by The Cancer Genome Atlas (TCGA), along with RNA-SEQ expression for mRNAs and miRNAs from TCGA, and lncRNA expression from MiTranscriptome beta. It can be used to interactively explore survival correlations and to download clinical data coupled to expression data for mRNAs, miRNAs, or lncRNA. Users can investigate the range of expression of the gene at the Kaplan-Meier plotting page (29).

TCGA and CBioPortal analysis

The frequency of the COL4A family gene alterations (amplification, deep deletion, and missense mutations) and copy number variance were obtained from the cBioPortal (http://www.cbioportal.org/) (30). Besides, according to the online instructions of cBioPortal, we performed co-expression and network analyses.

Functional enrichment analysis

In this study, Metascape (http://metascape.org) was used to conduct pathway and process enrichment analysis of COL4A family members and neighboring genes significantly associated with COL4As alterations (31). The Gene Ontology (GO) terms as well as Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were enriched based on the Metascape online tool. PPI enrichment analysis was performed. Further, MCODE (Molecular Complex Detection) algorithm was applied to identify densely connected network components.

Results

The mRNA expression levels of COL4As in patients with GC

ONCOMINE database was used in this study to compare the mRNA expression levels of COL4As in cancers to those in normal tissues (). It was found that the mRNA expression level of COL4A1 was significantly upregulated in GC patients in six datasets. In Chen’s dataset (32), the expression level of COL4A1 was significantly up-regulated in all types of GC as compared to that in normal tissues (in diffuse gastric adenocarcinoma with a fold change of 5.045, gastric intestinal-type adenocarcinoma of 4.104, and gastric mixed adenocarcinoma of 6.23, ). COL4A2 was also overexpressed with a fold change of 10.501, 2.113, and 3.82 in diffuse gastric adenocarcinoma, gastric intestinal-type adenocarcinoma, and gastric mixed adenocarcinoma respectively in Chen’s dataset as shown in . D-Errico (33) showed another increased expression of factor, COL4A4 in diffuse gastric adenocarcinoma with a fold change of 2.739 compared with normal samples ().
Figure 1

The mRNA expression levels of COL4A factors in different types of cancers (ONCOMINE). The best gene rank percentile for the analyses within the cell determines cell color. The numbers in cells refer to numbers that met the threshold (P<1E-4, fold change ≥2).

Table 1

The significant changes of COL4As expression in transcription level between different types of gastric cancer and gastric tissues (ONCOMINE database)

Types of GC vs. gastricFold changeP valuet-testRef
COL4A1 Diffuse gastric adenocarcinoma vs. normal5.0454.54E-1314.254Chen
Gastric mixed adenocarcinoma vs. normal6.236.43E-0710.438Chen
Gastric intestinal type adenocarcinoma vs. normal4.1046.04E-1815.779Chen
Total gastric cancer vs. normal2.2765.67E-065.853Wang
COL4A2 Diffuse gastric adenocarcinoma vs. normal10.5011.63E-0910.501Chen
Gastric intestinal type adenocarcinoma vs. normal2.1332.38E-1710.669Chen
Gastric mixed adenocarcinoma vs. normal3.824.23E-069.131Chen
Total gastric cancer vs. normal2.4831.85E-065.93Wang
COL4A3 Gastric intestinal type adenocarcinoma vs. normal–2.5261.03E-05–4.775DErrico
COL4A4 Diffuse gastric adenocarcinoma vs. normal2.7391.26E-055.168DErrico
COL4A5 Diffuse gastric adenocarcinoma vs. normal–2.5859.71E-08–6.25Cho
Gastric intestinal type adenocarcinoma vs. normal–4.4677.41E-05–4.467Cho
COL4A6 Gastric intestinal type adenocarcinoma vs. normal–2.7482.26E-08–6.522Derrico
The mRNA expression levels of COL4A factors in different types of cancers (ONCOMINE). The best gene rank percentile for the analyses within the cell determines cell color. The numbers in cells refer to numbers that met the threshold (P<1E-4, fold change ≥2). On the contrary, in the same dataset of D-Errico, the expression levels of COL4A3 and COL4A6 were significantly decreased in gastric intestinal-type adenocarcinoma with a fold change of −2.526 and −2.748 respectively (). As shown in for COL4A5, there was down-regulation of mRNA expressions in diffuse gastric adenocarcinoma and gastric intestinal-type adenocarcinoma in comparison with normal patients with −2.585 and −4.467 fold changes separately according to Cho’s dataset (34).

The relationship between mRNA levels of COL4As and clinicopathological parameters of GC patients

By using GEPIA dataset, a comparison of the mRNA expression of COL4A factors between GC and gastric tissues. It was indicated from the results that COL4A1 and COL4A2 expressions were up-regulated in gastric adenocarcinoma compared with gastric tissues, whereas the expression levels of COL4A3 and COL4A5 were lower in gastric adenocarcinoma than gastric tissues ().
Figure 2

The mRNA expression of COL4As in GC (GEPIA). (A) The differences of gene expression profiles between stomach adenocarcinoma (STAD) and normal tissues (the red, green, and black “STAD” in the top represent that the expressions of related genes were increased, decreased or not significant in STAD as compared to normal tissues); (B) the differences in gene expression on box plots between STAD and normal tissues. *, P<0.01. N, normal gastric tissues, T, tumor tissues.

The mRNA expression of COL4As in GC (GEPIA). (A) The differences of gene expression profiles between stomach adenocarcinoma (STAD) and normal tissues (the red, green, and black “STAD” in the top represent that the expressions of related genes were increased, decreased or not significant in STAD as compared to normal tissues); (B) the differences in gene expression on box plots between STAD and normal tissues. *, P<0.01. N, normal gastric tissues, T, tumor tissues. In order to further validate the relationship between mRNA levels of COL4As and clinicopathological parameters of GC patients, we downloaded the clinicopathological data of 380 cases of GC patients and 37 adjacent normal tissues as well as the relative mRNA expression levels of COL4As from UCSC Xena database (28). The clinicopathological data of 380 cases of GC patients is shown in . By using TCGA sequencing data, we further validated that there was the increased mRNA expressions of COL4A1, COL4A2 and COL4A4 and the decreased mRNA expressions of COL4A5 and COL4A6 in human gastric adenocarcinoma tissues (n=380), adjacent normal tissues (n=37), and in paired GC tissues (n=34) ().
Table S1

Clinicopathological data of gastric cancer (GC) patients from TCGA database

Parameters [cases of data available*]Cases (n=380)
Age [375]
   ≥60258 (68.8)
   <60117 (31.2)
Gender [380]
   Female131 (34.5)
   Male249 (65.5)
Pathological stage [368]
   I/II171 (46.5)
   III/IV197 (53.5)
T classification [377]
   T1/T298 (26.0)
   T3/T4279 (74.0)
N classification [370]
   N0/N1218 (58.9)
   N2/N3152 (41.1)
Metastasis and[or] recurrence [380]
   Negative315 (82.9)
   Positive65 (17.1)

*, there are some data missed in clinicopathological data of GC patients from TCGA database. Data present as n (%).

Figure 3

TCGA RNA sequencing database analysis of the relatively transcriptional levels of COL4As in human GC and pair-matched normal tissues. P<0.05 was considered statistically significant.

TCGA RNA sequencing database analysis of the relatively transcriptional levels of COL4As in human GC and pair-matched normal tissues. P<0.05 was considered statistically significant. To evaluate whether the mRNA expression levels of COL4As were associated with clinical and pathological characteristics and prognosis of GC patients, 380 GC patients were divided into two groups based on the mean of mRNA expressions of each one of COL4As. There were COL4As high expression (the value > the median) and COL4As low expression (the value ≤ the median). As indicated in , COL4A2 expression positively correlated with classification of the grade (P=0.044). High expressions of both COL4A3 and COL4A4 were linked to poor TNM stage, pathological grade, lymph node metastasis, and Lauren’s classification (P<0.05). The expression of COL4A5 was high in patients aged less than 60 years. COL4A6 harbored no association with clinical parameters (P>0.05). Multivariate analysis using the Cox proportional hazards model indicated that age, gender, pathological grade, metastasis and COL4A5 expression are independent prognostic factors for OS. However, TNM stage, lymph node metastasis, Lauren’s classification, COL4A1-4 and COL4A6 were associated with poor OS but not independent prognostic factors ().
Table 2

Correlation of COL4As expression with clinicopathologic features of gastric cancer (GC) patients

Clinicopathologic features [cases of data available*]Cases (n)COL4A1COL4A2COL4A3
LowHighPLowHighPLowHighP
Age [375]
   ≥602581271311321261351230.181
   <6011762550.50657600.7385265
Gender [380]
   Female1316764686368630.666
   Male2491231260.8291221270.666122127
TNM stage [376]
   I533023332037160.003
   II–IV3231591640.3741551680.075151172
Pathological grade [371]
   G1–21487669856388600.003
   G32231101130.4601041190.04497126
Lymph node metastasis [370]
   N01175760645372450.004
   N1–N32531291240.7381221310.265113140
Metastasis [361]
   M03411731681741671731681
   M12010101.0008120.3661010
Lauren classification [241]
   Intestinal type1658382838292730.004
   Diffuse type7632440.26831450.2112749

*, there are some data missed in clinicopathological data of GC patients from TCGA database.

Figure S1

Multivariate analysis of clinicopathologic variables for survival with gastric cancer (GC)

*, there are some data missed in clinicopathological data of GC patients from TCGA database. To determine the COL4As protein expressions, they were analyzed using clinical specimens retrieved from the Human Protein Atlas (the Human Protein Atlas available from www.proteinatlas.org). It showed that COL4A1 and COL4A2 had strong expressions in GC and weak expressions in normal tissues (). Whereas COL4A3 had the inverse expression (). Unfortunately, related results of other COL4As have not been uploaded till date; hence, they could not be presented here.
Figure S2

COL4As expressions in normal gastric tissues and gastric carcinoma specimens. Images were taken from the Human Protein Atlas (http://www.proteinatlas.org) online database. N, normal gastric tissues; T, tumor tissues.

Kaplan Meier analysis of the correlation of COL4As expression levels with the overall survival (OS) and disease-free survival (DFS) of GC patients by GEPIA dataset analysis. P<0.05 was considered statistically significant.

Increased mRNA expressions of COL4A1/2/5 were associated with OS of GC patients and increased mRNA expressions of COL4A3/4 were associated with poor disease-free survival (DFS) of GC patients ()

COL4A1 and COL4A5 high mRNA expression levels were found to be associated with poor OS of patients in GC by using both GEPIA and OncoLnc tools (29) (, ). The high mRNA expression of COL4A2 was correlated with poor OS of patients in GC by using OncoLnc. However, results not same with GEPIA tool (, ). Different results between GEPIA and OncoLnc tools may derived from different cut-off values. The high expressions of COL4A3 and COL4A4 were found to be associated with poor DFS of patients in GC (). Other factors had no links with OS or DFS of patients in GC ().
Figure 4

Kaplan Meier analysis of the correlation of COL4As expression levels with the overall survival (OS) and disease-free survival (DFS) of GC patients by GEPIA dataset analysis. P<0.05 was considered statistically significant.

Figure S3

Kaplan Meier analysis of the correlation of COL4As expression levels with the overall survival (OS) by OncoLnc tool analysis. P<0.05 was considered statistically significant.

Predicted functions and pathways of the changes in COL4A factors and their frequently altered neighboring genes in patients with GC

The cBioPortal online tool (30,35) was used to analyze the COL4As alterations, correlations, and networks for GC (TCGA, Provisional). There were 152 samples out of 478 patients of COL4As altering with GC (32%). In almost half of the samples (72 samples), two or more alterations were detected (). The percentages of genetic alterations in individual genes of COL4A family members for GC varied from 6% to 13% (COL4A1, 13%; COL4A2, 11%; COL4A3, 7%; COL4A4, 7%; COL4A5, 8%; COL4A6, 6%; ). The correlations of COL4As with each other were calculated by analyzing their mRNA expressions (RNA Seq V2 RSEM) via the online tool mentioned above. Pearson’s correction was included. The results indicated significant and positive correlations in the following COL4As: COL4A1 with COL4A2, COL4A3 with COL4A4, and COL4A5 with COL4A6 (). Then we performed network analysis for COL4As and the 50 most frequently altered neighboring genes (). The names, abbreviations, and functions for these genes are shown in . The results showed that the integrin family of protein-coding genes (ITGA2B/3/4/6/7/E/L/V/X, ITGB4/5/7/8/L1) and the laminin family of protein-coding genes (LAM1/2/3/4/5, LAMB1/2/3/4, LAMC1/3) were closely associated with COL4As alterations.
Figure 5

COL4A gene expression and mutation analysis in GC (cBioPortal). (A) Oncoprint in cBioPortal represents the proportion and distribution of samples with alterations in COL4A factors, 1–6 of X-axis in the left panel refer to diffuse type stomach adenocarcinoma, papillary stomach adenocarcinoma, signet ring cell carcinoma of the stomach, tubular stomach adenocarcinoma, stomach adenocarcinoma and mucinous stomach adenocarcinoma. The figure was cropped on the right to exclude samples without alterations; (B) Pearson correlations of COL4A family members; (C) Gene–gene interaction network among COL4A family members and 50 most frequently altered neighboring genes.

Table S2

Functional roles of 50 frequently altered neighbor genes

NO.Gene symbolFull nameFunction
1 ACTB Actin betaCell motility, structure, integrity, and intercellular signaling
2 ACTN2 Actinin alpha 2Bind actin to the membrane, anchor the myofibrillar actin filaments
3 ACTN4 Actinin alpha 4Bind actin to the membrane, anchor the myofibrillar actin filaments
4 CASP8 Caspase 8Be involved in the programmed cell death induced by Fas and various apoptotic stimuli
5 COL18A1 Collagen type XVIII alpha 1 chainInhibit angiogenesis and tumor growth
6 COL21A1 Collagen type XXI alpha 1 chainMaintain the integrity of the extracellular matrix
7 COL22A1 Collagen type XXII alpha 1 chainContribute to the stabilization of myotendinous junctions and strengthen skeletal muscle attachments during contractile activity
8 COL26A1 Collagen type XXVI alpha 1 chainBe associated with aspirin-intolerant asthma
9 COL28A1 Collagen type XXVIII alpha 1 chainBelong to a class of collagens containing von Willebrand factor type A (VWFA) domains
10 COL7A1 Collagen type VII alpha 1 chainFunction as an anchoring fibril between the external epithelia and the underlying stroma
11 FLNA Filamin ABe involved in remodeling the cytoskeleton to effect changes in cell shape and migration; interact with integrins, transmembrane receptor complexes, and second messengers
12 FLNB Filamin BRepair vascular injuries
13 FN1 Fibronectin 1Be involved in cell adhesion and migration processes including embryogenesis, wound healing, blood coagulation, host defense, and metastasis
14 ITGA2B Integrin subunit alpha 2bBlood coagulation
15 ITGA3 Integrin subunit alpha 3
Form an integrin that interacts with extracellular matrix proteins including members of the laminin family; be correlated with breast cancer metastasis
16 ITGA4 Integrin subunit alpha 4May play a role in cell motility and migration
17 ITGA6 Integrin subunit alpha 6Function in cell surface adhesion and signaling
18 ITGA7 Integrin subunit alpha 7Play a role in cell migration, morphologic development, differentiation, and metastasis
19 ITGAE Integrin subunit alpha EAdhesion; serve as an accessory molecule for human intestinal intraepithelial lymphocytes activation
20 ITGAL Integrin subunit alpha LLeukocyte intercellular adhesion; function in lymphocyte costimulatory signaling
21 ITGAV Integrin subunit alpha VRegulate angiogenesis and cancer progression
22 ITGAX Integrin subunit alpha XAdherence of neutrophils and monocytes to stimulated endothelium cells, and phagocytosis of complement coated particles
23 ITGB4 Integrin subunit beta 4Play a pivotal role in the biology of invasive carcinoma
24 ITGB5 Integrin subunit beta 5Participate in cell adhesion as well as cell-surface mediated signaling
25 ITGB7 Integrin subunit beta 7Play a role in leukocyte adhesion
26 ITGB8 Integrin subunit beta 8Play a role in human airway epithelial proliferation
27 ITGBL1 Integrin subunit beta like 1Contain integrin-like cysteine-rich repeats
28 LAMA1 Laminin subunit alpha 1Cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis
29 LAMA2 Laminin subunit alpha 2Mediate the attachment, migration, and organization of cells into tissues during embryonic development
30 LAMA3 Laminin subunit alpha 3Be essential for formation and function of the basement membrane and have additional functions in regulating cell migration and mechanical signal transduction
31 LAMA4 Laminin subunit alpha 4The exact function of laminin, alpha 4 is not known
32 LAMA5 Laminin subunit alpha 5The major noncollagenous constituent of basement membranes
33 LAMB1 Laminin subunit beta 1Inhibit metastasis
34 LAMB2 Laminin subunit beta 2The maturation of neuromuscular junctions and maintain glomerular filtration
35 LAMB3 Laminin subunit beta 3Belong to a family of basement membrane proteins
36 LAMB4 Laminin subunit beta 4Cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis
37 LAMC1 Laminin subunit gamma 1Cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis
38 LAMC3 Laminin subunit gamma 3Cell adhesion, differentiation, migration, signaling, neurite outgrowth and metastasis
39 MSR1 Macrophage scavenger receptor 1Mediate the endocytosis of modified low-density lipoproteins; regulation of scavenger receptor activity in macrophages.
40 P3H2 Prolyl 3-hydroxylase 2Collagen chain assembly, stability and cross-linking
41 P4HB Prolyl 4-hydroxylase subunit betaA highly abundant multifunctional enzyme
42 PDGFA Platelet derived growth factor subunit ABind and activate PDGF receptor tyrosine kinases
43 PLOD2 Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2Be critical for the stability of intermolecular crosslinks
44 PLOD3 Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3Be critical for the stability of intermolecular crosslinks
45 PTK2 Protein tyrosine kinase 2Cell growth and intracellular signal transduction pathways
46 SERPINH1 Serpin family H member 1A marker for cancer
47 THBS2 Thrombospondin 2A potent inhibitor of tumor growth and angiogenesis
48 THBS4 Thrombospondin 4Be activated during the stromal response to invasive breast cancer; play a role in inflammatory responses in Alzheimer's disease
49 TLN1 Talin 1Assist in the attachment of adherent cells to extracellular matrices and of lymphocytes to other cells
COL4A gene expression and mutation analysis in GC (cBioPortal). (A) Oncoprint in cBioPortal represents the proportion and distribution of samples with alterations in COL4A factors, 1–6 of X-axis in the left panel refer to diffuse type stomach adenocarcinoma, papillary stomach adenocarcinoma, signet ring cell carcinoma of the stomach, tubular stomach adenocarcinoma, stomach adenocarcinoma and mucinous stomach adenocarcinoma. The figure was cropped on the right to exclude samples without alterations; (B) Pearson correlations of COL4A family members; (C) Gene–gene interaction network among COL4A family members and 50 most frequently altered neighboring genes.

Functional gene and pathway enrichment analysis of COL4A factors and their frequently altered neighboring genes in patients with GC

The GO functions and KEGG pathway enrichment analysis of candidate COL4As and their frequently altered neighboring genes was performed based on the Metascape databases (31). As shown in , there were the top 20 GO and KEGG enrichment items (17 terms and 3 pathways) involved. It was indicated that these COL4As and their frequently altered neighboring genes were mainly enriched in extracellular matrix organization, integrin-mediated signaling pathway, endoplasmic reticulum lumen, endodermal cell differentiation, cell morphogenesis involved in differentiation, cell junction assembly, positive regulation of cell migration, blood vessel morphogenesis, platelet degranulation, laminin-5 complex, and laminin-1 complex, etc. Three significantly enriched pathways: focal adhesion, toxoplasmosis, and proteoglycans in cancer were identified in correlations with COL4As and their frequently altered neighboring genes. Furthermore, these enriched terms were closely connected with each other and clustered into intact networks ().
Figure 6

The enrichment analysis of COL4A family members and neighboring genes in OC (Metascape). (A) Heatmap of Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enriched terms colored according to P values; (B) enriched terms or pathways are distinguished by nodes. Colors and node sizes indicate number of genes involved; (C) and (D) Protein-protein interaction (PPI) network and five most significant MCODE components form the PPI network.

The enrichment analysis of COL4A family members and neighboring genes in OC (Metascape). (A) Heatmap of Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enriched terms colored according to P values; (B) enriched terms or pathways are distinguished by nodes. Colors and node sizes indicate number of genes involved; (C) and (D) Protein-protein interaction (PPI) network and five most significant MCODE components form the PPI network. For a better understanding, the relationship between COL4A family members and GC, the Metascape database was used to perform protein-protein interaction (PPI) enrichment analysis. The PPI network is shown in . The five most significant MCODE components were extracted from the PPI network. Each MCODE component was applied by pathway and process enrichment analysis independently, and the three best-scoring terms of the corresponding components by P value were retained as the functional description shown in . The results suggested that ECM-receptor interaction, PID integrin pathway, extracellular matrix organization, focal adhesion, laminin interactions, and cell junction assembly were mainly associated with COL4A family members.
Table 3

Independent functional enrichment analysis of three Molecular Complex Detection (MCODE) components.

MCODEGODescriptionLog10(P)
MCODE_1hsa04512ECM-receptor interaction–23.9
MCODE_1M18PID INTEGRIN1 PATHWAY–21.6
MCODE_1R-HSA-1474244Extracellular matrix organization–21
MCODE_2hsa04510Focal adhesion–18.8
MCODE_2R-HSA-3000157Laminin interactions–12.6
MCODE_2GO:0034329Cell junction assembly–12.5
MCODE_3R-HSA-186797Signaling by PDGF–10.5
MCODE_3R-HSA-3000171Non-integrin membrane-ECM interactions–10.5
MCODE_3R-HSA-2214320Anchoring fibril formation–9.1
MCODE_4CORUM:6990THSD1-FAK-talin-vinculin complex–11.8
MCODE_4CORUM:5177Polycystin-1 multiprotein complex (ACTN1, CDH1, SRC, JUP, VCL, CTNNB1, PXN, BCAR1, PKD1, PTK2, TLN1)–10.2
MCODE_4M281PID FAK PATHWAY–7.9
MCODE_5R-HSA-1650814Collagen biosynthesis and modifying enzymes–7.7
MCODE_5R-HSA-1474290Collagen formation–7.3
MCODE_5R-HSA-1474244Extracellular matrix organization–5.7

Discussion

In this study, the mRNA expression levels, prognostic values, genetic alterations, correlations, and potential functions of different COL4As in GC, were systematically explored by bioinformatics analysis. COL4A1, the most classic member of the COL4A family is found to play a pivotal role in proliferation, metastasis, and invasion in most cancers (15,16,36). Zhang et al. (37) reported that miRNA-29c-3p represses proliferation of gastric adenocarcinoma BGC-823 cells by directly targeting COL4A1. Huang et al. (38) identified that COL4A1 is upregulated in trastuzumab resistance in GC cells and may induce trastuzumab resistant in GC in silico. In the present study, it was revealed that the mRNA expression of COL4A1 was upregulated in human GC as compared to normal tissues. High mRNA expression level of COL4A1 was found to be associated with poor OS by using both GEPIA and OncoLnc tool. However, expression of COL4A1 did not correlate with the DFS and clinical characteristics of the patients with GC. These phenomena indicate COL4A1 may serve as a new biomarker for the prognosis and a potential target of GC. As demonstrated in , COL4A1 is also upregulated in lymphoma and sarcoma. Moreover, Chida et al. (39) identified that COL4A1 is located predominantly in cancer stroma. Immunohistochemistry () also showed COL4A1 is expressed in stromal tissue but not in tumor cells. In addition, COL4A1 may be derived from stromal reaction during the tumor progression and not from tumor cells themselves. Miyake et al. (36) found that the formation of tumor budding is involved in the carcinogenesis of COL4A1 in human urothelial cancer of the bladder. However, in papillary thyroid cancer, exosomal miR-21-5p can increase endothelial tube formation by inhibiting COL4A1, consequently promoting angiogenesis (40). Angiogenesis may be involved in the effects of COL4A1 on tumors. Additional experiments need be performed in the future to elucidate whether COL4A1 promotes or inhibits angiogenesis in GC. In triple-negative breast cancer, knockdown of COL4A2 could inhibit the proliferation and migration of cancer cells (19). COL4A2 is identified as a methylation marker with high accuracy for the detection of colorectal cancer (41). Notch3 can upregulate COL4A2 and promote anoikis resistance in ovarian cancer (42). In our study, use of ONCOMINE, GEPIA, and UCSC Xena datasets also revealed the mRNA expression level of COL4A2 was up-regulated in GC. By using the GEPIA tool, it was analyzed that COL4A2 was not related to OS and DFS of patients in GC. High transcriptional expression level of COL4A2 was associated with poor OS in OncoLnc. Moreover, COL4A2 high expression positively correlated with pathological grade (P=0.044). These findings suggested that COL4A2 may be a promising therapeutic target and could predict the prognosis of patients in GC as a biomarker. Metodieva et al. (43) found that COL4A3 is downregulated in early-stage non-small cell lung cancer by real-time PCR. COL4A3 expressed to a lesser extent in GC than in normal mucosa at both mRNA and protein levels but its expression positively related to poor prognosis and worse clinicopathologic features of GC (20). A decreased mRNA expression of COL4A3 in GC is also shown in our study. Both GEPIA and OncoLnc results showed COL4A3 was not related to OS of patients in GC. However, to our surprise, high COL4A3 mRNA expression was significantly associated with poor DFS and the high mRNA expressions of COL4A3 was correlated with poor TNM stage, pathological grade, lymph node metastasis, and Lauren’s classification (P<0.05). It seemed COL4A3 might serve as a biomarker to indicate a worse prognosis of patients in GC. COL4A4 was confirmed to be down-regulated in esophageal cancer (17). However, in the present study, excluding GEPIA datasets, ONCOMINE and UCSC Xena showed an increased mRNA expression of COL4A4 in GC. High COL4A4 mRNA expression was associated with poor DFS and poor TNM stage, pathological grade, lymph node metastasis and Lauren classification (P<0.05). There was no association between COL4A4 expression level and OS of patients in GC. COL4A4 may play a role of an oncogene and a potential prognostic biomarker for GC. ONCOMINE, GEPIA and UCSC Xena datasets all revealed that the mRNA expression of COL4A5 was downregulated in GC. Except for GEPIA dataset, ONCOMINE, and XENA datasets showed low mRNA expression of COL4A6 in GC compared with normal samples. COL4A5 was not related to DFS. We found high mRNA expression of COL4A5 was correlated with poor OS. Multivariate analysis indicated that age, gender, pathological grade, metastasis, and COL4A5 expression are independent prognostic factors. COL4A6 harbored no link with OS, DFS and clinical characteristics. Loss of expressions of COL4A5 and COL4A6 were reported in colorectal cancer and might be involved in the remodeling of the epithelial BM during cancer cell invasion (18). COL4A5 may be an indicator of a worse prognosis of GC. Further research is needed to prove whether COL4A6 plays a role in GC. The percentages of genetic alterations in COL4A family members for GC were calculated to further illustrate the genetic alterations, potential functions, and carcinogenic mechanisms of the same. The percentages of genetic alterations ranged from 6% to 13% for individual genes based on TCGA Provisional dataset. In addition, we predicted COL4As alterations related 50 genes and constructed a network. COL4As alterations were closely associated with the integrin family of protein-coding genes, including ITGA2B/3/4/6/7/E/L/V/X, ITGB4/5/7/8/L1, and the laminin family of protein-coding genes, including LAM1/2/3/4/5, LAMB1/2/3/4, LAMC1/3. The GO and KEGG pathway analysis indicated they were enriched in pathways between ECM and the adhesion process. Adhesion-related pathways, such as extracellular matrix organization, focal adhesion and integrin-mediated signaling pathways were associated with the processes of proliferation, migration and invasion of GC (44-46). Combined with the results above, we hypothesized that COL4As may have impacts on adhesion-related pathways and integrin-mediated signaling pathways, thereby regulating the downstream of Akt pathway. Activation of Akt pathway could promote the proliferation and invasion of GC. This study is a descriptive research using bioinformatics analysis. In future, a large sample sizes with high quality and experimental studies in our hospital are needed to further elucidate and verify our research.

Conclusions

In this study, we used the GEPIA tool, cBioPortal, and Metascape tool to explore the expression and prognostic value of COL4As in GC from which we could have a further understanding of the molecular biological properties of GC. Our findings suggested that COL4A1/2 are potential therapeutic targets for GC, COL4A3/4/6 may have an impact on gastric carcinogenesis and subsequent progression and COL4A5 is found to be an independent prognostic marker for GC.
  45 in total

1.  Loss of expression of type IV collagen alpha5 and alpha6 chains in colorectal cancer associated with the hypermethylation of their promoter region.

Authors:  Koei Ikeda; Ken-ichi Iyama; Nobuyuki Ishikawa; Hiroshi Egami; Mitsuyoshi Nakao; Yoshikazu Sado; Yoshifumi Ninomiya; Hideo Baba
Journal:  Am J Pathol       Date:  2006-03       Impact factor: 4.307

2.  Restoration of microRNA-29c in type I endometrioid cancer reduced endometrial cancer cell growth.

Authors:  Michelle Van Sinderen; Meaghan Griffiths; Ellen Menkhorst; Keith Niven; Evdokia Dimitriadis
Journal:  Oncol Lett       Date:  2019-07-09       Impact factor: 2.967

3.  Exosomes increased angiogenesis in papillary thyroid cancer microenvironment.

Authors:  Feng Wu; Fuxingzi Li; Xiao Lin; Feng Xu; Rong-Rong Cui; Jia-Yu Zhong; Ting Zhu; Su-Kang Shan; Xiao-Bo Liao; Ling-Qing Yuan; Zhao-Hui Mo
Journal:  Endocr Relat Cancer       Date:  2019-05       Impact factor: 5.678

Review 4.  Circulating MicroRNAs: Valuable Biomarkers for the Diagnosis and Prognosis of Gastric Cancer.

Authors:  Najibeh Shekari; Behzad Baradaran; Dariush Shanehbandi; Tohid Kazemi
Journal:  Curr Med Chem       Date:  2018-02-21       Impact factor: 4.530

5.  Collagen (COL4A) mutations are the most frequent mutations underlying adult focal segmental glomerulosclerosis.

Authors:  Christine Gast; Reuben J Pengelly; Matthew Lyon; David J Bunyan; Eleanor G Seaby; Nikki Graham; Gopalakrishnan Venkat-Raman; Sarah Ennis
Journal:  Nephrol Dial Transplant       Date:  2015-09-07       Impact factor: 5.992

6.  Expression analysis of angiogenesis-related genes in Bulgarian patients with early-stage non-small cell lung cancer.

Authors:  Svetlana Nikolova Metodieva; Dragomira Nikolaeva Nikolova; Radostina Vlaeva Cherneva; Ivanka Istalianova Dimova; Danail Borisov Petrov; Draga Ivanova Toncheva
Journal:  Tumori       Date:  2011 Jan-Feb

7.  Gene expression profile analyze the molecular mechanism of CXCR7 regulating papillary thyroid carcinoma growth and metastasis.

Authors:  Hengwei Zhang; Xuyong Teng; Zhangyi Liu; Lei Zhang; Zhen Liu
Journal:  J Exp Clin Cancer Res       Date:  2015-02-12

8.  GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses.

Authors:  Zefang Tang; Chenwei Li; Boxi Kang; Ge Gao; Cheng Li; Zemin Zhang
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

9.  A panel of collagen genes are associated with prognosis of patients with gastric cancer and regulated by microRNA-29c-3p: an integrated bioinformatics analysis and experimental validation.

Authors:  Qiang-Nu Zhang; Hui-Li Zhu; Meng-Ting Xia; Juan Liao; Xiao-Tao Huang; Jiang-Wei Xiao; Cong Yuan
Journal:  Cancer Manag Res       Date:  2019-05-24       Impact factor: 3.989

10.  An immune-related gene signature predicts prognosis of gastric cancer.

Authors:  Bitao Jiang; Qingsen Sun; Yao Tong; Yuzhuo Wang; Haifen Ma; Xuefei Xia; Yu Zhou; Xingguo Zhang; Feng Gao; Peng Shu
Journal:  Medicine (Baltimore)       Date:  2019-07       Impact factor: 1.889

View more
  2 in total

1.  Identification of Extracellular Matrix Signatures as Novel Potential Prognostic Biomarkers in Lung Adenocarcinoma.

Authors:  Zhen Zeng; Yuanli Zuo; Yang Jin; Yong Peng; Xiaofeng Zhu
Journal:  Front Genet       Date:  2022-05-30       Impact factor: 4.772

2.  MALDI-MSI: A Powerful Approach to Understand Primary Pancreatic Ductal Adenocarcinoma and Metastases.

Authors:  Juliana Pereira Lopes Gonçalves; Christine Bollwein; Anna Melissa Schlitter; Mark Kriegsmann; Anne Jacob; Wilko Weichert; Kristina Schwamborn
Journal:  Molecules       Date:  2022-07-27       Impact factor: 4.927

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.