Literature DB >> 33330022

Blood Cell DNA Methylation of Aging-Related Ubiquitination Gene DZIP3 Can Predict the Onset of Early Stage Colorectal Cancer.

Yuan Quan1,2, Fengji Liang2,3, Deqing Wu4, Xueqing Yao4, Zhihuang Hu5, Yuexing Zhu2, Ying Chen2, Andong Wu2, Danian Tang6, Bingyang Huang7, Ruifeng Xu1, Zejian Lyu4, Qian Yan4, Lang Luo8, Zhengzhi Ning8, Yong Li4, Jianghui Xiong2,3.   

Abstract

There is a body of evidence that the aging immune system is linked to cancer. In this study, with aging- and immune-related DNA methylation data, we investigated the DNA methylation regulation changes in promoters with other regions of genes during aging and their association with the immune-cell proportion in the circulating whole blood of individuals. The analyses for aging- and CD4+ T cell proportion-derived differential genes showed that ubiquitination plays an important role in the aging immune system and tumorigenesis. Therefore, starting from a set of pre-annotated ubiquitination genes, we found that among the differentially ubiquitinated genes, DZIP3, an E3 ubiquitin ligase with no reports on its function in immune cells and tumorigenesis, was significantly associated with both aging (P-value = 3.86e-06) and CD4+ T cell proportion (P-value = 1.97e-05) in circulating blood. By collecting a cohort of 100 colon cancer patients and 50 healthy individuals, we validated that the 1st exon DNA methylation of DZIP3 could predict the onset of early stage (AUC = 0.833, OR = 8.82) and all pTNM stages of colorectal cancer (AUC = 0.782, OR = 5.70). Thus, the epigenetically regulated ubiquitination machine plays an important role in immune aging and tumorigenesis.
Copyright © 2020 Quan, Liang, Wu, Yao, Hu, Zhu, Chen, Wu, Tang, Huang, Xu, Lyu, Yan, Luo, Ning, Li and Xiong.

Entities:  

Keywords:  DZIP3; E3 ubiquitin ligase; aging immune system; cancer screening; colon cancer; early detection; liquid biopsy

Year:  2020        PMID: 33330022      PMCID: PMC7729090          DOI: 10.3389/fonc.2020.544330

Source DB:  PubMed          Journal:  Front Oncol        ISSN: 2234-943X            Impact factor:   6.244


Introduction

In recent years, colorectal cancer (CRC) has gradually developed into the third most common cancer worldwide and is one of the leading causes of cancer-triggered deaths (1). In the past few decades, due to a large number of studies on the pathogenesis of CRC, the diagnostic tests and treatment strategies have made significant progress, which has led to the improvement of CRC survival time (2). Previous studies have found that there are several pathogenic factors of cancer, such as genome instability and mutation, intestinal flora imbalance and unreasonable diet (3). Immune destruction is also an important cause of cancer. Therefore, chronic inflammation is one of the hallmarks of cancer (3). It is well known that the earliest stages of tumorigenesis are often accompanied by the appearance of chronic inflammation (4), and CRC is considered to be one of the most representative examples of tumors closely related to chronic inflammation (5). A large number of studies have shown that prolonged inflammation of the intestine, such as Crohn’s disease or ulcerative colitis and other inflammatory bowel diseases (IBDs), often develop into CRC at later stage (6–8). Notably, studies have found that individuals can prevent or delay CRC by using anti-inflammatory drugs (9, 10), which also suggests that the inflammatory process is closely linked to tumor onset. The inflammatory reactions are usually associated with microbial responses, and the intestinal tract is composed of numerous bacterial strains. Under normal circumstances, these bacterial strains in harmony with their human hosts. However, any substantial change in the bacterial population may have a significant impact on inflammatory responses and lead to tumorigenesis. In addition, age is a major risk factor for many cancers. There is a body of evidence that aging immune system is linked to cancer (11–13). There is a growing interest in immunosenescence and how it may contribute to the increased risk of cancer onset with aging. For example, CD4+ T cells, especially T follicular helper cells, are essential for the body to produce the robust humoral response against infection and vaccination (14). Evidence suggested that the aging microenvironment can cause age-related functional deficits in mouse CD4+ T cells (15). And age-related damage of the humoral response to influenza is related to disturbances of antigen-specific T follicular helper cell responses (16). On the other hand, ubiquitination refers to the covalent attachment of ubiquitin molecules with proteins, which is a mechanism widely used by organisms to rapidly regulate cell signaling. Recent studies showed that ubiquitination has an important role in regulating various signals of innate and adaptive immune cells (17–19). A good example is that PD-1 ubiquitination could regulate the antitumor immunity of T cells. In the tumor microenvironment, PD-1 of dysfunctional T cells is abnormally highly expressed; hence antibody inhibitors against PD-1 or its ligand (PD-L1) have become one of the commonly used drugs for the treatment of various tumors. A study demonstrated that the surface PD-1 protein of activated T cells undergoes internalization, followed by ubiquitination and proteasome degradation (20). Here, we hypothesized that ubiquitination might play a key gatekeeper role in immune system aging and tumorigenesis in CRC. The study of tumor immune interactions presents great technical challenges due to the great heterogeneity in the tumor microenvironment and global immune system function. Blood cell DNA methylome analysis provides a promising tool to probe the key components of immune cell dysregulation in aging and tumorigenesis. In recent years, researchers have identified several DNA methylation markers for quantification of tumor immune cell infiltration, which have significantly improved specificity compared to other markers. For instance, FOXP3’s DNA methylation value can be used as a marker to distinguish T lymphocyte subpopulations, which can distinguish regulatory T lymphocytes from other types of T lymphocytes (21). However, because these T lymphocyte subpopulations can express similar surface markers, it is difficult for researchers to distinguish based on other common methods, such as gene expression or immunohistochemistry (21, 22). With the development of DNA methylation signature, that can coinstantaneous quantification of diverse immune cell populations in human blood, the ability of DNA methylation as a cell typing tool will be further enhanced. There are reports that a broad signature of hepatocellular carcinoma (HCC) in peripheral blood mononuclear cell and T cell DNA methylation, which can distinguish HCC from chronic hepatitis B and C as well as healthy individuals in the early stage, escalates with the progression of HCC (23). In the present study, we will systematically study the DNA methylation of ubiquitination regulatory system genes in blood cells and screen candidate genes that are correlated with both aging and immune cell compositions. Then, we will use low-cost nucleic acid mass spectrometry (Sequenom) to test whether ubiquitination genes can be used as candidate markers for the early screening of CRC.

Materials and Methods

Identification of Differential Genes Associated With Aging and Immune Cell Proportions

In our study, aging-related and immune-related DNA methylation data were collected from the Gene Expression Omnibus (GEO) database (www.ncbi.nlm.nih.gov/geo, GEO accession number: GSE40279 and GSE69270). GEO is an open-source functional genomics database composed of high-throughput resources, including microarray data, gene expression data, and DNA methylation data, etc. The DNA methylation platform’s annotation information was used to transform the probes into homologous gene symbols. The GSE40279 dataset contained DNA methylation profiles across approximately 450K CpGs in the human whole blood of 656 samples from patients whose age ranged from 19 to 101 years. The GSE69270 dataset contained DNA methylation profiles in the whole blood of 184 samples with annotations of six immune cell proportions, including CD8+ T and CD4+ T cells, monocytes, granulocytes, and NK and B cells. Next, we calculated the significantly differential genes during aging and their association with immune cell proportion in the circulating whole blood of healthy individuals. Recently, a study published in Genome Res. found that there is a strong relationship between the methylation of gene body difference to promoter (MeGDP) and gene expression, which correlation coefficient is up to 0.67 (24). Accordingly, MeGDP can be used as a predictor of gene expression. The higher the MeGDP is, the higher the expression value of a gene, suggesting that MeGDP is closely linked to the corresponding phenotypes. Depending on the results of the above research, our other paper proposed the Statistical Difference of DNA Methylation between Promoter and other regions (SIMPO) to calculate the DNA methylation characteristics of different genes () (25). Recently, based on SIMPO algorithm, our group has achieved promising results in the research of DNA methylation biomarker identification for type 2 diabetes (26).
Figure 1

Method pipeline of candidate marker validation and prediction model construction for colorectal cancer.

Method pipeline of candidate marker validation and prediction model construction for colorectal cancer. The input data of the SIMPO algorithm is the DNA methylation values of cg probes that are located in the gene promoter and the other regions. The significant difference method T-test is used in the SIMPO algorithm, and the degree of difference (SIMPO score) is used to characterize the DNA methylation of corresponding genes as follows (21): where : average DNA methylation value of all probes that are located in the promoter region; : average DNA methylation value of all probes that are located in the other region; m: number of probes that are located in the promoter region; n: number of probes that are located in the other region; : variance of DNA methylation values of probes that are located in the promoter region; and : variance of DNA methylation values of probes that are located in the other region. In addition, since the SIMPO score relates to the number of probes, to ensure the reliability of the SIMPO score, we only selected genes with a number of promoter region-located and other region-located probes greater than or equal to five for further calculation. Based on the SIMPO algorithm, this study separately calculated the SIMPO score of each gene in the aging-related and immune cell proportion-related DNA methylation data. Next, we calculated the differences of gene SIMPO scores between the old age (>50 year) and middle age (≤50 year) individuals based on the adjusted T-test. Then, for the immune-related methylation data, we identified the differential genes that were significantly associated with immune cell proportion based on the adjusted Spearman correlation test.

Ubiquitination Genes Collection

In this study, we collected 233 reviewed E3 ubiquitin ligase genes in humans by querying the UniProt database (https://www.uniprot.org) with “E3 ubiquitin-protein ligase” as a key word (, ). The biological functions were enriched for 233 E3 ubiquitin ligases using the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway records in the Database for Annotation, Visualization and Integrated Discovery (DAVID, https://david.ncifcrf.gov/) (27). The corresponding catalogs and diseases of the KEGG pathways were downloaded from the KEGG Pathway Database (https://www.genome.jp/kegg/pathway.html) (28). The results showed that these E3 ubiquitin ligases were enriched in four KEGG pathways (Bonferroni-adjusted P-value ≤ 0.05), of which the most significant is the ubiquitin-mediated proteolysis pathway (). This result indicates that these 233 E3 ubiquitin ligases are significantly associated with ubiquitination and could be used for the next analysis.

Survival Analysis

In this study, we performed a Kaplan–Meier survival analysis and a multivariate Cox regression analysis to examine the relationship between the DNA methylation level (SIMPO scores) of E3 ubiquitin ligase genes and survival times of colon adenocarcinoma (COAD) patients. The methylation data of the COAD patient’s tumor tissue and their clinical survival information were downloaded from the Cancer Genome Atlas (TCGA, https://portal.gdc.cancer.gov), which contain 293 individuals. According to the values of SIMPO scores, the patients with COAD were divided into high-expression (top 50%) and low-expression (bottom 50%) groups for each gene. Then, SangerBox package (http://sangerbox.com/) was utilized to plot Kaplan–Meier curves for SIMPO scores, and the significances of the difference between Kaplan–Meier curves of these two groups were calculated separately for each gene by the Log-rank tests. In addition, the overall survival analysis also contained the Cox proportional hazard ratio (HR) and the 95% confidence interval. Finally, P-value <0.05 was considered to indicate the statistically significant result.

Clinical Blood Samples Collection

We analyzed blood samples from 100 diagnosed CRC patients (male: 67, female: 33) from Guangdong Provincial People’s Hospital and FoShan New RongQi Hospital. A total of 53% of the samples were diagnosed with pTNM stage I/II. We also collected 50 normal control samples (male: 18; female: 32) from FoShan New RongQi Hospital. All peripheral whole blood cell samples were treated with EDTA anticoagulant.

MassARRAY Analysis

We identified the sequences 500 bp up and down from the CpG position at the 1st exon of DZIP3 (probe cg14787155 annotated position in HM450K chip) with the UCSC genome browser (http://genome.ucsc.edu/) (chr3:108308033-108309033), and designed one primer set for methylation analysis of the region by EpiDesigner software (http://epidesigner.com). Quantitative DNA methylation analysis of was carried out by using the MassARRAY platform (SEQUENOM) according to the official pipelines. In brief, DNA was treated with sodium bisulfite and PCR amplified, and bisulfite reactions were designed. The DNA methylation status of the samples was quantitatively tested by using the matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Finally, 25 CpG sites were tested, and the methylation data of 18 individual units (one to two CpG sites per unit) were generated by the EpiTyper v1.0.5 software.

Result

Biological Analysis of Aging and CD4+ T Cell Proportion-Derived Differential Genes

Based on the SIMPO algorithm, we calculated the SIMPO score for each gene of the aging and immune-related DNA methylation data. In the aging-related methylation data, we identified a total of 2,358 genes that were significantly associated with the age of the patient (based on the adjusted T-test between the old age (>50 year) and middle age (≤50 year) individuals). In the immune-related methylation data, we identified a total of 1,331 genes that were significantly associated with the CD4+ T cell proportion (based on the adjusted Spearman correlation test). First, we performed the KEGG pathway enrichment analyses for the above significant gene sets by using the Enrichr database (https://amp.pharm.mssm.edu/Enrichr/) (29). The results showed that the aging-associated genes were enriched in 32 significant KEGG pathways, and the immune-associated genes were enriched in 23 significant KEGG pathways. Among them, it is worth noting that the cellular senescence pathway is enriched in both the aging- and immune-associated gene sets (). In addition, the ubiquitin-mediated proteolysis pathway was identified by an immune-related gene set (). Therefore, the results indicated that genes that are significantly disturbed in DNA methylation level during the aging and immunization processes are associated with ubiquitination.
Table 1

The significantly enriched KEGG pathways of the aging- and CD4+ T cell proportion-derived differential genes (top 20).

AgingCD4+ T cell proportion
KEGG pathwayP-valueKEGG pathwayP-value
Glyoxylate and dicarboxylate metabolism7.83e-05Ferroptosis1.04e-03
Non-small cell lung cancer4.67e-04Cysteine and methionine metabolism3.40e-03
Axon guidance1.11e-03Platelet activation8.16e-03
Cell cycle2.39e-03Cellular senescence1.01e-02
Cellular senescence6.67e-03Non-small cell lung cancer1.15e-02
Nucleotide excision repair7.05e-03AMPK signaling pathway1.35e-02
DNA replication7.18e-03TNF signaling pathway1.44e-02
Valine, leucine and isoleucine degradation8.41e-03Hepatitis C1.50e-02
Melanoma8.68e-03Hippo signaling pathway2.01e-02
Human cytomegalovirus infection8.70e-03Base excision repair2.02e-02
C-type lectin receptor signaling pathway9.24e-03Fatty acid degradation2.51e-02
Dopaminergic synapse1.00e-02Viral carcinogenesis2.70e-02
Oocyte meiosis1.07e-02Chronic myeloid leukemia2.87e-02
Glioma1.29e-02Cushing syndrome2.91e-02
Pancreatic cancer1.29e-02Tyrosine metabolism3.00e-02
Hepatitis C1.40e-02Fatty acid elongation3.07e-02
Chronic myeloid leukemia1.46e-02Progesterone-mediated oocyte maturation3.14e-02
TNF signaling pathway1.72e-02Glycosaminoglycan degradation3.39e-02
Ubiquitin mediated proteolysis1.72e-02Lysosome3.42e-02
Hippo signaling pathway2.08e-02Leukocyte transendothelial migration3.51e-02
The significantly enriched KEGG pathways of the aging- and CD4+ T cell proportion-derived differential genes (top 20). In addition, the Gene Ontology (GO) molecular function results further validate the above conclusion. The aging-derived differential genes were enriched in the ubiquitin-like protein ligase binding and ubiquitin protein ligase binding terms (). CD4+ T cell proportion-derived differential genes were also enriched in the ubiquitin binding term ().
Table 2

The significantly enriched GO molecular function terms of the aging- and CD4+ T cell proportion-derived differential genes (top 20).

AgingCD4+ T cell proportion
GO term (molecular function)P-valueGO term (molecular function)P-value
RNA binding8.10e-07Microtubule plus-end binding9.81e-05
Protein-DNA loading atpase activity3.88e-04Kinase binding5.82e-04
DNA clamp loader activity3.88e-04Phosphatidylinositol 3-kinase regulator activity1.10e-03
Bubble DNA binding3.88e-04RNA binding1.70e-03
Microtubule plus-end binding2.19e-03Protein kinase binding2.23e-03
Ubiquitin-like protein ligase binding3.96e-03Clathrin adaptor activity2.96e-03
Ubiquitin protein ligase binding4.26e-03Endocytic adaptor activity2.96e-03
5′-3′ RNA polymerase activity7.76e-03Prenyltransferase activity4.41e-03
Aminoacyl-trna ligase activity7.93e-03Transcription corepressor activity4.74e-03
3-hydroxyacyl-coa dehydrogenase activity (GO:0003857)9.11e-03Neurotrophin TRKA receptor binding5.05e-03
Kinase activity (GO:0016301)1.32e-02RNA polymerase II transcription factor activity, sequence-specific transcription regulatory region DNA binding6.86e-03
Protein kinase binding1.40e-02Ubiquitin binding8.33e-03
Signal sequence binding1.55e-02Ribosomal protein S6 kinase activity8.40e-03
Acetyltransferase activity2.21e-02Protein-DNA loading atpase activity8.40e-03
Clathrin adaptor activity2.25e-02DNA clamp loader activity8.40e-03
G-rich strand telomeric DNA binding2.25e-02Single-stranded DNA endodeoxyribonuclease activity8.40e-03
I-SMAD binding2.25e-02N-methyltransferase activity1.08e-02
Magnesium ion transmembrane transporter activity2.25e-02Hydrolase activity, hydrolyzing N-glycosyl compounds1.14e-02
Endocytic adaptor activity2.25e-02Ligand-dependent nuclear receptor transcription coactivator activity1.15e-02
Double-stranded DNA binding2.37e-02Damaged DNA binding1.16e-02
The significantly enriched GO molecular function terms of the aging- and CD4+ T cell proportion-derived differential genes (top 20). The GO biological process results show that the aging-derived differential genes were enriched in the proteasome-mediated ubiquitin-dependent protein catabolic process and ubiquitin-dependent protein catabolic process terms (). The CD4+ T cell proportion-derived differential genes were enriched in the positive regulation of the proteasomal ubiquitin-dependent protein catabolic process, the regulation of the proteasomal ubiquitin-dependent protein catabolic process and protein polyubiquitination ().
Table 3

The significantly enriched GO biological process terms of the aging- and CD4+ T cell proportion-derived differential genes (top 20).

AgingCD4+ T cell proportion
GO term (biological process)P-valueGO term (biological process)P-value
DNA metabolic process3.23e-05Positive regulation of proteasomal ubiquitin-dependent protein catabolic process5.46e-05
Cellular macromolecule biosynthetic process3.34e-05Positive regulation of proteasomal protein catabolic process3.28e-04
Translation3.71e-05Cellular response to DNA damage stimulus3.55e-04
Regulation of histone modification5.74e-05Negative regulation of protein serine/threonine kinase activity4.77e-04
Cellular response to DNA damage stimulus7.21e-05Retinoic acid receptor signaling pathway5.81e-04
Transcription, DNA-templated7.45e-05Regulation of proteasomal ubiquitin-dependent protein catabolic process7.40e-04
Transcription-coupled nucleotide-excision repair8.76e-05Pyruvate metabolic process8.84e-04
DNA repair1.18e-04Positive regulation of smooth muscle cell apoptotic process1.10e-03
Nucleotide-excision repair1.25e-04Negative regulation of response to biotic stimulus1.48e-03
Proteasome-mediated ubiquitin-dependent protein catabolic process1.56e-04Regulation of transcription from RNA polymerase ii promoter1.61e-03
Mitochondrial gene expression1.62e-04DNA repair1.73e-03
Proteasomal protein catabolic process2.54e-04Negative regulation of protein phosphorylation1.84e-03
RNA processing2.84e-04Toll-like receptor 3 signaling pathway1.88e-03
Positive regulation of DNA biosynthetic process5.41e-04Negative regulation of cyclin-dependent protein kinase activity1.96e-03
Gene expression6.40e-04Negative regulation of peptidyl-threonine phosphorylation2.21e-03
TRNA aminoacylation for protein translation7.50e-04Regulation of defense response to virus by virus2.43e-03
Positive regulation of chromatin silencing9.33e-04Protein polyubiquitination2.49e-03
Regulation of endodeoxyribonuclease activity9.33e-04Hexose biosynthetic process2.50e-03
Postreplication repair9.45e-04Response to laminar fluid shear stress2.96e-03
Ubiquitin-dependent protein catabolic process1.03e-03G1 DNA damage checkpoint2.96e-03
The significantly enriched GO biological process terms of the aging- and CD4+ T cell proportion-derived differential genes (top 20). Furthermore, we performed the disease association analysis of the aging- and immune-related gene sets. Starting from the disease-gene associations of the ClinVar database (25), we found that the differential genes of aging were significantly associated with 12 diseases, and the differential genes of the immune system were significantly associated with 11 diseases (P-value ≤ 0.05). Interestingly, both are significantly associated with bowel cancer-related diseases, including colon carcinoma, hereditary nephrotic syndrome, and familial CRC (). These results indicated that aging and CD4+ T cell proportion may play important roles in the development of CRC.
Table 4

The significantly enriched ClinVar diseases of the aging- and CD4+ T cell proportion-derived differential genes.

AgingCD4+ T cell proportion
DiseaseP-valueDiseaseP-value
Hyperphosphatasia-intellectual disability syndrome2.37e-03Hyperphosphatasia-intellectual disability syndrome5.05e-03
Diabetes mellitus type 23.68e-03Severe congenital neutropenia8.40e-03
Familial hyperinsulinism9.11e-03Neoplasm of stomach1.28e-02
Neoplasm of stomach9.11e-03Familial colorectal cancer2.31e-02
Congenital central hypoventilation1.36e-02Combined oxidative phosphorylation deficiency3.39e-02
Carcinoma of colon1.99e-02Adult junctional epidermolysis bullosa3.87e-02
Beckwith–Wiedemann syndrome2.49e-02Epidermolysis bullosa, junctional3.87e-02
Hereditary nephrotic syndrome2.49e-02Familial adenomatous polyposis 13.87e-02
Hirschsprung disease2.49e-02Galloway–Mowat syndrome3.87e-02
Combined oxidative phosphorylation deficiency2.51e-02Acute myeloid leukemia4.02e-02
Neonatal diabetes mellitus3.97e-02Carcinoma of colon4.60e-02
Familial colorectal cancer4.17e-02
The significantly enriched ClinVar diseases of the aging- and CD4+ T cell proportion-derived differential genes.

E3 Genes Associated With Aging

In the above results, we not only proved that the aging immune system is related to the onset of CRC but also further verified that ubiquitinated genes play an important role in the aging immune system. Therefore, we conducted further analyses on the 233 E3 ubiquitin ligase genes. A total of 103 of the 233 E3 genes have cg probe annotations on the Illumina Infinium 450K Human Methylation Beadchip. Then, we obtained the SIMPO scores of 42 E3 genes. The methylation SIMPO scores of 23 E3 genes were significantly associated with the patient ages based on the adjusted T-test, and the E3 gene DZIP3 ranked sixth ( and ) ().
Figure 2

(A) The E3 gene list that is significantly associated with aging (top 10). (B) The E3 gene list that is significantly associated with CD4+ T cell proportion (top 10). (C) The intersecting genes that are significantly associated with both aging (top 10) and CD4+ T cell proportion (top 10).

Figure 3

(A) DZIP3 is significantly associated with aging. (B) DZIP3 is significantly associated with the CD4+ T cell proportion.

(A) The E3 gene list that is significantly associated with aging (top 10). (B) The E3 gene list that is significantly associated with CD4+ T cell proportion (top 10). (C) The intersecting genes that are significantly associated with both aging (top 10) and CD4+ T cell proportion (top 10). (A) DZIP3 is significantly associated with aging. (B) DZIP3 is significantly associated with the CD4+ T cell proportion.

E3 Genes Associated With Immune Cell Proportions

Then, we obtained the SIMPO scores of 38 E3 genes based on the GSE69270 dataset. We calculated the Pearson correlations between the SIMPO scores of the 38 E3 ubiquitin ligases with immune cell proportions (including CD8+ T and CD4+ T cells, monocytes, granulocytes, and NK and B cells) and the sum of CD8+ T and CD4+ T cells. The Pearson correlation results showed that 15 E3 genes were significantly associated with the CD4+ T cell proportion ( and ), including the E3 gene DZIP3 (). On the other hand, according to annotations provided by the GeneCards database (https://www.genecards.org/cgi-bin/carddisp.pl?gene=DZIP3), the DZIP3-encoded protein is significantly overexpressed in CD4+ T cells in the blood. It is worth noting that there were two genes (UBR1 and DZIP3) that appeared in the top ten genes related to aging and CD4+ T cell proportion (). Through literature searches in NCBI PubMed (https://www.ncbi.nlm.nih.gov/pubmed), it has been found that hundreds of published studies have demonstrated the associations of UBR1 with cancers. Interestingly, however, no article has reported the association of DZIP3 with colon cancer or even cancer. Considering the novelty of this gene, we conducted further clinical validation on DZIP3. In addition, according to the annotations in the Reactome pathway database (http://www.reactome.org/), DZIP3 is involved in multiple immune-related pathways, including adaptive immune system, antigen processing- ubiquitination and proteasome degradation, class I MHC mediated antigen processing and presentation and immune system. Due to the strong association between immunity and cancer, we speculated that DZIP3 may use the above immune-related pathways as the bridge to affect CRC.

Survival Analysis Results

The survival analysis of 293 COAD patients showed that the SIMPO scores of the three genes (DZIP3, RNF26, and UBR3) were significantly associated with the patient’s survival times. Among them, the most significant associated gene is DZIP3, which Hazard Ratio (HR) is 40.91 and P-value of Cox regression is 1.62e-02 ( and ). This result demonstrated that compared to aging-related and CD4+ T cell proportion-derived gene UBR1, DZIP3 may be more biologically related to CRC, which deserves further study.
Table 5

The survival analysis results of COAD patients for E3 genes.

Gene nameHR [exp(coef)]coef95% CI lower95% CI upperP-value
DZIP340.913.710.696.741.62e-02
RNF260.15−1.89−3.60−0.192.95e-02
UBR30.17−1.78−3.46−0.113.66e-02
Figure 4

The survival analysis results of COAD patients for DZIP3.

The survival analysis results of COAD patients for E3 genes. The survival analysis results of COAD patients for DZIP3.

Methylation Variations in DZIP3 Amplicon-cg14787155

The clinicopathological characteristics of CRC patients are presented in . By MassARRAY analysis, we examined the methylation status of 25 CpG sites, and 15 methylation units were effectively detected in the majority of the samples for . For each unit, we kept all the effective detected samples and tested the significance of methylation change in Ctrl vs. I and II and Ctrl vs. III. We found units showing significantly decreased methylation in CRC patients compared to normal controls (P-value ≤ 0.05, T-test). Specifically, we found that significantly changed methylation units were different between genders (, ).
Table 6

Clinicopathological characteristics of the CRC patients (n = 150).

VariablePatients (n = 100)Control (n = 50)
Male/Female67/3318/32
Mean Age62.0765.74
Age range27–6816–86
TNM stageI/II (53); III (47)
Figure 5

DNA methylation analysis of amplicon-cg14787155 in DZIP3. (A) Sex-specific methylation level change between colorectal cancer and control in amplicon-cg14787155. (B) ROC analysis of the early-pTNM-stage colorectal cancer prediction model based on the MassARRAY data of amplicon-cg14787155 features. The model with the best performance had an AUC of 0.833 and an OR of 8.82. (C) ROC analysis of the all-pTNM-stage colorectal cancer prediction model based on the MassARRAY data of amplicon-cg14787155 features. The model with the best performance had an AUC of 0.782 and an OR of 5.7.

Clinicopathological characteristics of the CRC patients (n = 150). DNA methylation analysis of amplicon-cg14787155 in DZIP3. (A) Sex-specific methylation level change between colorectal cancer and control in amplicon-cg14787155. (B) ROC analysis of the early-pTNM-stage colorectal cancer prediction model based on the MassARRAY data of amplicon-cg14787155 features. The model with the best performance had an AUC of 0.833 and an OR of 8.82. (C) ROC analysis of the all-pTNM-stage colorectal cancer prediction model based on the MassARRAY data of amplicon-cg14787155 features. The model with the best performance had an AUC of 0.782 and an OR of 5.7.

Colorectal Cancer Prediction Model Based on Amplicon-cg14787155

With EpiDesigner software, we obtained the sequence of the MassARRAY primer relative to the amplicon-cg14787155 region (). Based on the effective detected methylation data of 15 methylation units, we constructed a prediction model to classify CRC individuals from normal controls. As showed above, sex would be an important factor in predicting cancer state, and we also set sex as an input factor in the prediction model.
Table 7

Sequence of MassARRAY primer and position relative to .

AmpliconPrimerSequence (5′->3′)
amplicon-cg1478715510FaggaagagagGGAAGTTTTTAGGTATTTTAGGGGAT
T7RcagtaatacgactcactatagggagaaggctTAAAACCCAAAATTCTTCTCCTCA
Sequence of MassARRAY primer and position relative to . A data set of 150 samples (100 CRC in stage I/II/III and 50 normal controls) with effective detected values in all 15 methylation units was kept for the model construction. The data were analyzed in R (v3.6) environment, the caret (v6.0-84) and ROCR (v1.0-7) packages were used to construct the logistic regression model, which is essentially used to predict the probability of a binary (Cancer/Normal) event occurring. Logistic regression yields good performance to predict cancer outcome broadly in clinical cancer prediction studies. In this study, we collected the clinical CRC/normal sample information and chose the well-accepted logistic regression model to ensure the result could reflect the real clinical conditions. And we use leave-one-out cross-validation (LOOCV) to validate the model performance. First, we constructed the early-stage CRC prediction model in a dataset of 103 samples (53 CRC in pTNM stage I/II and 50 normal controls). In model construction, we set clinical “CRC/normal” outcomes as 0/1, gender information “male/female” as 1/0. We use gender information and all methylation levels of 15 effectively detected units in amplicon-cg14787155 as inputs. R package caret was used to perform the model construction. We applied backward stepwise feature selection method, which begins with model containing all inputs and then iteratively removes the least useful input, one-at-a-time. Results showed that a model with the optimized input CpG sites (CpG_8, CpG_12, CpG_17, CpG_14.15, CpG_16, CpG_18, CpG_3.4, CpG_1.2, CpG_11, CpG_6) and sex showed the best performance (AUC = 0.833 and OR = 8.82) (). Additionally, we constructed the all-stage CRC prediction model in the full data set. The model construction process was the same as early-stage CRC prediction model. A model with the optimized input CpG sites (CpG_12, CpG_18, CpG_1.2, CpG_6, CpG_11, CpG_3.4) and sex showed the best performance (AUC = 0.782 and OR = 5.7) (). The results showed that the methylation state of the amplicon-cg14787155 region has specific features in predicting early-stage and all-stage CRC.

Discussion

Ubiquitination is one of the main types of covalent modification of intracellular proteins. It participates in almost all life activities, such as cell survival, cell differentiation, and innate and acquired immunity (30–32). The abnormalities of ubiquitination are closely related to the occurrence of various tumors (33–37). It is worth noting that, consistent with the findings of this study, E3 ligase-mediated ubiquitination plays an important role in the development of CRC (38, 39). The in-depth study of ubiquitination will open up a new path for the study of CRC pathogenesis, diagnosis and treatment. The results show that ubiquitination is an important regulatory mechanism of the surface proteins of immune cells and plays an important regulatory role in aging and changes in the composition of immune cells. The next step is to focus on whether DZIP3 plays an important role in the development of other types of cancer and on ubiquitinated pathway-modified proteins. In this study, starting with E3 ubiquitin ligase genes, we screened early markers of CRC related to immune system aging and immune escape. The results showed that the ubiquitination regulatory gene DZIP3 was significantly correlated with age and changes in immune cell components. In addition, previous studies have shown that genes originating in different stages have different biological functions, which are closely related to human phenotypes (40–45). Therefore, the evolutionary information of genes often implies important clues to the analysis of human disease mechanisms (40, 41, 44). For example, a study had shown that the gene network composed of eumetazoa-originated genes can regulate the hallmarks of cancer (44), reflecting the potential associations of eumetazoa-originated genes with cancer. Interestingly, through Liebeskind’s research on the origin of human genes (46), DZIP3 originated in the eumetazoa stage, suggesting that DZIP3 may have an influence in the progression of cancer to a certain extent. Furthermore, the survival analysis for 293 COAD patients showed that DNA methylation value of DZIP3 is significantly associated with the patient’s survival times. Considering the association between DZIP3 and cancer, we designed a low-cost nucleic acid mass spectrometry kit, which was validated by recruiting patients with early CRC. At present, the US FDA-approved CRC screening methods mainly include colonoscopy, fecal DNA testing, the fecal occult blood test (FOBT), the conventional blood carcinoembryonic antigen (CEA) indicator, and the circulating methylated SEPT9 test. However, the above technologies still have certain limitations. For example, the early preparation of colonoscopy is cumbersome, invasive and subject to the subjective influence of the operator’s skills and clinical experience. Based on circulating methylated SEPT9 DNA for detecting CRC in a study by Church et al., the methylation of SEPT9 DNA had a sensitivity of 35.0% for patients with stage I CRC, 63% for stage II, 46.0% for stage III, and 77.4% for stage IV, with a specificity of 91.5% (47). In recent years, peripheral blood circulating tumor cells (CTCs) have made great progress, and this technique is convenient for blood collection, with high public acceptance. However, there are still many problems in the application of CTCs. For example, positive results of CTCs still require confirmation by colonoscopy, and the appropriate testing interval of CTCs should be further explored. However, this study used the DNA methylation characteristics of DZIP3 as early screening markers of early-stage CRC (sensitivity = 0.760, specificity = 0.774) and all-stage CRC (sensitivity = 0.720, specificity = 0.720). In addition, this screening technique only needs to extract the whole blood of individuals for MassARRAY analysis; the samples are easy to obtain, and the results are objective. Therefore, this method may have important applications in clinical CRC screening. we concluded that DZIP3 has the potential to become an effective biomarker of CRC, which is worthy of experimental verification in vivo and in vitro in the future. Another interesting feature of the present study suggests a novel early cancer screening approach based on blood cells. This blood cell methylation analysis mainly reflects the status of the immune system rather than cancer cells. Traditionally, liquid biopsy is focused on DNA mutation or DNA methylation tests on circulating free DNAs, which are mainly released by cancerous cells. Our results confirmed that aging and immune escape are important links in the development of intestinal cancer. Considering that a certain frequency of gene mutations will occur in healthy people and healthy tissues, the speed-limiting step of cancer occurrence, that is, the bottleneck, is not gene mutation but changes in the immune system. As cancer cells and immune systems reflect the yin-yang aspects of tumorigenesis, our results raise a new option for detecting the early and rate-limiting steps of tumorigenesis. Considering that blood cell methylation analysis could be conducted in healthy individuals, it is possible that our method could outline a continuous curve of healthy, inflammation and transformation processes from enteritis to cancer. Thus, our method holds specific value for use in health management compared to the traditional liquid biopsy method, in which strong signals will appear only after enough cancer cells have been produced.

Pre-print statement

We have made a pre-print version of this manuscript available on the bioRxiv (The bioRxiv 2020.01.13.905299; doi: https://doi.org/10.1101/2020.01.13.905299).

Data Availability Statement

Publicly available datasets were analyzed in this study, these can be found in the NCBI Gene Expression Omnibus (GSE40279 and GSE69270). The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

The studies involving human participants were reviewed and approved by the institutional ethics committee of the Guangdong Provincial People’s Hospital and FoShan New RongQi Hospital (No.GDREC2016203H(R1)). The patients/participants provided their written informed consent to participate in this study.

Author Contributions

YQ and FL conducted the data mining and bioinformatics analyses. DW, XY, ZL, and QY performed the clinical sample and data collection. YL and ZN were responsible for the clinical trial design and organization. ZN and LL performed part of the clinical data mining. YZ, YC, and AW took part in the lab experimental design. ZH, DT, and BH took part in designing the clinical validation. RX helped in preparing the manuscript. JX led the epigenetic research and designed the strategy for the integrated analysis of DNA methylation data. YQ, FL, and JX wrote the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This research was partly funded by grants from the Shenzhen Science & Technology Program (CKFW2016082915204709, JCYJ20151029154245758, CYZZ20160530183500723) and the Science and Technology Planning Project of Guangdong Province, China (No. 2016A020215128).

Conflict of Interest

ZN and LL were employed by the company Shenzhen Taikontek Health technology Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  44 in total

Review 1.  Potential role of immunosenescence in cancer development.

Authors:  Tamas Fulop; Rami Kotb; Carl F Fortin; Graham Pawelec; Flavia de Angelis; Anis Larbi
Journal:  Ann N Y Acad Sci       Date:  2010-06       Impact factor: 5.691

2.  Colorectal cancer statistics, 2020.

Authors:  Rebecca L Siegel; Kimberly D Miller; Ann Goding Sauer; Stacey A Fedewa; Lynn F Butterly; Joseph C Anderson; Andrea Cercek; Robert A Smith; Ahmedin Jemal
Journal:  CA Cancer J Clin       Date:  2020-03-05       Impact factor: 508.702

Review 3.  The roles of Pellino E3 ubiquitin ligases in immunity.

Authors:  Paul N Moynagh
Journal:  Nat Rev Immunol       Date:  2014-01-21       Impact factor: 53.106

4.  Colorectal Cancer in Inflammatory Bowel Diseases: A Population-Based Study in Utah.

Authors:  N Jewel Samadder; John F Valentine; Stephen Guthery; Harminder Singh; Charles N Bernstein; Yuan Wan; Jathine Wong; Kenneth Boucher; Lisa Pappas; Kerry Rowe; Mary Bronner; Cornelia M Ulrich; Randall W Burt; Karen Curtin; Ken R Smith
Journal:  Dig Dis Sci       Date:  2017-01-03       Impact factor: 3.199

5.  Enrichr: a comprehensive gene set enrichment analysis web server 2016 update.

Authors:  Maxim V Kuleshov; Matthew R Jones; Andrew D Rouillard; Nicolas F Fernandez; Qiaonan Duan; Zichen Wang; Simon Koplev; Sherry L Jenkins; Kathleen M Jagodnik; Alexander Lachmann; Michael G McDermott; Caroline D Monteiro; Gregory W Gundersen; Avi Ma'ayan
Journal:  Nucleic Acids Res       Date:  2016-05-03       Impact factor: 16.971

6.  Regulation of androgen receptor transcriptional activity and specificity by RNF6-induced ubiquitination.

Authors:  Kexin Xu; Hermela Shimelis; Douglas E Linn; Richeng Jiang; Xi Yang; Feng Sun; Zhiyong Guo; Hege Chen; Wei Li; Hegang Chen; Xiangtian Kong; Jonathan Melamed; Shengyun Fang; Zhen Xiao; Timothy D Veenstra; Yun Qiu
Journal:  Cancer Cell       Date:  2009-04-07       Impact factor: 31.743

Review 7.  Hallmarks of cancer: the next generation.

Authors:  Douglas Hanahan; Robert A Weinberg
Journal:  Cell       Date:  2011-03-04       Impact factor: 41.582

Review 8.  Ubiquitin signaling in immune responses.

Authors:  Hongbo Hu; Shao-Cong Sun
Journal:  Cell Res       Date:  2016-03-25       Impact factor: 25.617

9.  Evolutionary Origins of Cancer Driver Genes and Implications for Cancer Prognosis.

Authors:  Xin-Yi Chu; Ling-Han Jiang; Xiong-Hui Zhou; Ze-Jia Cui; Hong-Yu Zhang
Journal:  Genes (Basel)       Date:  2017-07-14       Impact factor: 4.096

10.  The signature of liver cancer in immune cells DNA methylation.

Authors:  Yonghong Zhang; Sophie Petropoulos; Jinhua Liu; David Cheishvili; Rudy Zhou; Sergiy Dymov; Kang Li; Ning Li; Moshe Szyf
Journal:  Clin Epigenetics       Date:  2018-01-18       Impact factor: 6.551

View more
  4 in total

1.  Mining the Selective Remodeling of DNA Methylation in Promoter Regions to Identify Robust Gene-Level Associations With Phenotype.

Authors:  Yuan Quan; Fengji Liang; Si-Min Deng; Yuexing Zhu; Ying Chen; Jianghui Xiong
Journal:  Front Mol Biosci       Date:  2021-03-26

2.  NEDD4L suppresses PD-L1 expression and enhances anti-tumor immune response in A549 cells.

Authors:  Bin Zhong; Jie Zheng; Huilan Wen; Xinhui Liao; Xingxiang Chen; Yunwei Rao; Ping Yuan
Journal:  Genes Genomics       Date:  2022-03-30       Impact factor: 2.164

3.  Obesity-Associated Differentially Methylated Regions in Colon Cancer.

Authors:  John J Milner; Zhao-Feng Chen; James Grayson; Shyang-Yun Pamela Koong Shiao
Journal:  J Pers Med       Date:  2022-04-20

4.  DNA Methylation Markers and Prediction Model for Depression and Their Contribution for Breast Cancer Risk.

Authors:  Ning Wang; Jing Sun; Tao Pang; Haohao Zheng; Fengji Liang; Xiayue He; Danian Tang; Tao Yu; Jianghui Xiong; Suhua Chang
Journal:  Front Mol Neurosci       Date:  2022-02-23       Impact factor: 5.639

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.