Literature DB >> 25997610

Identification of diagnostic markers in colorectal cancer via integrative epigenomics and genomics data.

Teow Kok-Sin1, Norfilza Mohd Mokhtar1, Nur Zarina Ali Hassan1, Ismail Sagap2, Isa Mohamed Rose3, Roslan Harun4, Rahman Jamal1.   

Abstract

Apart from genetic mutations, epigenetic alteration is a common phenomenon that contributes to neoplastic transformation in colorectal cancer. Transcriptional silencing of tumor-suppressor genes without changes in the DNA sequence is explained by the existence of promoter hypermethylation. To test this hypothesis, we integrated the epigenome and transcriptome data from a similar set of colorectal tissue samples. Methylation profiling was performed using the Illumina InfiniumHumanMethylation27 BeadChip on 55 paired cancer and adjacent normal epithelial cells. Fifteen of the 55 paired tissues were used for gene expression profiling using the Affymetrix GeneChip Human Gene 1.0 ST array. Validation was carried out on 150 colorectal tissues using the methylation-specific multiplex ligation-dependent probe amplification (MS-MLPA) technique. PCA and supervised hierarchical clustering in the two microarray datasets showed good separation between cancer and normal samples. Significant genes from the two analyses were obtained based on a ≥2-fold change and a false discovery rate (FDR) p-value of <0.05. We identified 1,081 differentially hypermethylated CpG sites and 36 hypomethylated CpG sites. We also found 709 upregulated and 699 downregulated genes from the gene expression profiling. A comparison of the two datasets revealed 32 overlapping genes with 27 being hypermethylated with downregulated expression and 4 hypermethylated with upregulated expression. One gene was found to be hypomethylated and downregulated. The most enriched molecular pathway identified was cell adhesion molecules that involved 4 overlapped genes, JAM2, NCAM1, ITGA8 and CNTN1. In the present study, we successfully identified a group of genes that showed methylation and gene expression changes in well-defined colorectal cancer tissues with high purity. The integrated analysis gives additional insight regarding the regulation of colorectal cancer-associated genes and their underlying mechanisms that contribute to colorectal carcinogenesis.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25997610      PMCID: PMC4484611          DOI: 10.3892/or.2015.3993

Source DB:  PubMed          Journal:  Oncol Rep        ISSN: 1021-335X            Impact factor:   3.906


Introduction

In recent decades, the incidence of colorectal cancer (CRC) has increased by 2- to 4-fold in many Eastern Asia countries such as China, Japan, South Korea and Singapore (1,2). The high risk of CRC among the Asian population, including Malaysia, is associated with a low fiber diet and high tobacco consumption (3). One of the screening methods to detect early stage of CRC is by measuring the level of carcinoembryonic antigen (CEA) in the serum, however, the sensitivity of the marker was reported to be <80% (4–6). Therefore, the identification of new biological markers for CRC is crucial. Epigenetic markers such as methylation markers in CRC were first reported 10 years ago in DNA from stool and blood samples (7). DNA methylation is an epigenetic mechanisms that involves the enzymatic process of adding the methyl group to the 5-carbon position of the cytosine to form 5-meth-ylcytosine (8–10). This modification mostly occurs in the CG enriched site known as CpG islands (CGI), which are present in 70% of the annotated gene promoter regions (11). Two typical DNA methylation patterns, global hypomethylation and CGI hypermethylation, have emerged as potential signatures in the cancer genome (12). Hypermethylation is generally found in the promoter CGI region, whereas global hypomethylation frequently occurs in CpG dinucleotides that are located in the repetitive sequences of DNA (satellite repeats or retrotransposon) (13). CGI hypermethylation in the promoter region is thought to be linked with the transcriptional inactivation of tumor-suppressor genes (14,15). This is mediated through the methyl-CpG binding proteins (MBDs) resulting in a compacted chromatin conformation (15,16). The compacted chromatin hinders the accessibility of the transcriptional machinery from binding to the promoter region, thereby leading to the repression of gene expression (17). The inverse relationship between DNA methylation and transcript level was reported in a study involving chromosomes 6, 20 and 22 in 43 healthy human tissue and primary cells (18). The study revealed that one-third of the differentially methylated genes (representing 17% of the 873 analyzed genes) were found to be inversely associated with their transcript levels (18). Apart from the conventional polymerase chain reaction (PCR) method, the microarray chip-based study is a widely used approach in exploring methylation markers in CRC (14,19,20). In a study using the Illumina GoldenGate® methylation array on 28 normal mucosa and 91 CRC samples, 202 CpG sites with 90 hypermethylated and 42 hypomethylated loci that involved 132 genes were identified (21). Using the level of CpG island methylator phenotype (CIMP), CRC was divided into three different subgroups (21). A more recent study identified 169 hypermethylated loci and validated 11 of these loci that could be distinguished between CRC and non-neoplastic colonic mucosa (22). Among the genes were dedicator of cytokinesis 8 (DOCK8), visual system homeobox 2 (VSX2), microRNA 34b (miR-34b), glucagon-like peptide 1 receptor (GLP1R1), B-cell translocation gene 4 (BTG4), BEN domain containing 4 (BEND4), neuronal pentraxin I (NPTX1), ALX homeobox 3 (ALX3), zinc finger protein 583 (ZNF583), homer homolog 2 (HOMER2) and gap junction protein, gamma 1 (GJC1) (22). Hypermethylated markers identified previously in CRC using a single array profile cannot reflect completely the complexity of the disease. To address this issue, epigenomic and genomic data from microarray analyses have been used to obtain a more comprehensive insight of the molecular mechanisms involved in CRC (17,23–25). The Cancer Genome Atlas study revealed a comprehensive molecular image of CRC by integrating data between promoter methylation, DNA copy number, exome sequencing, microRNA expression and messenger RNA expression in 224 CRC and normal samples (26). Apart from identifying gene mutations involved in the well-established signaling pathways, the authors of that study also documented the new critical role of MYC in directing the transcriptional activation and repression in CRC. Additional findings included repetitive mutations in APC membrane recruitment protein 1 (FAM123B) and AT rich interactive domain 1A (ARID1A) as well as a novel mutation in SRY (sex determining region Y)-box 9 (SOX9) (26). Studies using a similar integrative approach of analyses on Asian patients are lacking. Therefore, the aim of the present study was to investigate the biological complexity of CRC by integrating DNA methylation and gene expression profiling signatures using paired samples from local patients. The integrated signatures may later serve as potential diagnostic markers for the prognostication of CRC. We also hypothesized that CpG hypermethylation in the promoter region of CRC-associated genes led to a low expression of the respective genes.

Materials and methods

Clinical samples

A total of 55 paired colorectal carcinoma and their corresponding adjacent normal epithelial cells (10 cm away from the tumor) were collected at surgery from patients at the Universiti Kebangsaan Malaysia Medical Centre, Kuala Lumpur, Malaysia. Patients provided written informed consent to participate in the present study. This study was approved by the UKM Research Ethics Committee (Reference no: UKM 1.5.3.5/244/UMBI-004-2012).

Sample preparation

The clinical tissues were snap-frozen in liquid nitrogen prior to sectioning. Tissue samples were sectioned into 5–7 μm thickness using a cryostat (Microtome Cryostat HM550; Microm International GmbH, Walldorf, Germany). The sections were stained with hematoxylin and eosin (H&E) and were examined by the histopathologist. Tissue samples were considered representative when >80% malignant cells were present. Normal cells also contained >80% normal epithelial cells and were free from malignant or inflammatory cells. Patients with chemotherapy or radiotherapy prior to surgery were excluded from the study.

DNA methylation profiling assay

Genomic DNA was isolated using the Qiagen DNeasy Tissue kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions. The quantity and purity of DNA were quantified using the NanoDrop ND-1000 spectrophotometer (Thermo Fisher Scientific, Leicester, UK). Gel electrophoresis was used to check the integrity of the DNA. Only samples with good purity were included in the study. Methylation profiling was performed in 110 samples using the HumanMethylation27 Beadchip to analyze 27,578 CpG sites covering 14,495 genes. Bisulphite conversion was carried out in the methylation assay using the EZ DNA Methylation-Gold kit (Zymo Research, Irvine, CA, USA). The microarray study was carried out according to the Infinium II Methylation Assay manual protocol. All the chips were scanned on a single BeadArray reader to avoid bias.

Gene expression profiling assay

Total RNA from 15 paired representative tissues were extracted using the Qiagen QIAampMini Plus kit (Qiagen) according to the manufacturer’s instructions. The quantity and purity of RNA were quantified using the NanoDrop ND-1000 spectrophotometer (Thermo Fisher Scientific). The integrity of RNA was measured using the Agilent Bioanalyzer 2100 (Agilent Technologies, Santa Clara, CA, USA). Only samples with an OD 260/280 of 1.8–2.1 and RNA integrity number (RIN) ≥6.0 were included in the gene expression study. The GeneChip Human Gene 1.0 ST array (Affymetrix, Santa Clara, CA, USA) that has 764,885 distinct probes covering 28,869 well-annotated genes was used. The assay was carried out using the Affymetrix expression protocol. The microarray chips were scanned using the GeneChip Scanner 3000 7G (Affymetrix). The Affymetrix® Genotyping Console™ (Affymetrix) was used to extract the expression data and Partek Genomic Suite 6.6 (version 6.12.0713) (Partek Inc., St. Louis, MO, USA) was utilized for subsequent analysis.

Statistical analysis of DNA methylation profiling

The control panel from Illumina BeadStudio software 2011 (version 1.9.0) (Illumina, San Diego, CA, USA) was used to determine the quality of our methylation microarray assay. β-value was used to determine the methylation status of each sample. This value was derived from each locus from the microarray and range from the lowest methylation value (β=0) to the highest methylation value (β=1). β-value was generated as the intensity of the methylated probe/total of the intensity of the methylated probe and the intensity of the unmethylated probe. The generated β-value was exported to the Partek Genomic Suite 6.6 (version 6.12.0713) (Partek) for subsequent analysis. PCA mapping was used to determine the quality of the samples. Three-way analysis of variance (ANOVA) with ≥2-fold change and P<0.05 with FDR were used to compare the differential methylated CpG loci between the cancer and normal groups. To remove the batch effect, the scanned date of the chips was controlled. A Gene Ontology enrichment analysis was carried out to investigate whether the genes found to be differentially methylated could be classified into a Gene Ontology category more often than expected by chance. A functional group with a high enrichment score was considered the leading group.

Statistical analysis of genome-wide gene expression profiling

Differential gene expression analysis was carried out using the Partek Genomic Suite 6.6 software with three-way ANOVA analysis. The expression data were normalized using quantile normalization and robust multi-array analysis (RMA) background correction. The differentially expressed genes were reported to be significant in the cancer group when the fold-change was >2.0 and P-value with FDR was <0.05. The batch effect was removed as a source of variation.

Sub-analysis of methylation and expression profile

The analysis was performed using the Partek Genomic Suite 6.6 software. We used the Pearson (linear) correlation to determine the correlation of our data. The correlated genes were considered significant at P<0.05.

Integration of methylation and expression profile statistical analysis

Overlapped genes were identified based on significant gene symbols from the methylation and expression datasets. Datasets were imported into the MySQL relational database for downstream data analysis. The MySQL database allows rapid and accurate data filtering across different datasets. The datasets were compared in the pair-wise manner. Unique gene symbols identified between the overlapping comparisons were used in downstream analysis. Along with the unique gene list, methylation and expression values were extracted from the datasets for chromosome mapping, circular map generation and KEGG Pathway mapping. Mapping of the integrated gene list allowed visualization of overlapping genes on chromosome map overview, circular map overview and also KEGG pathway maps.

Validation of genes using MS-MLPA

A total of four significant genes (SFRP2, BTG4, APC and GPX7) were selected from the DNA methylation profile for validation purpose. Validation was carried out using the MS-MLPA and followed the manufacturer’s instructions. The primers were designed and customized following the guidelines from MRC Holland (MRC-Holland, Amsterdam, The Netherlands). The amplification PCR product was carried out by using the 3500 Genetic Analyzer (Applied Biosystems, Foster City, CA, USA). The electrophoresis result was assessed by using the Coffalyser software version 1.0.0.43 (MRC-Holland). For the analyses, dataset of MS-MLPA was analyzed by the method described in a previous study (27). To quantify the methylation status of each of the genes, the probe relative peak area ratio of the digested sample was compared with that of the undigested samples. The digested sample with the probes of a relative peak area ratio of ≥0.25 was defined as methylated.

Results

Principal component analysis and sources of variation determine the quality of the microarray studies

Principal component analysis (PCA) showed that the normal group (indicated by red color) was clustered distinctly from the tumor group (indicated by blue color) from the profiling data (Fig. 1). For the methylation study, one normal and one tumor sample were removed since the samples were classified in the opposite group. By applying three-way ANOVA, sources of variation were generated. The ’Sentrix barcode’ factor (for methylation study) and ‘Scan date’ (for expression study) factor were removed to avoid the batch effects.
Figure 1

PCA mapping of (A) methylation and (B) expression profiling. Normal group (indicated by red color) was clustered distinctly from tumor group (indicated by blue color).

Genome-wide DNA methylation profiling of CRC in matched samples

The methylation profiling involved 110 CRC samples together with their neighboring non-cancerous colonic cells. The mean age for all patients was 60.4±12.83 years (Table I). The methylation chip analyses revealed a total of 27,578 CpG loci, covering 14,495 consensus coding sequences with an average of 1.9 CpG loci per sequence. We detected for each sample, on average, 27,506.64 loci at P<0.05 and 27,444.05 loci at P<0.01. A locus was considered to be detected at the two cut-off levels when the mean signal intensity from multiple probes for a particular CpG locus was significantly higher than the negative control on the same chip.
Table I

Distribution of clinicopathological characteristics of 55 paired matched samples.

CharacteristicsNo.(%)
Gender
 Male2240.00
 Female3360.00
Age (years)
 <50916.36
 >504683.64
Ethnicity
 Malay2749.09
 Chinese2545.46
 India35.45
Duke’s staging
 A47.27
 B3156.36
 C2036.36
Differentiation
 Well differentiated3156.36
 Moderately differentiated2138.18
 Poorly differentiated35.46
Location
 Right1323.64
 Left4276.36
A total of 1,123 loci (845 genes) were found to be differentially methylated in CRC compared with the normal colonic epithelial samples. These loci were further classified into 1,081 hypermethylated loci (804 genes), 36 hypomethylated loci (36 genes) and 6 sex-chromosome methylated loci (5 genes). Sex-chromosomes loci were eliminated for the subsequent analysis to avoid bias to the present study. This is due to the methylation involved in the X-inactivation process that silences one out of the two copies of X chromosome in female in order to compensate for the gene dosage effect. Supervised hierarchical clustering of the significant differentially methylated loci in our study is shown in Fig. 2A. The top 10 highly significant hypermethylated loci were protein kinase, cAMP-dependent, regulatory, type I, β (PRKAR1B), cannabinoid receptor interacting protein 1 (C2orf32), zinc finger protein 542 (ZNF542), KH domain containing, RNA binding, signal transduction-associated 3 (KHDRBS), interferon regulatory factor 4 (IRF4), tissue factor pathway inhibitor 2 (TFPI2), potassium voltage-gated channel, KQT-like subfamily, member 5 (KCNQ5), filamin-binding LIM protein 1 (FBLIM1), eyes absent homolog 4 (EYA4) and spastic paraplegia 20 (SPG20). The top 10 most significantly hypomethylated loci were Fc receptor-like 3 (FCRL3), Granzyme K (Granzyme 3; Tryptase II) (PRSS1), bactericidal/permeability-increasing protein (BPI), v-akt murine thymoma viral oncogene homolog 3 (AKT3), pipecolic acid oxidase (PIPOX), peptidase inhibitor 3 (PI3), bactericidal/permeability-increasing protein-like 3 (BPIL3), solute carrier family 26, member 4 (SLC26A4), long intergenic non-protein coding RNA 152 (MGC4677) and defensin, β 119 (DEFB119).
Figure 2

Supervised hierarchical clustering of methylation and expression profiling. (A) Supervised hierarchical clustering of methylation profile revealed 1,081 hypermethylated loci (red) and 36 hypomethylated loci (green). (B) Supervised hierarchical clustering of gene expression profile 709 genes were upregulated (red) and 699 genes were downregulated (green).

Genome wide expression profiling of CRC identifies differentially expressed genes in the same group of patients

The data of the genome wide expression profiling have been recently reported (28). Statistical analysis following normalization of data identified 1,408 differentially expressed genes in CRC compared to the normal samples. Supervised hierarchical clustering clearly showed a total of 709 genes were upregulated and 699 were downregulated genes (Fig. 2B). The top 10 most significant differentially upregulated genes were claudin 1 (CLDN1), phosphoribosyl pyrophosphate amidotransferase (PPAT), tumor protein D52-like 2 (TPD52L2), serine hydroxymethyltransferase 2 (SHMT2), cell division cycle associated 7 (CDCA7), chaperonin containing TCP1, subunit 3 (CCT3), eukaryotic translation initiation factor 2, subunit 2 β (EIF2S2), thyroid hormone receptor interactor 13 (TRIP13), inhibin, β A (INHBA) and negative elongation factor complex member C/D (TH1L). The top 10 most significant differentially downregulated genes were chromosome 2 open reading frame 88 (C2orf88), alcohol dehydrogenase 1B (ADH1B), erythrocyte membrane protein band 4.1-like 4A (EPB41L4A), transmembrane and immunoglobulin domain containing 1 (TMIGD1), serum/glucocorticoid regulated kinase 1 (SGK1), histone deacetylase 9 (HDAC9), UDP-glucose pyrophosphorylase 2 (UGP2), sodium channel, voltage-gated, type IX, α subunit (SCN9A), membrane-spanning 4-domains, subfamily A, member 12 (MS4A12) and solute carrier family 4, sodium bicarbonate cotransporter, member 4 (SLC4A4).

Methylation specific-multiple ligation probe amplification (MS-MLPA) confirmed the methylation profiling results

Four hypermethylated genes, i.e., secreted frizzled-related protein 2 (SFRP2), BTG4, adenomatous polyposis coli (APC) and glutathione peroxidase 7 (GPX7), to validate the methylation profiling data. Data analysis of MS-MLPA showed high methylation for all four validated genes in CRC compared to the normal samples (Fig. 3).
Figure 3

MS-MLPA analysis. MLPA-analysis revealed the percentage of methylation in tumor samples was higher than normal samples. The red bar on the figure shows tumor samples whereas the blue bar shows normal samples. The SFRP2 gene has the highest percentage of methylation in tumor samples with 81% followed by BTG4 (76%), GPX7 (60%) and APC (19%).

Sub-analysis of methylation and gene expression profiling of 15 pair-matched samples

We correlated the data at P<0.05 and identified 188 significant loci (covered by 136 genes) from the two profiles including 83 negatively (66 genes) and 105 positively correlated loci (70 genes). The top 5 positively correlated genes were heat shock 60 kDa protein 1 (HSPD1) (r= 0.799443) (Fig. 4A), SLC4A4 (0.77751), family with sequence similarity 5, member C (FAM5C) (0.777301), partner of NOB1 homolog (PNO1) (0.769136) and CD163 molecule-like 1 (CD163L1) (0.766757). The top 5 negatively correlated genes were myosin light chain kinase (MYLK) (−0.756185) (Fig. 4B), nuclear receptor subfamily 5, group A, member 2 (NR5A2) (−0.738724), transforming growth factor, β-induced (TGFBI) (−0.674433), von Willebrand factor A domain containing 5A (VWA5A) (−0.673862) and D-tyrosyl-tRNA deacylase 1 (DTD1) (−0.652264).
Figure 4

Correlation analysis of methylation and expression profiles. Positive and negative correlation of 15 pairs matched samples. Dark dots are normal samples whereas light dots are tumor samples. (A) Positively correlated gene HSPD1 (r=0.7994) and (B) negatively correlated gene MYKH (r=−0.7562) were shown. Tumor and normal samples are differentially methylated or expressed in both genes.

Integrated analysis reveals 32 important genes in CRC

We identified 32 significant overlapping genes from the methylation and gene expression profiles (Table II). Most of the significant genes (27 genes) have a negative association (hypermethylation and downregulation) and only 5 genes showed a positive association (hypermethylation and upregulation or hypomethylation and downregulation). We then used the Gene Ontology (GO) enrichment analysis to classify the genes into the categories of cellular component, biological process and molecular function. Under the cellular component category, the integrated genes were highly enriched in the extracellular matrix part [Enrichment score (ES), 13.60] (Fig. 5A). For the biological process and molecular function categories, significant genes were highly enriched in the wide group of detection of mechanical stimulus (ES=235.27) (Fig. 5B) and glycosaminoglycan binding (ES=11.19) (Fig. 5C), respectively. Using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database, we identified a total of 95 pathways and 11 of these pathways were associated with the CRC. These includes well-documented pathways in CRC including the cell adhesion molecule, Hedgehog signaling, phosphatidylinositol 3′-kinase (PI3K)-Akt signaling, focal adhesion, pathways in cancer, basal cell carcinoma, colorectal cancer, Wnt signaling, Janus kinase/signal transducers and activators of transcription (JAK-STAT) signaling, transcriptional misregulation in cancer and mitogen-activated protein kinase (MAPK) signaling pathway. Only one pathway was found to be enriched (P<0.05) in our data, the cell adhesion molecules (CAMs) (Fig. 6). Significant genes detected in this pathway were JAM2, NCAM1, ITGA8 and CNTN1.
Table II

Integrated gene list.

No.Gene symbolGene nameMethylation profile
Gene expression profile
Probeset IDMethylation levelFold-changeP-valueβ-valueTranscriptIDExpression levelFold-changeP-value
1SPG20Spastic paraplegia 20cg18755783Hyper4.7593.07E-370.40637970999DR−2.0911.19E-04
2JAM2Junctional adhesion molecule 2cg03382304Hyper5.5291.60E-270.40618068024DR−3.2865.40E-07
3GSTM2Glutathione S-transferase μ 2cg16670497Hyper4.8741.16E-290.36307903753DR−2.2613.00E-05
4SFRP2Secreted frizzled-related protein 2cg23207990Hyper2.5601.37E-250.30798103254DR−2.7812.61E-03
5CNTN1Contactin 1cg27352992Hyper2.5903.79E-230.26397954899DR−2.1595.56E-06
6CHL1Close homolog of L1cg00903242Hyper2.3314.06E-220.25768077270DR−2.3942.08E-06
7TMEFF2Transmembrane protein with EGF-like and two follistatin-like domains 2cg18221862Hyper3.0201.77E-210.29758057803DR−2.5203.47E-04
8ITGA8Integrin, α 8cg16902509Hyper2.0838.70E-190.26317932254DR−2.3124.31E-04
9SSTSomatostatincgl3206017Hyper2.0931.78E-170.22708092682DR−2.1833.14E-06
10SLIT2Slit homolog 2cg18972811Hyper2.4624.94E-170.26678094301DR−2.0752.92E-05
11MAMDC2MAM domain containing 2cg11656547Hyper2.4572.03E-150.26368155754DR−3.0611.32E-05
12SCNN1BSodium channel, non-voltage-gated 1, β subunitcg23113963Hyper2.1291.14E-140.19937994074DR−4.9488.53E-06
13ZNF655Zinc finger protein 655cgl3636404Hyper6.1005.59E-140.21638134631DR−2.8823.72E-06
14ADAMTS1ADAM metallopeptidase with thrombospondin type 1 motif, 1cg00472814Hyper3.0481.53E-110.23778069676DR−2.6093.22E-05
15CKBCreatine kinase, braincg05786809Hyper2.6421.73E-100.14587981427DR−3.1861.34E-04
16HHIPHedgehog interacting proteincg13749822Hyper2.0122.48E-100.17988097628DR−2.4871.99E-05
17ELMOlEngulfment and cell motility 1cg08453021Hyper6.7083.66E-100.25308139057DR−2.1176.49E-04
18RSPO3R-spondin 3cg09979256Hyper13.0151.31E-090.21028121916DR−2.1484.85E-03
19TRPA1Transient receptor potential cation channel, subfamily A, member 1cg0l610488Hyper2.6301.46E-080.21608151341DR−2.0891.56E-03
20GNAO1Guanine nucleotide binding protein (G protein), α activating activity polypeptide Ocg21530453Hyper2.8391.49E-080.13497995739DR−2.3143.25E-04
21MYH11Myosin, heavy chain 11, smooth musclecg17880199Hyper3.6201.48E-070.14707999674DR−7.6422.08E-05
22CAMK4Calcium/calmodulin-dependent protein kinase IVcg05497616Hyper7.2293.85E-060.12258107307DR−2.2772.95E-06
23RNF152Ring finger protein 152cg07980518Hyper2.3223.94E-060.08728023598DR−5.5662.46E-08
24NCAM1Neural cell adhesion molecule 1cg20268522Hyper2.5693.74E-050.10957943892DR−2.2122.10E-05
25CAV1Caveolin 1, caveolae protein, 22 kDacg22126032Hyper2.2520.00036460.05978135594DR−2.2896.45E-03
26BMP3Bone morphogenetic protein 3cgO1049530Hyper2.0100.00305890.08178096070DR−5.5832.15E-07
27GLI3GLI family zinc finger 3cg09405612Hyper2.5910.00469250.04148139212DR−2.0991.94E-04
28COL12A1Collagen, type XII, α 1cg08009622Hyper3.1182.27E-140.21188127563UR3.3672.95E-05
29MMEMembrane metallo-endopeptidasecg16580737Hyper8.7565.00E-080.19048083494UR3.5122.13E-03
30HIST1H3IHistone cluster 1, H3icgl2181621Hyper2.7370.00024270.11428124531UR2.3292.95E-03
31GRIN2BGlutamate receptor, ionotropic, N-methyl D-aspartate 2Bcgl3264741Hyper2.2080.00172830.05187961422UR2.1451.10E-04
32AKT3v-akt murine thymoma viral oncogene homolog 3 (protein kinase B, γ)cgl1314684Hypo−2.2986.96E-32−0.18087925531DR−2.2612.49E-03

Hyper, hypermethylation; Hypo, hypomethylation; DR, downregulation; UR, upregulation.

Figure 5

GO enrichment analysis of integration profile. GO enrichment analysis of 32 overlapping genes revealed the enriched (A) cellular component, (B) biological process and (C) molecular function. The number is the enrichment score. The high enrichment score shows the genes found more frequently in a particular group.

Figure 6

Cell adhesion molecules (CAMs) pathway. CAMs pathway (hsa04514) is an enriched pathway in the integration profile with P-value 0.0122. The highlighted genes were the genes found in our integration profile.

Circular map shows the chromosome distribution of three profiles

We analyzed the frequency of the methylated loci, the upregulated or downregulated genes and the integrated genes in each chromosome. Fig. 7 shows the distribution of genes for each profile. We found that the genes with hypermethylated promoters in CpG islands and also those that were differentially expressed were generally identified in all chromosomes. Chromosome 1 has the highest frequency of the hypermethylated loci (91 loci) followed by chromosome 7 (78 loci), chromosome 19 (77 loci) and chromosome 6 (73 loci). Conversely, chromosome 20 has the highest frequency of the hypomethylated loci (7 loci) followed by chromosome 1 (6 loci), chromosome 17 (4 loci) and chromosome 7 (3 loci). The integrated genes were distributed in all the chromosomes.
Figure 7

Circular map of chromosome distribution. Circular map shows the distribution of genes in each chromosome of methylation, gene expression and integration profiles. The outermost circular ring is the chromosome.

We identified four clusters of genes when we examined closely the distribution of methylated genes in each chromosome. These clusters were located in chromosome 6, 16 and 19 and belonged to specific gene families. One cluster was within the HIST1H family [histone cluster 1, H2bb (HIST1H2BB) with histone cluster 1, H3c (HIST1H3C) and histone cluster 1, H3f (HIST1H3F) with histone cluster 1, H3g (HIST1H3G), another in the FOX family (forkhead box F1 (FOXF1), fork-head box F1 (FOXC2) and forkhead box FL (FOXL1)] and two large clusters in the zinc finger (ZNF) family. For the gene expression study, 16 clusters were distributed across the chromosomes including one single cluster which encompassed >10 genes. This cluster included the metallothionein 2A (MT2A), nucleoporin 93 kDa (NUP93) and 8 other genes derived from the metallothionein 1 (MT1) family.

Discussion

The low sensitivity and specificity of the current serum-based markers in the detection and prognostication of CRC have rendered the identification of new candidates crucial. The emergence of methylation biomarkers for CRC has important implications since the methylation events are reversible. However, methylation markers derived from a single profiling are not sufficient to reveal the overall mechanism involved in CRC. To overcome this issue, the integrative approach combining different genomic profiling analyses could provide new insights into the biology of the CRC and help in identifying potential diagnostic markers. To show the validity of our results, we correlated our findings with those of a recent methylation study conducted in 2010 which involved 91 cancer and 28 normal samples (21). Our results showed a higher number of hypermethylated genes with 804 genes compared to the 132 genes reported in that study. Overlapping our data with theirs revealed 59 genes, 7 of which were in our top 50 significant gene list based on P-values (21). We then compared our data with those from a Korean study that used 22 paired colorectal cancer and adjacent normal mucosa (29). We found 16 hypermethylated genes overlapped between our data and their top 20 hypermethylation genes with 10 genes appearing on our top 50 significant gene list. These included alcohol dehydrogenase, iron containing, 1 (ADHFE1), bol, boule-like (BOLL), cut-like homeobox 2 (CUTL2), KCNQ5, protocadherin γ subfamily C, 4 (PCDHGC4), PRKAR1B, solute carrier family 6 (neutral amino acid transporter), member 15 (SLC6A15), SPG20, TFPI2 and unc-5 homolog C (UNC5C) (29). The ADHFE1, BOLL, SLC6A15 and TFPI2 genes were validated using pyrosequencing (29). Our integrative analysis correlating methylation and gene expression data revealed 27 genes with a negative association (hyper-down) and 5 with a positive association (hyper-up and hypo-down). Of the 27 hypermethylated genes with low gene expression, 18 were related to CRC including GSTM2, ZNF655, MYH11, HHIP, RSPO3, RNF152, CAV1, SFRP2, JAM2, SPG20, TMEFF2, SST, SLIT2, SCNN1B, ADAMST1, GLI3, CKB and BMP3 (19,30–38). Of these, CAV1, SFRP2, TMEFF2, SST and SLIT2 have been identified as tumor-suppressor genes (39–43). The remaining genes were associated with other cancer types. CHL1 and ITGA8 were identified as specific potential biomarkers for renal and ovarian cancer, respectively (44,45). SFRP2 is a dominant-negative inhibitor in the Wnt signaling pathway that is involved in regulating cell proliferation, migration, and differentiation (46,47). Hypermethylation of SFRP2 that resulted in the downregulation of SFRP2 was detected via methylation-specific PCR (MSP) and quantitative PCR (qPCR) in colorectal carcinoma as compared to adenoma (48). Another study reported that hypermethylation of SFRP2 in stool DNA may be a potential biomarker in the detection of CRC (31). Another gene, SPG20, encodes Spartin, which is a multifunctional protein that plays a vital role in the turnover of lipid droplet and intracellular epidermal growth factor receptor trafficking (49,50). Hypermethylation of SPG20 downregulates Spartin and may lead to cytokinesis arrest, which in turn is associated with carcinogenesis (33). A recent study showed that SPG20 has 80.2% sensitivity and 100% specificity in detecting colorectal cancer in stool samples using the MSP approach (22). We also identified the COLI2A1, MME, HIST1H3B and GRIN2B genes, which were hypermethylated with a high expression. Hypermethylation in the promoters of MME and GRIN2B were previously reported to contribute to Alzheimer’s disease and seizures, respectively (51,52). In the integrated list of genes, AKT3 was the only gene with hypomethylation and a low gene expression. These data were supported by a study on hepatocellular carcinoma in which the gene was also found to be hypomethylated (53). In a separate study, the specific knock-down of AKT3 reduced the level of phosphorylated Akt and inhibited cell growth in malignant melanoma (54). The low mRNA level of AKT3 was associated with a higher grade of malignant glioma (55). The positive association exhibited for AKT3 has suggested that there is possibly another layer of regulation. One of the possible explanations for this phenomenon is the existence and influence of long non-coding RNAs (lncRNA). Functional lncRNA may act as an activator or repressor for the expression of genes at the post-transcriptional level via mechanisms such as alternative splicing, influencing RNA polymerase binding efficiency and even by modification of epigenetic state of the gene (56–59). From the integrative analysis, we found that 11 out of 95 KEGG pathways were associated with colorectal carcinogenesis. The most enriched pathway identified in our data was cell adhesion molecules, covered by the integrated genes JAM2, NCAM1, ITGA8 and CNTN1. CAM plays a vital physical function in defining the multicellular organism’s structure and signaling as well (60,61). It is known to facilitate intercellular adherence and is able to promote cell invasion, motility and migration (62). CAMs such as L1-CAM and neurone glial-related (Nr)-CAM have been shown to be associated with the Wnt signaling pathway and their overexpression has been associated with poor prognosis (63,64). To the best of our knowledge, there is no data available describing the methylation status of the CAMs genes with the exception of JAM2. One study reported an inverse association between methylation and the gene expression of JAM2 in pre-malignant and malignant colorectal tissues (32). This observation supported our findings on JAM2, which is involved in cell-cell adhesion and cell-environment interaction. In conclusion, our integrative analysis which combined DNA methylation and gene expression profiling datasets revealed a potential panel of biomarkers for the diagnosis or prognosis of colorectal cancer. Our integrative analysis also revealed a list of novel genes not previously reported in colorectal cancer.
  64 in total

Review 1.  DNA methylation patterns and epigenetic memory.

Authors:  Adrian Bird
Journal:  Genes Dev       Date:  2002-01-01       Impact factor: 11.361

Review 2.  Signal transduction by cell adhesion receptors and the cytoskeleton: functions of integrins, cadherins, selectins, and immunoglobulin-superfamily members.

Authors:  R L Juliano
Journal:  Annu Rev Pharmacol Toxicol       Date:  2002       Impact factor: 13.820

Review 3.  Carcinoembryonic antigen as a marker for colorectal cancer: is it clinically useful?

Authors:  M J Duffy
Journal:  Clin Chem       Date:  2001-04       Impact factor: 8.327

Review 4.  Epigenetic biomarkers of colorectal cancer: Focus on DNA methylation.

Authors:  Fabio Coppedè
Journal:  Cancer Lett       Date:  2011-12-24       Impact factor: 8.679

5.  Unexplained rising carcinoembryonic antigen (CEA) in the postoperative surveillance of colorectal cancer: the utility of positron emission tomography (PET).

Authors:  P Flamen; O S Hoekstra; F Homans; E Van Cutsem; A Maes; S Stroobants; M Peeters; F Penninckx; L Filez; R P Bleichrodt; L Mortelmans
Journal:  Eur J Cancer       Date:  2001-05       Impact factor: 9.162

6.  Nr-CAM is a target gene of the beta-catenin/LEF-1 pathway in melanoma and colon cancer and its expression enhances motility and confers tumorigenesis.

Authors:  Maralice E Conacci-Sorrell; Tamar Ben-Yedidia; Michael Shtutman; Elena Feinstein; Paz Einat; Avri Ben-Ze'ev
Journal:  Genes Dev       Date:  2002-08-15       Impact factor: 11.361

7.  Highly methylated genes in colorectal neoplasia: implications for screening.

Authors:  Hongzhi Zou; Jonathan J Harrington; Abdirashid M Shire; Rafaela L Rego; Liang Wang; Megan E Campbell; Ann L Oberg; David A Ahlquist
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2007-12       Impact factor: 4.254

8.  Loss of expression and promoter methylation of SLIT2 are associated with sessile serrated adenoma formation.

Authors:  Andrew D Beggs; Angela Jones; Neil Shepherd; Abed Arnaout; Caroline Finlayson; A Muti Abulafi; Dion G Morton; Glenn M Matthews; Shirley V Hodgson; Ian P M Tomlinson
Journal:  PLoS Genet       Date:  2013-05-09       Impact factor: 5.917

9.  Genome-wide methylation profiling demonstrates hypermethylation in maternal leukocyte DNA in preeclamptic compared to normotensive pregnancies.

Authors:  Wendy M White; Brian Brost; Zhifu Sun; Carl Rose; Iasmina Craici; Steven J Wagner; Stephen T Turner; Vesna D Garovic
Journal:  Hypertens Pregnancy       Date:  2013-06-19       Impact factor: 2.108

10.  Integrated analysis of copy number variation and genome-wide expression profiling in colorectal cancer tissues.

Authors:  Nur Zarina Ali Hassan; Norfilza Mohd Mokhtar; Teow Kok Sin; Isa Mohamed Rose; Ismail Sagap; Roslan Harun; Rahman Jamal
Journal:  PLoS One       Date:  2014-04-02       Impact factor: 3.240

View more
  17 in total

1.  Novel epigenetic markers for gastric cancer risk stratification in individuals after Helicobacter pylori eradication.

Authors:  Masahiro Maeda; Satoshi Yamashita; Taichi Shimazu; Naoko Iida; Hideyuki Takeshima; Takeshi Nakajima; Ichiro Oda; Sohachi Nanjo; Chika Kusano; Akiko Mori; Hiroshi Moro; Harumi Yamada; Shoichiro Tsugane; Toshiro Sugiyama; Yoshiharu Sakai; Toshikazu Ushijima
Journal:  Gastric Cancer       Date:  2018-02-09       Impact factor: 7.370

Review 2.  MicroRNA Methylome Signature and Their Functional Roles in Colorectal Cancer Diagnosis, Prognosis, and Chemoresistance.

Authors:  Rashidah Baharudin; Nurul Qistina Rus Bakarurraini; Imilia Ismail; Learn-Han Lee; Nurul Syakima Ab Mutalib
Journal:  Int J Mol Sci       Date:  2022-06-30       Impact factor: 6.208

3.  Local triple-combination therapy results in tumour regression and prevents recurrence in a colon cancer model.

Authors:  João Conde; Nuria Oliva; Yi Zhang; Natalie Artzi
Journal:  Nat Mater       Date:  2016-07-25       Impact factor: 43.841

4.  Extracellular matrix gene expression profiling using microfluidics for colorectal carcinoma stratification.

Authors:  Christopher J Hayes; Catriona M Dowling; Susan Dwane; Mary E McCumiskey; Shona M Tormey; B Anne Merrigan; John C Coffey; Patrick A Kiely; Tara M Dalton
Journal:  Biomicrofluidics       Date:  2016-10-31       Impact factor: 2.800

5.  Effect of junctional adhesion molecule-2 expression on cell growth, invasion and migration in human colorectal cancer.

Authors:  Huishan Zhao; Hefen Yu; Tracey A Martin; Yuxiang Zhang; Gang Chen; Wen G Jiang
Journal:  Int J Oncol       Date:  2016-01-15       Impact factor: 5.650

Review 6.  The role of JAM-B in cancer and cancer metastasis (Review).

Authors:  Huishan Zhao; Hefen Yu; Tracey A Martin; Xu Teng; Wen G Jiang
Journal:  Oncol Rep       Date:  2016-04-27       Impact factor: 3.906

7.  Different effection of p.1125Val>Ala and rs11954856 in APC on Wnt signaling pathway.

Authors:  Fei-Feng Li; Zhi-Xun Zhao; Peng Yan; Song Wang; Zheng Liu; Qiong Zhang; Xiao-Ning Zhang; Chang-Hao Sun; Xi-Shan Wang; Gui-Yu Wang; Shu-Lin Liu
Journal:  Oncotarget       Date:  2017-08-05

Review 8.  PALTEM: What Parameters Should Be Collected in Disaster Settings to Assess the Long-Term Outcomes of Famine?

Authors:  Alexandra Moraru; Maria Moitinho de Almeida; Jean-Marie Degryse
Journal:  Int J Environ Res Public Health       Date:  2018-04-25       Impact factor: 3.390

9.  LogLoss-BERAF: An ensemble-based machine learning model for constructing highly accurate diagnostic sets of methylation sites accounting for heterogeneity in prostate cancer.

Authors:  K Babalyan; R Sultanov; E Generozov; E Sharova; E Kostryukova; A Larin; A Kanygina; V Govorun; G Arapidi
Journal:  PLoS One       Date:  2018-11-02       Impact factor: 3.240

10.  Estrogen receptor α is involved in the regulation of ITGA8 methylation in estrogen receptor-positive breast cancer.

Authors:  Jingxun Wu; Jianghong Cheng; Fuxing Zhang; Xianyang Luo; Zhiming Zhang; Shuai Chen
Journal:  Ann Transl Med       Date:  2020-08
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.