Literature DB >> 31642367

Epigenome-Wide Association Study for All-Cause Mortality in a Cardiovascular Cohort Identifies Differential Methylation in Castor Zinc Finger 1 (CASZ1).

Jawan W Abdulrahim1, Lydia Coulter Kwee1, Elizabeth Grass1, Ilene C Siegler2, Redford Williams2, Ravi Karra3, William E Kraus1,3, Simon G Gregory1, Svati H Shah1,3.   

Abstract

Background DNA methylation is implicated in many chronic diseases and may contribute to mortality. Therefore, we conducted an epigenome-wide association study (EWAS) for all-cause mortality with whole-transcriptome data in a cardiovascular cohort (CATHGEN [Catheterization Genetics]). Methods and Results Cases were participants with mortality≥7 days postcatheterization whereas controls were alive with≥2 years of follow-up. The Illumina Human Methylation 450K and EPIC arrays (Illumina, San Diego, CA) were used for the discovery and validation sets, respectively. A linear model approach with empirical Bayes estimators adjusted for confounders was used to assess difference in methylation (Δβ). In the discovery set (55 cases, 49 controls), 25 629 (6.5%) probes were differently methylated (P<0.05). In the validation set (108 cases, 108 controls), 3 probes were differentially methylated with a false discovery rate-adjusted P<0.10: cg08215811 (SLC4A9; log2 fold change=-0.14); cg17845532 (MATK; fold change=-0.26); and cg17944110 (castor zinc finger 1 [CASZ1]; FC=0.26; P<0.0001; false discovery rate-adjusted P=0.046-0.080). Meta-analysis identified 6 probes (false discovery rate-adjusted P<0.05): the 3 above, cg20428720 (intergenic), cg17647904 (NCOR2), and cg23198793 (CAPN3). Messenger RNA expression of 2 MATK isoforms was lower in cases (fold change=-0.24 [P=0.007] and fold change=-0.61 [P=0.009]). The CASZ1, NCOR2, and CAPN3 transcripts did not show differential expression (P>0.05); the SLC4A9 transcript did not pass quality control. The cg17944110 probe is located within a potential regulatory element; expression of predicted targets (using GeneHancer) of the regulatory element, UBIAD1 (P=0.01) and CLSTN1 (P=0.03), were lower in cases. Conclusions We identified 6 novel methylation sites associated with all-cause mortality. Methylation in CASZ1 may serve as a regulatory element associated with mortality in cardiovascular patients. Larger studies are necessary to confirm these observations.

Entities:  

Keywords:  cardiac biomarkers; epigenetics; mortality; outcome; transcriptome

Mesh:

Substances:

Year:  2019        PMID: 31642367      PMCID: PMC6898816          DOI: 10.1161/JAHA.119.013228

Source DB:  PubMed          Journal:  J Am Heart Assoc        ISSN: 2047-9980            Impact factor:   5.501


Clinical Perspective

What Is New?

Six novel differentially methylated CpG loci were found to be associated with all‐cause mortality in a cardiovascular cohort. Differential methylation in CASZ1 may serve as a genomic regulatory element associated with increased risk of death in cardiovascular patients.

What Are the Clinical Implications?

Differentially methylated loci may serve as potential biomarkers for patient risk stratification. Cardiovascular disease is the greatest cause of morbidity and mortality in the United States and worldwide.1, 2 Clinical factors, some biomarkers, and genetic factors predict incident cardiovascular events and mortality in secondary prevention patients.3, 4, 5 However, the ability to predict these adverse events and to understand the biology underlying this risk is incomplete. Epigenetic studies provide an opportunity to simultaneously identify biomarkers of disease risk and causal biological pathways, while integrating underlying genetic variation with clinical and environmental risk factors. The evolution of high‐throughput technologies has enabled comprehensive assessments including epigenome‐wide association studies (EWAS) by assessing methylation patterns at a genomic scale.6 EWAS in many diseases have identified biological mechanisms and unexplained heritability. They have established relationships between methylation signatures and a number of phenotypes, including, but not limited to, cancer,7, 8 coronary artery disease (CAD),9 obesity,10 and all‐cause mortality.11, 12 However, to our knowledge, there have been no EWAS for incident mortality in a cardiovascular cohort. Epigenetic modifications play physiological13 and pathophysiological7, 8 roles in phenotypic outcomes. Methylation of cytosine in CpG dinucleotides alters chromatin structure and typically suppresses gene transcription. Given the capacity of environmental exposures to alter gene expression by epigenetic mechanisms, nonphysiological alteration of DNA methylation has the potential to contribute to disease and, ultimately, mortality.11, 12, 14 However, the specific biological mechanisms by which DNA methylation signatures are associated with mortality remain poorly understood. We conducted an EWAS to identify differentially methylated loci predicting mortality in a secondary prevention cardiovascular cohort, CATHGEN (Catheterization Genetics).15

Methods

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Study Population

The CATHGEN biorepository consists of sequential patients undergoing cardiac catheterization at Duke from 2001 to 2010 and has been described in detail previously.15 Blood samples were collected from a femoral arterial catheter before administration of supplemental heparin and immediately processed for whole blood and plasma and frozen at −80°C within a few hours. Cases were defined as subjects with mortality at any time at or >7 days postcatheterization (to avoid procedure‐related deaths). Controls were comprised of CATHGEN individuals alive at the time of last known follow‐up with a minimum of 2 years of follow‐up and who had no subsequent interventional cardiac procedure, coronary bypass grafting, or myocardial infarction. Samples were selected in 2 sets to create a discovery and validation cohort. For each set, we prioritized available cases with the shortest time to death (to enhance potential genetic/epigenetic factors); controls were matched to cases based on genetic ancestry (African or European), sex, age, and ejection fraction (EF). After matching and accounting for quality control (QC) of DNA, the cases and controls were not perfectly matched on these risk factors and thus these variables were also included in multivariable adjusted models. The CATHGEN study was approved by the Duke University Institutional Review Board and participants provided written consent.

Epigenetic Profiling

Sample QC

Samples were quantified using the Quant‐iT PicoGreen dsDNA reagent in a 96‐well plate format (Life Technologies, Grand Island, NY). We used a modified version of the Life Technologies Quant‐iT PicoGreen dsDNA protocol designed by the DNA Bank at the Center for Human Genetics (Duke University, Durham, NC) found in Data S1. DNA quality was assessed by gel electrophoresis similar to those outlined by Chen et al.16 All samples were scored on a 0 to 5 scale; all samples used for the genotyping protocol scored a 3 to 5 on the grading scale. See Data S1 for grading scale descriptions (Duke University, Durham, NC).

Bisulfite conversion

Bisulfite conversion of CATHGEN DNA was carried out using the manufacturer's recommendations. Briefly, 500 ng was bisulfite treated using a Zymo EZ DNA Methylation kit (catalog number: D5001; Zymo Research, Irvine, CA) using PCR conditions for Illumina's Infinium Methylation assay (95°C for 30 seconds, 50°C for 60 minutes×16 cycles).

Methylation study

Methylation data from the discovery cohort were generated using the Illumina HumanMethylation450 DNA Analysis Beadchip (catalog number: WG‐314‐1001; Illumina, San Diego, CA), with validation data that were produced using the Illumina Infinium MethylationEPIC BeadChips (catalog number: WG317‐1001; Illumina), given that the 450K chips were no longer manufacturered. Briefly, a total of 4 μL of bisulfite converted DNA was hybridized to Illumina BeadChips using the manufacturer's protocols. Samples were denatured and amplified overnight for 20 to 24 hours. Fragmentation, precipitation, and resuspension of the samples followed overnight incubation, before hybridization to 450 or EPIC BeadChips for 16 to 24 hours. BeadChips were then washed to remove any unhybridized DNA and labeled with nucleotides to extend the primers to the DNA sample. Following the Infinium HD Methylation protocol, the BeadChips were imaged using the Illumina iScan system (Illumina).

Methylation Preprocessing and QC

For both the discovery and validation sets, raw .idat files were read and preprocessed using the R package, minfi.17, 18 Each data set was preprocessed using noob for background subtraction and dye‐bias normalization, followed by stratified quantile normalization. All methylation values with detection P>0.01 were set to missing, then probes with >1% missing values were removed from further analysis. Cell‐mixture distributions were estimated using the Houseman method19 as implemented in minfi.20 All samples were checked for concordance between reported sex and predicted sex from the methylation data, excessive missing data (>1%), unusual cell mixture estimates, and outlying control probe measurements. Finally, probes were filtered using functions implemented in the DMRcate package21, 22, 23 if the probe's CpG site was within 2 base pairs of a common single‐nucleotide polymorphism (minor allele frequency, >5%), the probe was reported to cross‐hybridize in multiple locations or if the probe was located on the X or Y chromosome. Each methylation data set was examined for potential batch effects using principal components calculated using the 20 000 most variable probes. The first 7 principal components were examined for association with plate effects during bisulfite conversion, methylation array, and row. Principal components were visualized on plots and tested for association with potential batches using linear models.

Statistical Analysis

To assess for differential methylation between cases and controls in each set of samples separately (discovery and validation), a linear model approach with empirical Bayes estimators, as implemented in limma,24, 25 was used. The resulting regression coefficients for the linear models are reported in these analyses as log2 fold change (negative when methylation is lower in cases than controls and positive when methylation is higher in cases than controls). Methylation was measured using , where β is the estimated proportion of cells methylated at a given CpG. Models were adjusted for age, ancestry (European versus African), sex, EF (the left ventricular ejection fraction measured by echocardiography), CAD, estimated cell type proportions, plate, and methylation array row. All probes with nominal P<0.05 (ie, adjusted for covariables, unadjusted for multiple comparisons) in the discovery set were then tested for differential methylation in the validation set and considered to be differentially methylated if the false discovery rate (FDR)‐adjusted P value26 was <0.10. We chose a liberal P value for the discovery set to generate a more expansive set of results, but chose a more conservative P value adjusted for multiple comparisons in the validation cohort to avoid type 1 error. We used METAL27 to conduct a meta‐analysis of P values from the discovery and validation experiments, using an FDR‐adjusted P‐value cutoff of 0.05 to determine significance, given the lack of a separate validation set in this analysis. We tested for differential expression of candidate genes and potential target genes in the combined set of samples (discovery and validation) using a linear model, adjusting for age, ancestry, sex, and expression batch (regression coefficients for the linear models, is reported as log2 fold change). Results from this meta‐analysis were also used in an over‐representation analysis of both Gene Ontology and Kyoto Encyclopedia of Genes and Genomes (KEGG) gene sets using the gometh function from the missMethyl 28 package in R, and the default significance cutoff of P<0.001.The approach uses a modified version of the hypergeometric test that takes into account the probability of each gene being selected, based on the number of CpG sites tested in the gene. Probes that were only present in either the discovery or validation set (ie, the set of probes present on the 450K chip or on the EPIC chip, but not on both) were considered as a separate exploratory experiment, with no validation set available. These probes were considered potentially differentially methylated if the FDR‐adjusted P value was <0.10. Testing for differentially methylated regions (DMRs) was performed using comb‐p,29 which uses P values and chromosomal locations as input. The algorithm adjusts P values according to their correlation with neighboring P values, then identifies regions of low P values using parameters seed (threshold necessary to start identification of a region) and dist (distance to look for neighboring P values also below the threshold). Once regions are defined, a P value is calculated for each using the Sidak correction.30 We used parameters found in previous work to maximize the area under the precision‐recall curve.31 We set seed=0.05 and dist=750 and counted regions as DMRs if the correction region‐based P value was <0.05. Similar to the probe‐based approach, results from this meta‐analysis were used in an over‐representation analysis of both Gene Ontology and KEGG gene sets using the preranked tool in gene set enrichment analysis.32

Gene Expression Profiling

Gene expression profiling was previously described in detail in an earlier work.33 In brief, RNA was initially purified using Qiagen PaxGene Blood RNA MDx Kits in PAXgene tubes containing whole blood. The Illumina TotalPrep RNA amplification kit (Life Technologies) was then used to biotinylate the RNA. Afterward, quantity and quality of the RNA were assessed using the Quant‐iT RiboGreen RNA Assay Kit and Bioanalyzer RNA Nano chip assay (Agilent, Santa Clara, CA), respectively. Expression data from the Human HT‐12v3 Expression BeadChip (Illumina) were used for assessment of the genes of interest. QC was carried out using the Illumina Genome Studio software, and probes detected in >50% of samples and that had a detection P<0.05 were utilized. Plate, nested chip, and sample effects were also assessed using variance components and principal components analyses, and outliers at each level were excluded. Robust multichip average methods were used to log2 transform and quantile normalize the expression data.

Results

Baseline Characteristics

Baseline characteristics of the CATHGEN discovery and validation cohorts are summarized in Table 1. Cases and controls were similar with regard to age, sex, ancestry, EF, body mass index, smoking history, as well as presence of CAD. The mean EF is lower in the discovery set; this is most likely because of the discovery set having been ascertained first where cases were prioritized based on the shortest time to death. Highlighting the separation of these 2 groups, within each set, there is no overlap between time‐to‐event (cases) and last follow‐up (controls). In the discovery set, cases died a median of 141 days (interquartile range [52, 355]; range, 8–527) after their cardiac catheterization procedure, whereas all controls had at least 2138 days of event‐free follow‐up (median, 2981; interquartile range [2734, 3293]). In the validation set, the median time‐to‐event for cases was 1500 days (interquartile range [1054, 1872]; range, 563–2346), whereas all controls had at least 2803 days of event‐free follow‐up (median, 3017; interquartile range [2939, 3303]). Although we focused on all‐cause mortality as a primary phenotype (as is often performed in clinical trials of cardiovascular secondary prevention patients), 53% of the discovery and 30% of the validation cohort were definitively adjudicated to be attributed to cardiovascular causes. In contrast, the remaining individuals were a combination of noncardiac medical, unspecified, and other causes. Given the high prevalence of CAD in this sample (70–71%) and the fact that cardiovascular disease is the most common cause of death,2 the majority of the remaining deaths are assumed to be cardiovascular in origin.
Table 1

Baseline Characteristics of Participants

DiscoveryValidation
Cases (N=55)Controls (N=49)Cases (N=108)Controls (N=108)
Age, mean (SD), y66.8 (11.4)66.2 (10.1)69.3 (10.1)67.8 (9.4)
European ancestry, n (%)45 (81.8)41 (83.7)93 (86.1)91 (84.3)
Female, n (%)17 (30.9)17 (34.7)41 (38.0)41 (38.0)
Diabetes mellitus, n (%)25 (45.5)9 (18.4)34 (31.5)29 (26.9)
Dyslipidemia, n (%)28 (50.9)30 (61.2)71 (65.7)58 (53.7)
History of smoking, n (%)27 (49.1)27 (55.1)53 (49.1)53 (49.1)
BMI, mean (SD)27.6 (5.8)27.8 (6.0)29.0 (7.0)29.9 (5.8)
Ejection fraction, median [IQR]49.0 [33.3, 55.6]50.0 [35.0, 60.5]55.9 [52.9, 65.2]60.0 [55.0, 68.5]
Coronary artery disease, n (%)43 (78.2)35 (71.4)77 (71.3)71 (65.7)
Time to death (cases) or last follow‐up (controls), median [IQR], days141.0 [51.5, 355.0]2981.0 [2734.0, 3293.0]1500.5 [1053.8, 1872.0]3017.0 [2939.0, 3303.0]

BMI indicates body mass index; IQR, interquartile range.

Baseline Characteristics of Participants BMI indicates body mass index; IQR, interquartile range.

EWAS Results: Discovery and Validation of Differentially Methylated Probes

In the discovery cohort, 1715 probes (0.35%) had >1% missing values and were removed, whereas 8678 probes (1.0%) were removed from the validation set. After removing cross‐hybridizing probes, probes near common single‐nucleotide polymorphisms, and nonautosomal probes, we retained 426 122 probes for analysis in the discovery set and 790 644 probes in the validation set. A total of 397 024 probes present on both versions of the Illumina Infinium methylation array were available for analysis in both sample sets. After examining distributions of estimated cell‐type proportions and Illumina control probes during QC, we removed 2 samples from the validation set that were extreme outliers. There were no mismatches between predicted and reported sex in either set, and all samples had <0.6% missing probes, so no other samples were removed during QC. In the discovery cohort, 25 629 (6.5%) were differently methylated between cases and controls (nominal P<0.05) using a model adjusted for age, ancestry, sex, EF, CAD, estimated cell‐type proportions, plate, and methylation array row. These probes were carried forward as candidates to be tested in the validation samples. In the validation set, 3 of the probes validated (at an FDR‐adjusted P<0.10): cg08215811 within intron 15 of solute carrier family 4 member 9 (SLC4A9); cg17845532 within the promoter region of megakaryocyte‐associated tyrosine kinase (MATK); and cg17944110 in the 5′ untranslated region exon 1 of castor zinc finger 1 (CASZ1; Table 2). Using a more‐conservative FDR‐adjusted P<0.05, only cg17845532 would be significant. Each of the 3 validated probes had relatively small observed Δβ values, but displayed consistent directions of effect in the discovery and validation cohorts. The methylation levels for the 3 probes were approximately normally distributed, with no outliers. We also conducted a sensitivity analysis that included body mass index and smoking in the model, and the same 3 probes met our significance criteria. Analysis of the methylation probes present only on the 450K array (discovery cohort, 29 098) or EPIC (validation cohort, 393 620) did not identify any significant differentially methylated probes between cases and controls using an FDR‐adjusted P<0.10. Given the sample size, we also conducted a meta‐analysis to place our primary analysis results into context and to identify any additional loci. We identified 6 probes that were significant after meta‐analysis (FDR‐adjusted P<0.05): the 3 identified above, together with cg20428720 (intergenic), cg17647904 (nuclear receptor corepressor 2; NCOR2), and cg23198793 (Calpain 3 CAPN3) (Table 2). The 2 genes nearest to cg20428720 are C5orf38 and IRX2. IRX2 and NCOR2 encode for a transcription factor34 and transcriptional repressor,35 respectively, whereas CAPN3 encodes for a muscle‐specific protein that binds to titin.36
Table 2

Methylation Probes Associated With All‐Cause Mortality in the Discovery and Validation Cohorts

ProbeGeneChrPos (hg19)DiscoveryValidationMeta‐Analysis
Nominal P Value∆βLog2 Fold Changea Nominal P ValueFDR‐Adjusted P Value∆βLog2 Fold Changea Nominal P ValueFDR‐Adjusted P Value
cg17944110 CASZ1 b c 110 856 6570.00580.00640.184.59E‐060.0590.00780.269.44E‐080.014
cg08215811 SLC4A9 b c 5139 742 9000.0131−0.0068−0.139.37E‐060.08−0.0117−0.154.31E‐070.029
cg17845532 MATK b c 193 786 6430.0115−0.0086−0.221.79E‐060.046−0.0133−0.268.15E‐080.014
cg20428720Intergenicc 53 093 0710.05710.02370.263.55E‐07>0.10.04920.381.38E‐070.014
cg17647904 NCOR2 c 12124 908 6010.0003−0.00080.162.33E‐04>0.10.01160.153.56E‐070.028
cg23198793 CAPN3 c 1542 694 4070.00100.01630.173.65E‐05>0.10.01910.161.44E‐070.014

FDR indicates false discovery rate; FDR‐adjusted, false discovery rate‐adjusted.

Log2 fold change: positive when methylation is higher in cases than controls and negative when methylation is lower in cases than controls.

Probes that were identified in the discovery set using a nominal P<0.05 and validated in the validation set using an FDR‐adjusted P<0.1.

Probes that were identified from the meta‐analysis of both the discovery and validation sets using an FDR‐adjusted P<0.05.

Methylation Probes Associated With All‐Cause Mortality in the Discovery and Validation Cohorts FDR indicates false discovery rate; FDR‐adjusted, false discovery rate‐adjusted. Log2 fold change: positive when methylation is higher in cases than controls and negative when methylation is lower in cases than controls. Probes that were identified in the discovery set using a nominal P<0.05 and validated in the validation set using an FDR‐adjusted P<0.1. Probes that were identified from the meta‐analysis of both the discovery and validation sets using an FDR‐adjusted P<0.05. Results from the meta‐analysis were used in an over‐representation analysis of both Gene Ontology and KEGG gene sets. This analysis identified 1 potentially enriched KEGG gene set, Circadian entrainment (FDR‐adjusted P=0.048). However, the pathway is primarily involved in regulating sleep and circadian rhythm and does not appear to be relevant to the study phenotype. Comb‐p meta‐analysis identified 133 DMRs (Table S1) with an FDR‐adjusted P<0.05, and the most significant DMR (FDR‐adj P=1.37E‐15) was chr1: 2344699‐2345476, which is located upstream of peroxisomal biogenesis factor 10 (PEX10). Unfortunately, there were no expression data for PEX10 available for analysis. Interestingly, the second‐most significant DMR (chr12:12490860‐124908932; FDR‐adj P=3.19E‐12) is located in intron 13 of NCOR2 and includes cg17647904, the NCOR2 probe that was identified in the meta‐analysis of the discovery and validation sets. Another DMR (chr19:3785573‐3786958; FDR‐adj P=3.07E‐05) in the top 50 results included the MATK probe, cg17845532, which encompasses a CpG island and the promoter region of the MATK gene. KEGG and Gene Ontology enrichment analysis of the significant DMRs did not yield any significant results (FDR‐adjusted P<0.1).

Correlation Between Methylation and Expression

We next determined whether there was a correlation between differentially methylated loci identified in our discovery, validation, and meta‐analyses and expression of the adjacent gene. MATK contains 2 (of 3) HT‐12 expression array probes that passed QC: ILMN_1669321 and ILMN_2319000 that capture transcripts NM_139355.2 and NM_139354.2, respectively. Given that both probes have lower expression in cases and controls (P=0.0066 and 0.0090), transcript NM_139355.2 specifically shows differential expression. However, ILMN_1669321 did not show a significant correlation between methylation and expression (P=0.45), whereas ILMN_2319000 showed a weak negative correlation between methylation and expression (P=0.045). The CASZ1 transcript did not show significant differential expression between cases and controls (P=0.15) and did not show correlation between methylation and expression (P=0.18); the SLC4A9 transcript did not pass QC. Both NCOR2 and CAPN3 genes each had 2 expression array probes that passed QC. None of the transcripts displayed differential expression between cases and controls (Table 3).
Table 3

Expression Probe Changes of Genes That Contained Significant Methylation Probes and of the Target Genes of GH01J010790

GeneProbeExpression Levels (log2 Fold Changea)Fold Change P Value
MATK ILMN_2319000−0.240.850.0065
MATK ILMN_1669321−0.610.660.0084
NCOR2 ILMN_2340052−0.070.950.17
NCOR2 ILMN_1698419−0.070.950.44
CAPN3 ILMN_2332691−0.10.940.47
CAPN3 ILMN_1687971−0.010.990.90
Target genes of GH01J010790
UBIAD1 ILMN_1651872−0.330.80.011
CLSTN1 ILMN_2415179−0.130.910.033
EXOSC10 ILMN_1711189−0.100.930.14
CASZ1 ILMN_1655191−0.070.950.15
EXOSC10 ILMN_16707960.041.030.62
TARDBP ILMN_1677532−0.040.970.65
EXOSC10 ILMN_2402168−0.030.980.73

Log2 fold change: positive when expression is higher in cases than controls and negative when expression is lower in cases than controls.

Expression Probe Changes of Genes That Contained Significant Methylation Probes and of the Target Genes of GH01J010790 Log2 fold change: positive when expression is higher in cases than controls and negative when expression is lower in cases than controls. CASZ1 is a transcription factor; the significantly differentially methylated probe, cg17944110, is located in the 5′ untranslated region exon 1 within a promoter/enhancer as predicted by GeneHancer37 identifier: GH01J010790. Given that CASZ1 was the most statistically significant of the 3 genes in the discovery set, we chose to focus on this gene. We identified the predicted target genes of this promoter/enhancer using GeneCards38 and tested for differential expression of predicted target genes between cases and controls (in the combined discovery and validation cohorts). Of the 9 identified target genes, expression data were available for 7 probes from 5 genes (Table 3). Two of these target genes showed differential expression between cases and controls: UBIAD1 (P=0.01) and CLSTN1 (P=0.03), with both showing lower expression in cases as compared with controls, consistent with the direction of effect of CASZ1 methylation. Methylation at the CASZ1 locus was inversely associated with expression levels of UBIAD1 and CLSTN1, with unadjusted P values of 0.02 and 0.013, respectively; however, there was no significance after adjusting for experimental batch (P=0.35 and 0.97, respectively).

Discussion

We present herein the first EWAS of all‐cause mortality in a cardiovascular cohort of secondary prevention patients. Using discovery and validation sets, we identified 3 genes, including SLC4A9, MATK, and the transcription factor CASZ1, which contain probes differentially methylated in patients who have subsequent mortality compared with patients who are alive at a median of 3000 days of follow‐up. These loci have not been previously identified in other EWAS of mortality. Although those studies did not focus on cardiovascular patients,11, 12 these results suggest that these loci may be unique to high‐risk cardiovascular disease patients at risk of mortality. One of our most significant findings was for a differentially methylated probe out of a total of 157 CASZ1‐annotated probes located in the 5′ untranslated region exon 1 of CASZ1 associated with mortality in both the discovery and validation cohorts. It also lies within the promoter/enhancer, GeneHancer identifier GH01J010790 (Figure 1). The region is also listed as an enhancer in the ENCODE database39 and as a “super enhancer” in dbSUPER40 for the following cell types: colon crypt, small intestine, gastric, esophagus, pancreatic islets, left ventricle, psoas muscles, lung, and sigmoid colon. Furthermore, we identified differential expression of 2 of the predicted target genes of the promoter/enhancer in mortality cases and controls, suggesting that epigenetic modification of this gene may influence risk of death in secondary prevention cardiovascular patients. To understand the functional relationship between differential methylation of the predicted CASZ1 promoter/enhancer element and all‐cause mortality, we note the tracks within the University of California, Santa Cruz genome browser (illustrated in Figure 1, using the GViz R package41), particularly GeneHancer,42 that support the locus as an active regulatory site. This locus is correlated with a DNase I hypersensitivity site from the encyclopedia of DNA elements (ENCODE)39 in 114 cell types (Figure 1). Highlighting the putative regulatory role of the locus containing the differentially methylated CASZ1 probe, the locus is also hypomethylated in the majority of cell types and is evolutionarily conserved at the DNA level in multiple species (Figure 1). Epigenetic histone signatures also exemplify the locus’ role as a regulatory element. Histone modification data (mono‐ and trimethylation of histone H3 lysine 4 signatures), from the ENCODE project39 in the University of California, Santa Cruz browser, suggest that the region is an enhancer in 1 cell line (K562) and a promoter in another (NT2‐D1)13 (Figure 1). Our putative model holds that increased methylation of the CASZ1 enhancer decreases the accessibility of regulatory proteins leading to decreased expression of the target genes, including UBIAD1 and CLSTN1 (Figure 2).
Figure 1

The Manhattan plot shows the results for the EWAS conducted on the discovery set. Probes that reached significance using a nominal P value of 0.05 were carried forward as candidates to be tested in the discovery set. CpG probes that are marked with a red X were those that were found to be significant using an FDR‐adjusted P value of 0.1 in the validation set. The bottom 3 panels show the 3 CpG probes that were significant in the validation set in the genome using UCSC genome browser tracks. The 3 CpGs identified are located within CpG islands and DNase clusters (ie, DNase hypersensitivity sites). The probe in CASZ1 (cg17944110) was located in the promoter/enhancer GH01J010790 (red track) and is in a region that is highly conserved. CASZ1 indicates castor zinc finger 1; EWAS, epigenome‐wide association study; FDR, false discovery rate; UCSC, University of California, Santa Cruz.

Figure 2

Plausible biological model displaying the consequences of increased methylation in the enhancer/promoter region of CASZ1, with the resulting changes in the expression of the target genes, UBIAD1 and CLSTN1, of GH01J010790. With increased methylation in the regulatory region in CASZ1, DNA becomes inaccessible to regulatory proteins such as transcription factors (TF). Therefore, expression of GH01J010790's target genes are decreased. With decreased UBIAD1 expression, cardiovascular morbidity and mortality are possibly increased through exaggerated vascular calcification, pathologic cardiac remodelling, oxidative stress, and increased HMG‐CoA availability, ultimately leading to increased all‐cause mortality in the cases. CASZ1 indicates castor zinc finger 1; HMG‐CoA, β‐hydroxy β‐methylglutaryl coenzyme A.

The Manhattan plot shows the results for the EWAS conducted on the discovery set. Probes that reached significance using a nominal P value of 0.05 were carried forward as candidates to be tested in the discovery set. CpG probes that are marked with a red X were those that were found to be significant using an FDR‐adjusted P value of 0.1 in the validation set. The bottom 3 panels show the 3 CpG probes that were significant in the validation set in the genome using UCSC genome browser tracks. The 3 CpGs identified are located within CpG islands and DNase clusters (ie, DNase hypersensitivity sites). The probe in CASZ1 (cg17944110) was located in the promoter/enhancer GH01J010790 (red track) and is in a region that is highly conserved. CASZ1 indicates castor zinc finger 1; EWAS, epigenome‐wide association study; FDR, false discovery rate; UCSC, University of California, Santa Cruz. Plausible biological model displaying the consequences of increased methylation in the enhancer/promoter region of CASZ1, with the resulting changes in the expression of the target genes, UBIAD1 and CLSTN1, of GH01J010790. With increased methylation in the regulatory region in CASZ1, DNA becomes inaccessible to regulatory proteins such as transcription factors (TF). Therefore, expression of GH01J010790's target genes are decreased. With decreased UBIAD1 expression, cardiovascular morbidity and mortality are possibly increased through exaggerated vascular calcification, pathologic cardiac remodelling, oxidative stress, and increased HMG‐CoA availability, ultimately leading to increased all‐cause mortality in the cases. CASZ1 indicates castor zinc finger 1; HMG‐CoA, β‐hydroxy β‐methylglutaryl coenzyme A. CASZ1 encodes a zinc finger transcription factor expressed in a wide variety of cells, including, but not limited to, the brain and heart43; it is frequently deleted in neural‐derived tumors (eg, neuroblastoma).44 In the cardiovascular system, CASZ1 expression is essential for cardiomyocyte cell‐cycle progression45; loss‐of‐function mutations have been implicated in congenital heart disease46 and dilated cardiomyopathy.47 Finally, a single‐nucleotide polymorphism within an intron of CASZ1, rs880315, is associated with hypertension,48 atrial fibrillation,49 and acute ischemic stroke50 in genome‐wide association studies. One of the target genes of the regulatory element of CASZ1, UBIAD1, encodes a menaquinone‐4 biosynthetic enzyme that converts vitamin K derivatives to menaquinone‐4.51 It is also required for the synthesis of extramitochondrial ubiquinone in the Golgi apparatus.52 Interestingly, UBIAD1 has been shown to play a pleiotropic role in cardiovascular physiology. It is involved in cholesterol and phospholipid metabolism through accelerating the degradation of β‐hydroxy β‐methylglutaryl coenzyme A reductase53, 54 and contributes to the pathogenesis of vascular calcification.55 It is also implicated in the pathophysiology of heart failure where angiotensin II–mediated cardiac hypertrophy is exaggerated with knockdown of Ubiad1 in cardiomyocytes,56 partly through increased oxidative stress. Because of its numerous protective roles and the deleterious repercussions of its absence, decreased Ubiad1 levels potentially contribute to cardiovascular mortality in the cases group. Similarly, expression levels of CLSTN1 were lower in the cases. CLSTN1 encodes a transmembrane protein known as calsyntenin‐1 or alcadein‐alpha. Almost ubiquitously expressed, calsyntenin‐1 is involved in postsynaptic signaling in the nervous system57 and in anterograde axoplasmic vesicular transport.58 Studies outside the nervous system are lacking, and it currently has no putative role in cardiovascular disease. How decreased CLSTN1 expression is potentially related to cardiovascular mortality requires further exploration. The methylation probe in the MATK gene is located in its second intron within a CpG island and is located within the significant DMR, chr19:3785573‐3786958 (Table S1). Methylation at the probe site and expression levels were lower in cases than in controls. There is robust evidence that decreased intragenic methylation can lead to decreased expression of the respective gene,59, 60 similar to that observed in our results. The MATK gene encodes for the megakaryocyte‐associated tyrosine kinase, which phosphorylates the Src protein and partakes in regulatory pathways in megakaryocytes.61 MATK has been generally found to serve a protective role in the field of oncology; through its inhibition of Src, it prevents cell growth and invasion.62, 63 The third significant differentially methylated associated with mortality localized to intron 7 of SLC4A9. SLC4A9 (AE4) encodes an anion transporter that is primarily expressed in the kidney.64 However, expression data regarding this particular gene were not available (probe did not pass QC) in our data. Therefore, the functional implications of the increased methylation within the gene cannot be determined. The DMR meta‐analysis yielded differential methylation upstream of the PEX10 gene. The region (chr1: 2344699‐2345476) is located within the promoter of the PEX10 (peroxisomal biogenesis factor 10) gene. PEX10 is a peroxisomal membrane protein that plays a role in transport of peroxisomal matrix proteins and, when mutated, leads to Zellweger syndrome.65 Interestingly, a DMR in APOB (chr2:21266500‐21267335; FDR‐adj P=7.00E‐04) was also identified in the meta‐analysis. The apolipoprotein B (ApoB) protein has a very well‐established role in low‐density lipoprotein transport and metabolism and, when mutated, can lead to familial hypercholestorelemia, which leads to premature atherosclerosis.66 Unfortunately, the APOB transcript was not available for analysis. Therefore, whether or not the aforementioned DMR plays a role in modulating its expression and contributing to atherosclerosis in our sample is unknown. Our findings are unique when compared with 2 previous EWAS of all‐cause mortality.11, 12 Our study focused on secondary prevention patients, referred for cardiac catheterization, whereas the previous studies included healthier middle‐aged/elderly twins11 and 2 elderly cohorts (Epidemiologische Studie zu Chancen der Verhütung, Früherkennung und optimierten Therapie chronischer Erkrankungen in der älteren Bevölkerung [ESTHER] and KORA [Cooperative Health Research in the Region of Augsburg]),12 respectively. Also, both of the aforementioned studies used larger sample sizes (N=870 and N=1900). Furthermore, there might be underlying heterogeneity between the causes of death in our study and the aforementioned EWASs. Three probes from the ESTHER/KORA study12 and 1 from the twin study11 were nominally significant in our study, using a fully adjusted model, in both the discovery and validation cohorts (Table S2), but they were no longer significant after FDR adjustment. Interestingly, 2 of the 3 probes from the ESTHER/KORA study12 displayed methylation levels in the similar direction as the CATHGEN participants where cases had lower methylation levels compared with controls. Although our findings are interesting and suggest functional consequences of the identified epigenetic modifications, some limitations should be noted. The sample size of our study is relatively small, therefore our study warrants replication. However, we did use a discovery and validation cohort to minimize type 1 error. In addition, cases used for analysis may be sicker than controls, given that they died earlier than controls, but we note that models were adjusted for potential confounders. Relatedly, controls were selected based on the absence of a subsequent coronary revascularization procedure or myocardial infarction to ensure that controls were truly free of cardiovascular disease, but this could also contribute to sampling bias. We used an FDR adjustment for multiple comparisons with a P‐value threshold of 0.10 in the validation cohort, which could be viewed as too liberal of a cutoff; thus, results placed within a more‐conservative FDR cutoff of 0.05 are also presented, and furthermore, we present meta‐analysis results which place combined stronger effects in context and maximize power. Although statistically significant, the ∆β values from both the discovery and validation cohorts were modest. Also, even though there was a statistically significant difference between the expression levels of UBIAD1 and CLSTN1 between cases and controls, we did not observe a significant direct correlation between methylation at the CASZ1 enhancer locus and expression levels of the downstream targets. Given the dynamic nature of DNA methylation and other epigenetic mechanisms, our study only demonstrates a “snapshot” of a single genomic methylation profile. Temporally, regulatory elements demonstrate changing methylation profiles and are not permanent.13, 14 Therefore, the sites of altered methylation demonstrate correlation with, rather than causation of, early‐onset death. The sample used for our study represents an overall “sick” group of patients, and as such, survival bias might be playing a role. Because all of the participants enrolled had undergone a cardiac catheterization procedure, the results may not be generalizable to population‐based cohorts. In conclusion, using an EWAS in discovery and validation cohorts of secondary prevention patients, we found evidence of differential methylation of a regulatory element with consequent decrease in 2 of its potential targets, of which 1 has demonstrated cardioprotective benefits. Future studies are required to replicate our findings and further ascertain the potential role of UBIAD1 role in cardiovascular mortality.

Sources of Funding

This work was supported by National Heart, Lung, and Blood Institute grants 5P01‐HL036587 (Williams, Shah) and 5R01‐HL095987 (Shah).

Disclosures

None. Data S1. Methods for DNA quantification and DNA qualitative assessment. Table S1. Comb‐p DMR Meta‐Analysis Results of the Discovery and Validation Sets Table S2. Probes With Nominal P<0.05 Found in Both Discovery and Validation Cohorts That Are Common to Previous EWAS of All‐Cause Mortality Click here for additional data file.
  62 in total

1.  Change in Epigenome-Wide DNA Methylation Over 9 Years and Subsequent Mortality: Results From the InCHIANTI Study.

Authors:  Ann Zenobia Moore; Dena G Hernandez; Toshiko Tanaka; Luke C Pilling; Mike A Nalls; Stefania Bandinelli; Andrew B Singleton; Luigi Ferrucci
Journal:  J Gerontol A Biol Sci Med Sci       Date:  2015-09-09       Impact factor: 6.053

Review 2.  A Guide for a Cardiovascular Genomics Biorepository: the CATHGEN Experience.

Authors:  William E Kraus; Christopher B Granger; Michael H Sketch; Mark P Donahue; Geoffrey S Ginsburg; Elizabeth R Hauser; Carol Haynes; L Kristin Newby; Melissa Hurdle; Z Elaine Dowdy; Svati H Shah
Journal:  J Cardiovasc Transl Res       Date:  2015-08-14       Impact factor: 4.132

Review 3.  DNA methylation and human disease.

Authors:  Keith D Robertson
Journal:  Nat Rev Genet       Date:  2005-08       Impact factor: 53.242

4.  Calsyntenin-1 docks vesicular cargo to kinesin-1.

Authors:  Anetta Konecna; Renato Frischknecht; Jochen Kinter; Alexander Ludwig; Martin Steuble; Virginia Meskenaite; Martin Indermühle; Marianne Engel; Chuan Cen; José-Maria Mateos; Peter Streit; Peter Sonderegger
Journal:  Mol Biol Cell       Date:  2006-06-07       Impact factor: 4.138

5.  Effect of potentially modifiable risk factors associated with myocardial infarction in 52 countries (the INTERHEART study): case-control study.

Authors:  Salim Yusuf; Steven Hawken; Stephanie Ounpuu; Tony Dans; Alvaro Avezum; Fernando Lanas; Matthew McQueen; Andrzej Budaj; Prem Pais; John Varigos; Liu Lisheng
Journal:  Lancet       Date:  2004 Sep 11-17       Impact factor: 79.321

6.  Genetic Risk, Adherence to a Healthy Lifestyle, and Coronary Disease.

Authors:  Amit V Khera; Connor A Emdin; Isabel Drake; Pradeep Natarajan; Alexander G Bick; Nancy R Cook; Daniel I Chasman; Usman Baber; Roxana Mehran; Daniel J Rader; Valentin Fuster; Eric Boerwinkle; Olle Melander; Marju Orho-Melander; Paul M Ridker; Sekar Kathiresan
Journal:  N Engl J Med       Date:  2016-11-13       Impact factor: 91.245

7.  GeneHancer: genome-wide integration of enhancers and target genes in GeneCards.

Authors:  Simon Fishilevich; Ron Nudel; Noa Rappaport; Rotem Hadar; Inbar Plaschkes; Tsippi Iny Stein; Naomi Rosen; Asher Kohn; Michal Twik; Marilyn Safran; Doron Lancet; Dana Cohen
Journal:  Database (Oxford)       Date:  2017-01-01       Impact factor: 3.451

8.  DNA methylation signatures in peripheral blood strongly predict all-cause mortality.

Authors:  Yan Zhang; Rory Wilson; Jonathan Heiss; Lutz P Breitling; Kai-Uwe Saum; Ben Schöttker; Bernd Holleczek; Melanie Waldenberger; Annette Peters; Hermann Brenner
Journal:  Nat Commun       Date:  2017-03-17       Impact factor: 14.919

9.  Genetic and epigenetic changes in the common 1p36 deletion in neuroblastoma tumours.

Authors:  H Carén; S Fransson; K Ejeskär; P Kogner; T Martinsson
Journal:  Br J Cancer       Date:  2007-10-16       Impact factor: 7.640

10.  Ae4 (Slc4a9) is an electroneutral monovalent cation-dependent Cl-/HCO3- exchanger.

Authors:  Gaspar Peña-Münzenmayer; Alvin T George; Gary E Shull; James E Melvin; Marcelo A Catalán
Journal:  J Gen Physiol       Date:  2016-05       Impact factor: 4.086

View more
  5 in total

1.  Lifetime marijuana use and epigenetic age acceleration: A 17-year prospective examination.

Authors:  Joseph P Allen; Joshua S Danoff; Meghan A Costello; Gabrielle L Hunt; Amanda F Hellwig; Kathleen M Krol; Simon G Gregory; Stephanie N Giamberardino; Karen Sugden; Jessica J Connelly
Journal:  Drug Alcohol Depend       Date:  2022-02-17       Impact factor: 4.492

Review 2.  Genetic and epigenetic analyses of panic disorder in the post-GWAS era.

Authors:  Yoshiro Morimoto; Shinji Ono; Naohiro Kurotaki; Akira Imamura; Hiroki Ozawa
Journal:  J Neural Transm (Vienna)       Date:  2020-05-09       Impact factor: 3.575

3.  An effective processing pipeline for harmonizing DNA methylation data from Illumina's 450K and EPIC platforms for epidemiological studies.

Authors:  Lauren A Vanderlinden; Randi K Johnson; Patrick M Carry; Fran Dong; Dawn L DeMeo; Ivana V Yang; Jill M Norris; Katerina Kechris
Journal:  BMC Res Notes       Date:  2021-09-08

Review 4.  DNA methylation signatures of incident coronary heart disease: findings from epigenome-wide association studies.

Authors:  Yujing Xia; Alison Brewer; Jordana T Bell
Journal:  Clin Epigenetics       Date:  2021-10-09       Impact factor: 6.551

Review 5.  Ten Years of EWAS.

Authors:  Siyu Wei; Junxian Tao; Jing Xu; Xingyu Chen; Zhaoyang Wang; Nan Zhang; Lijiao Zuo; Zhe Jia; Haiyan Chen; Hongmei Sun; Yubo Yan; Mingming Zhang; Hongchao Lv; Fanwu Kong; Lian Duan; Ye Ma; Mingzhi Liao; Liangde Xu; Rennan Feng; Guiyou Liu; The Ewas Project; Yongshuai Jiang
Journal:  Adv Sci (Weinh)       Date:  2021-08-11       Impact factor: 16.806

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.