| Literature DB >> 34440357 |
Jun Dai1, Ming Leung2, Weihua Guan3, Han-Tian Guo4, Ruth E Krasnow5, Thomas J Wang6, Wael El-Rifai7, Zhongming Zhao8,9, Terry Reed10.
Abstract
Epigenetics is a mechanism underlying cardiovascular disease. It is unknown whether DNA hydroxymethylation is prospectively associated with the risk for cardiovascular death independent of germline and common environment. Male twin pairs middle-aged in 1969-1973 and discordant for cardiovascular death through December 31, 2014, were included. Hydroxymethylation was quantified in buffy coat DNA collected in 1986-1987. The 1893 differentially hydroxymethylated regions (DhMRs) were identified after controlling for blood leukocyte subtypes and age among 12 monozygotic (MZ) pairs (Benjamini-Hochberg False Discovery Rate < 0.01), of which the 102 DhMRs were confirmed with directionally consistent log2-fold changes and p < 0.01 among additional 7 MZ pairs. These signature 102 DhMRs, independent of the germline, were located on all chromosomes except for chromosome 21 and the Y chromosome, mainly within/overlapped with intergenic regions and introns, and predominantly hyper-hydroxymethylated. A binary linear classifier predicting cardiovascular death among 19 dizygotic pairs was identified and equivalent to that generated from MZ via the 2D transformation. Computational bioinformatics discovered pathways, phenotypes, and DNA motifs for these DhMRs or their subtypes, suggesting that hydroxymethylation was a pathophysiological mechanism underlying cardiovascular death that might be influenced by genetic factors and warranted further investigations of mechanisms of these signature regions in vivo and in vitro.Entities:
Keywords: cardiovascular disease; dizygotic; hydroxymethylation; monozygotic; twins
Mesh:
Year: 2021 PMID: 34440357 PMCID: PMC8392630 DOI: 10.3390/genes12081183
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Graph-based visualization of the 102 candidate differentially hydroxymethylated regions (DhMRs) among 19 monozygotic twin pairs discordant for cardiovascular death. (a). Heatmap. Abbreviation: CD8T = CD8+ T-cells; CD4T = CD4+ T-cells; NK = natural killer cells; Bcel = B-cells; Mono = monocytes; Cov1 = age; 5hmC = 5-hydroxymethylcytosine. Each row represents a DhMR, and each column represents a participant. The normalized count values are log2 transformed and then scaled with z-score in the blue-red color scale. The 102 DhMRs were adjusted for leukocyte subtypes and age. The lower level of hydroxymethylation among co-twins than their twin brothers is in blue, and the higher level of hydroxymethylation than their twin brothers is in red; (b). volcano plot. The horizontal axis represents the log2 fold-change (FC) (case versus control twins), and the vertical axis corresponds to the negative log10 (adjusted p-value). Red dots represent the significant (−log10 adjusted p-value > 2.0) hyper-DhMRs (log2 FC > 2) and blue dots represent significant (−log10 adjusted p-value > 2.0) hypo-DhMRs (log2 FC < −2) when comparing cases with their corresponding control co-twins; and (c). scatter plot. The overall effect of the 102 DhMRs was represented with principal components (PC) 1 and 4 from principal component analysis. The horizontal axis represents PC1 (i.e., dimension 1), and the vertical axis displays PC4 (i.e., dimension 4). Red dots represent case twins, while blue dots represent their control twin brothers.
Annotation of the nearest genes mapped to the 102 differentially hydroxymethylated regions.
| Annotation 1 | Total DhMRs | Hyper-DhMRs | Hypo-DhMRs |
|---|---|---|---|
| n | 102 | 84 | 18 |
| Exons1 | 2 (2.0 3) | 2 (2.3 3) | 0 |
| ncRNA2 | 1 (50 4) | 1 (50 4) | 0 |
| protein-coding2 | 1 (50 4) | 1 (50 4) | 0 |
| Pseudo2 | 0 | 0 | 0 |
| snoRNA2 | 0 | 0 | 0 |
| Intergenic regions1 | 47 (46 3) | 42 (50 3) | 5 (28 3) |
| ncRNA | 14 (30 4) | 13 (31 4) | 1 (20 4) |
| protein-coding | 30 (64 4) | 26 (62 4) | 4 (80 4) |
| pseudo | 3 (6 4) | 3 (7 4) | 0 |
| snoRNA | 0 | 0 | 0 |
| Introns1 | 49 (48 3) | 37 (44 3) | 12 (67 3) |
| ncRNA | 11 (22 4) | 10 (27 4) | 1 (8 4) |
| protein-coding | 35 (71 4) | 25 (68 4) | 10 (83 4) |
| pseudo | 3 (6 4) | 1 (3 4) | 1 (8 4) |
| snoRNA | 1 (2 4) | 1 (3 4) | 0 |
| TSS1 | 2 (2.0 3) | 1 (1.2 3) | 1 (5.6 3) |
| ncRNA | 0 | 0 | 0 |
| protein-coding | 1 (50 4) | 0 | 1 (100 4) |
| pseudo | 1 (50 4) | 1 (100 4) | 0 |
| snoRNA | 0 | 0 | 0 |
| TTS 1 | 2 (2.0 3) | 2 (2.3 3) | 0 |
| ncRNA | 0 | 0 | 0 |
| protein-coding | 2 (100 4) | 2 (100 4) | 0 |
| pseudo | 0 | 0 | 0 |
| snoRNA | 0 | 0 | 0 |
Abbreviations: TSS = promoter transcription start sites; TTS = transcription terminal site; ncRNA = non-coding RNA; DhMRs = differentially hydroxymethylated regions; hyper-DhMRs = differentially hyper-hydroxymethylated regions (case twins vs. control twins); hypo-DhMRs = differentially hypo-hydroxymethylated regions (case twins vs. control twins). Data are presented as n (%) unless otherwise specified. 1 annotation, while 3′-UTR, 5′-UTR, and non-coding regions are not shown as none of the nearest genes, are within/overlapped with them; 2 gene type; 3 percentage of each annotation for all DhMRs and each type of DhMRs, respectively; 4 percentage of each gene type per annotation.
Figure 2Visualization of the binary linear classifier among 19 dizygotic twin pairs (DZ) discordant for cardiovascular death (CVD-dDZ) generated with the 102 candidate DhMRs identified from monozygotic twin pairs (MZ) discordant for cardiovascular death (CVD-dMZ) using 1000 times simulation. (a) A principal component analysis of the 102 DhMRs generated principal component 1 (PC1) and principal component 4 (PC4). The points in the area below the binary linear classifier are case twins while points above the classifier are control twins. Binary linear classifier ; (b) receiver operating characteristic (ROC) curve and the area under the curve (AUC) using the binary linear classifier for the CVD-dDZ; (c) graph-based 2D topology of genetic influences on the binary linear classifier by comparison between MZ and DZ pairs. Two blues arrows show the two-step topological changes (i.e., 2-D transformation) of the MZ classifier (i.e., the vertical line across the origin) to the DZ binary classifier line with a slope of 0.214. The first step is indicated with a clockwise half-circle arrow representing the clockwise rotation of the MZ classifier at the origin by 72.92°. The second step is indicated with an upward arrow in blue representing an upward movement of the rotated MZ classifier along the vertical axis by 0.664 units.
Figure 3Biological functions and pathways, phenotypes, disorders, and (rare) diseases apparently relevant to males. (a) The number of enriched terms for biological functions and pathways (p < 0.05); (b) the number of enriched terms for phenotypes, disorders, diseases and rare diseases (p < 0.05); (c) the number of enriched terms for rare diseases (p < 0.05); (d–h) up to top 5 enriched terms with the smallest nominal p-values < 0.05 for the top terms containing any terms with padj < 0.05. * padj: multiple-testing adjusted p-value using the Benjamini-Hochberg False Discovery Rate (FDR) procedure.
Figure 4De novo motif identification from genes mapped to differentially hydroxymethylated regions (DhMRs) with HOMER. Abbreviations: DhMRs = differentially hydroxymethylated regions; hyper-DhMRs = differentially hyper-hydroxymethylated regions (case vs. control twins); hypo-DhMRs = differentially hypo-hydroxymethylated regions (case vs. control twins). The value next to the motif sequence logo is the negative log10 (p-value).