| Literature DB >> 28094813 |
D M Ciuculete1, A E Boström1, S Voisin1, H Philipps1, O E Titova1, M Bandstein1, L Nikontovic1, M J Williams1, J Mwinyi1, H B Schiöth1.
Abstract
Genome-wide association studies have identified a number of single-nucleotide polymorphisms (SNPs) that are associated with psychiatric diseases. Increasing body of evidence suggests a complex connection of SNPs and the transcriptional and epigenetic regulation of gene expression, which is poorly understood. In the current study, we investigated the interplay between genetic risk variants, shifts in methylation and mRNA levels in whole blood from 223 adolescents distinguished by a risk for developing psychiatric disorders. We analyzed 37 SNPs previously associated with psychiatric diseases in relation to genome-wide DNA methylation levels using linear models, with Bonferroni correction and adjusting for cell-type composition. Associations between DNA methylation, mRNA levels and psychiatric disease risk evaluated by the Development and Well-Being Assessment (DAWBA) score were identified by robust linear models, Pearson's correlations and binary regression models. We detected five SNPs (in HCRTR1, GAD1, HADC3 and FKBP5) that were associated with eight CpG sites, validating five of these SNP-CpG pairs. Three of these CpG sites, that is, cg01089319 (GAD1), cg01089249 (GAD1) and cg24137543 (DIAPH1), manifest in significant gene expression changes and overlap with active regulatory regions in chromatin states of brain tissues. Importantly, methylation levels at cg01089319 were associated with the DAWBA score in the discovery group. These results show how distinct SNPs linked with psychiatric diseases are associated with epigenetic shifts with relevance for gene expression. Our findings give a novel insight on how genetic variants may modulate risks for the development of psychiatric diseases.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28094813 PMCID: PMC5545735 DOI: 10.1038/tp.2016.275
Source DB: PubMed Journal: Transl Psychiatry ISSN: 2158-3188 Impact factor: 6.222
Figure 1Workflow of the study. DAWBA, Development and Well-Being Assessment; mQTL, methylation quantitative trait loci; SNP, single-nucleotide polymorphism.
Discovery and replication data sets’ description
| P | P | |||||
|---|---|---|---|---|---|---|
| Sex (male) | 33 (25.6) | 4 (3.1) | NS | 14 (15.0) | 5 (5.3) | |
| Age (years)±s.d. | 15.34±0.59 | 15.25±0.68 | NS | 15.77±0.61 | 15.70±0.64 | NS |
| BMI (kg/m2)±s.d. | 21.37±2.89 | 24.68±6.3 | NS | 22.23±3.16 | 22.06±3.32 | NS |
Abbreviations: BMI, body mass index; DAWBA, Development and Well-Being Assessment.
Continuous variables are shown as mean±s.d.
Individuals with a general DAWBA psychiatric risk score below 50% were defined as ‘Low risk’ and included 0 (<0.1%), 1 (≈0.5%), 2 (≈3%) and 3 (≈15%) level bands of the DAWBA score. Individuals with level bands 4 (≈50%) and 5 (>70%), having a risk higher than 50%, were assigned to the ‘High risk’ category.
Two-tailed analysis tests the difference between the ‘Low risk’ and ‘High risk’ group using the Student’s t-test for continuous variables and the χ2-test for categorical variables. Bold value signifies P-values<0.05.
Figure 2Illustration of cell-type composition effect using principal component analysis (PCA). The heatmap indicates significant correlations between the first two principal components and the estimations of blood cell types. After cell-type correction, no association between the first two principal components and cell type are longer observed. Bcell, B cell; Gran, granulocyte; Mono, monocytes; NK, natural killer; PC, principal component.
Genome-wide significant psychiatric-associated CpG sites
| cg00112260 | 5298 | rs10914453 ( | 29 885 | 81 (0.5) | 78 (0.4) | −0.44 | 4.72e−08 | 82 (0.03) | 78 (0.03) | −0.40 | 3.68e−08 | |||
| cg01089319 | 3610 | rs2058725 ( | 13 311 | 15 (0.04) | 18 (0.06) | 0.55 | 1.00e−09 | 15 (0.03) | 22 (0.05) | 0.65 | 1.87e−12 | |||
| cg01089319 | 3610 | rs2241165 ( | 1569 | 15 (0.04) | 18 (0.06) | 0.60 | 8.57e−12 | 15 (0.03) | 22 (0.05) | 0.64 | 1.90e−11 | |||
| cg01089249 | 3354 | rs2241165 ( | 1825 | 18 (0.04) | 19 (0.03) | 0.32 | 4.72e−08 | 18 (0.01) | 22 (0.03) | 0.34 | 4.24e−12 | |||
| cg24137543 | 12 394 | rs2530223 ( | 120 860 | 10 (0.04) | 7 (0.04) | −0.95 | 3.24e−08 | 11 (0.03) | 6 (0.02) | −1.05 | 2.41e−11 | |||
| cg08155325 | 2245 | rs2530223 ( | 156 681 | 71 (0.09) | 62 (0.09) | −0.67 | 1.86e−08 | 68 (0.08) | 60 (0.06) | — | 1.62e−07 | NS | ||
| cg02569698 | 518 | rs9296158 ( | 86 953 | 20 (0.04) | 16 (0.03) | −0.41 | 8.91e−08 | 17 (0.02) | 14 (0.02) | — | 9.72e−05 | NS | ||
| cg18766608 | −31 764 | rs2058725 ( | 110 201 764 | 94 (0.01) | 92 (0.01) | -0.32 | 1.41e−07 | — | — | — | NS | NS | ||
Abbreviations: bp, base pair; Bonf, Bonferroni corrected; NS, nonsignificant; PC, principal component; SNP, single-nucleotide polymorphism; TSS, transcription start site; Unadj., unadjusted.
The table shows the associations that remain significant after the Bonferroni correction. All 37 investigated SNPs and the top hit associated CpG using unadjusted analyses are shown in Supplementary Table 2.
Associations between SNPs and DNA methylation in the discovery cohort were performed genome-wide using linear models (limma package, R). In the replication data set, analyses were restricted to the significantly associated SNPs and CpGs. Covariates included were age, sex, BMI, PC1 and PC2. Shown are the raw and Bonferroni P-values.
Non-carriers of the coding allele.
Carriers of the coding allele. Bold values signifiy P-values<0.05.
Figure 3Associations between genotype data at the four validated SNPs and methylation levels at three unique CpG sites (cg01089319, cg01089249 and cg24137543) in the discovery set. Distribution of the beta values at the methylation sites is illustrated for individuals carrying zero, one and two minor alleles. *P-value<0.05 (Student’s t-test); **P-value<0.01 (Student’s t-test); ***P-value<0.001 (Student’s t-test). SNP, single-nucleotide polymorphism.
Figure 4Genomic context of the most significant CpG sites associated with SNPs rs2241165 and rs2058725. Genomic positions of RefSeq genes are displayed in the top part, indicated by blue arrows. The positions of the significant CpG sites are highlighted by black lines. For the investigation of specificity of the associations, long-range interactions were derived from four cell lines targeting two transcription factors. Associations are represented by arcs. Only long-range interactions containing significant CpGs were illustrated. The intensity of the arc is proportional to the strength of the interaction between the two regions. As analyses were performed based on data obtained in blood, chromatin marks overlapping in brain and blood cells were investigated. Chromatin states of eight tissues downloaded from the 37/hg19 WashU Epigenome Browser are illustrated. Each functional role of a segment is indicated by a particular color. BrainAC, brain anterior caudate; BrainAG, brain angular gyrus; BrainCG, brain cingulate gyrus; BrainDPC, brain dorsolateral prefrontal cortex; BrainHIPPO, brain hippocampus; BrainITL, brain inferior temporal lobe; BrainSN, brain substantia nigra; PBMC, peripheral blood mononuclear primary cells; SNP, single-nucleotide polymorphism; TSS, transcription start site.
Figure 5Genomic context of the most significant CpG sites (cg08155325 and cg24137543) associated with SNP rs2530223. Genomic positions of RefSeq genes are displayed in the top part, indicated by blue arrows. The positions of the significant CpG sites are highlighted by black lines. For the investigation of the potential regulatory effect of the significant CpG sites on other genes and of the specificity of the associations, long-range interactions were derived from four cell lines targeting two transcription factors. Associations are represented by arcs. Only long-range interactions containing significant CpGs are illustrated. The intensity of the arc is proportional to the strength of the interaction between the two regions. As analyses were performed based on data obtained in blood, chromatin marks overlapping in brain and blood cells were investigated. Chromatin states of eight tissues downloaded from the 37/hg19 WashU Epigenome Browser are illustrated. Each functional role of a segment is indicated by a particular color. BrainAC, brain anterior caudate; BrainAG, brain angular gyrus; BrainCG, brain cingulate gyrus; BrainDPC, brain dorsolateral prefrontal cortex; BrainHIPPO, brain hippocampus; BrainITL, brain inferior temporal lobe; BrainSN, brain substantia nigra; PBMC, peripheral blood mononuclear primary cells; SNP, single-nucleotide polymorphism; TSS, transcription start site.
Associations of the significant and validated CpG sites and gene expressions in 11 adult healthy individuals
| P | P | |||||
|---|---|---|---|---|---|---|
| cg01089319 | NM_000817 | NS | ||||
| NM_001130823, NM_001379 | NS | NS | ||||
| NM_022552, NM_153759 | NS | NS | ||||
| NM_144589 | NS | NS | ||||
| cg01089249 | NM_000817 | NS | NS | |||
| NM_001130823, NM_001379 | NS | NS | ||||
| NM_022552, NM_153759 | NS | NS | ||||
| NM_144589 | NS | |||||
| cg24137543 | NM_003883 | 1.49 | NS | |||
| NM_018919 | ||||||
| cg00112260 | NM_001525 | NS | NS |
Abbreviations: Coef, coefficient; NS, nonsignificant.
According to GeneChip Human Gene 2.1 ST Array annotation file.
Correlations between methylation levels at CpG sites (M-values) and gene expression levels were performed using robust linear regression models. Shown are P-values and coefficients.
Correlations between methylation levels at CpG sites (M-values) and gene expression levels are performed using Pearson’s correlation analyses. Shown are P-values and coefficients. Bold values signify P-values<0.05.