Literature DB >> 34341332

Genome-wide DNA methylation and gene expression analyses in monozygotic twins identify potential biomarkers of depression.

Weijing Wang1, Weilong Li2, Yili Wu1, Xiaocao Tian3, Haiping Duan3, Shuxia Li4, Qihua Tan4,5, Dongfeng Zhang6.   

Abstract

Depression is currently the leading cause of disability around the world. We conducted an epigenome-wide association study (EWAS) in a sample of 58 depression score-discordant monozygotic twin pairs, aiming to detect specific epigenetic variants potentially related to depression and further integrate with gene expression profile data. Association between the methylation level of each CpG site and depression score was tested by applying a linear mixed effect model. Weighted gene co-expression network analysis (WGCNA) was performed for gene expression data. The association of DNA methylation levels of 66 CpG sites with depression score reached the level of P < 1 × 10-4. These top CpG sites were located at 34 genes, especially PTPRN2, HES5, GATA2, PRDM7, and KCNIP1. Many ontology enrichments were highlighted, including Notch signaling pathway, Huntington disease, p53 pathway by glucose deprivation, hedgehog signaling pathway, DNA binding, and nucleic acid metabolic process. We detected 19 differentially methylated regions (DMRs), some of which were located at GRIK2, DGKA, and NIPA2. While integrating with gene expression data, HELZ2, PTPRN2, GATA2, and ZNF624 were differentially expressed. In WGCNA, one specific module was positively correlated with depression score (r = 0.62, P = 0.002). Some common genes (including BMP2, PRDM7, KCNIP1, and GRIK2) and enrichment terms (including complement and coagulation cascades pathway, DNA binding, neuron fate specification, glial cell differentiation, and thyroid gland development) were both identified in methylation analysis and WGCNA. Our study identifies specific epigenetic variations which are significantly involved in regions, functional genes, biological function, and pathways that mediate depression disorder.
© 2021. The Author(s).

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34341332      PMCID: PMC8329295          DOI: 10.1038/s41398-021-01536-y

Source DB:  PubMed          Journal:  Transl Psychiatry        ISSN: 2158-3188            Impact factor:   6.222


Introduction

Depression is currently the leading cause of disability worldwide [1], and it is predicted to be one of the three leading causes of illness by 2030 [2]. Although the heavy social and economic burden, the potential molecular mechanisms underlying depression remain poorly understood. The depression risk is influenced by both genetic and environmental factors. It is suggested that epigenetic modification could mediate the lasting increasing depression risk resulting from exposure to adverse life events and provide a mechanistic framework, where genetic and environmental factors were integrated [3, 4]. DNA methylation was one important form of epigenetic modification, and one recent review of 67 studies concluded that there was evidence for the association of DNA methylation variation with depression risk [5]. Additionally, candidate gene studies discovered that BDNF and SLC6A4 hypermethylation were related to depression or major depressive disorder (MDD) [5]. Even currently some significant methylation modifications were found to be associated with depression, however, no consistent results were identified. Nowadays, using monozygotic twins discordant for a trait or disease has been proved to be a powerful and popular design for EWAS in linking the environmental basis of epigenetic modification variation to disease status while controlling for individual genetic component [6, 7]. This design has been extensively used to explore specific DNA methylation variation associated with diseases, such as cognitive function decline [8], Alzheimer’s disease [9], and rheumatoid arthritis [10]. Since the Chinese population are different from other ethnic populations worldwide concerning genetic constitutions, environmental exposure, a multitude of life styles and occupations, the DNA methylation variation may also differ. However, to our knowledge, yet no EWAS has been performed to explore the DNA methylation variation associated with depression using Chinese monozygotic twin samples. Accordingly, we aimed to conduct an EWAS to detect DNA CpG sites associated with depression and further integrate with gene expression data in a sample of middle and ole-aged Chinese monozygotic twins.

Material and methods

The primary materials and methods of this study were similar to our previously published studies [8, 11, 12].

Participants

Participants recruitment and collection were described in detail previously [13]. Participants who suffered from cerebrovascular disorders, stroke, traumatic brain injury, central nervous system (CNS) tumor, CNS infections, and alcohol or drug dependence were excluded. Meanwhile, participants who were unconscious, unable, or unwilling to cooperate were also dropped. Finally, a total of 58 complete monozygotic twin pairs with a mean age of 52 years (SD: 7.5) were included in the methylation analysis, and a subsample consisted of 12 twin pairs were included in the gene expression analysis. The median of absolute values of intrapair depression score difference (∆(depression score)) of all twins was 4 (range: 1–15). The number of twin pairs for ∆(depression score) ranging in 1–5, 6–10, and 11–15 was 39, 15, and 4 in the methylation analysis and 9, 3, and 0 in the gene expression analysis, respectively. The median of ratio difference calculated as |∆(depression score)|/max(depression score) was 0.41 (range: 0.14–1.00). After providing written informed consent, all participants took a standardized questionnaire and underwent a health examination. This study was approved by the Regional Ethics Committee of the Qingdao CDC Institutional Review Boards. And the ethical principles of the Helsinki Declaration were also followed.

Assessment of depression

Depression was assessed by the Geriatric Depression Scale-30 (GDS-30) [14]. The GDS-30 had 30 items, and participants were asked to answer “yes” or “no” to the items based on how they felt over the past 1 week. The higher the total score was, the more severe the participant’s mental condition was.

Reduced representation bisulfite sequencing (RRBS) analysis

The total DNA was extracted from whole blood and sent to one biomarker technology corporation in China for the RRBS experiment. Briefly, genomic DNA was first digested to generate short fragments that contained CpG dinucleotides at the ends. Then the CpG-rich DNA fragments were extracted and bisulfite-converted. The cDNA library was constructed and sequenced to get raw sequencing data, which was then preprocessed and mapped by Bismark [15] and smoothed by R package BiSeq [16]. The methylation β-value was transformed to M-value for statistical modeling. Finally, a total of 551,447 CpG sites were included.

Cell-type composition estimation

Considering total DNA was extracted from whole blood, different methylation profiles of distinct cell types may lead to false discoveries [17]. We used ReFACTor method to control for the cell-type composition effect on DNA methylation in EWAS [18]. ReFACTor is an unsupervised reference-free method that selects methylation sites, which are informative about the cell composition in the data to apply to principal component analysis (PCA) and further uses the top components of PCA to construct surrogates for the underlying cell-type compositions for adjustment in statistical analysis. In our study, the top five components were chosen as covariates to correct the cell-type heterogeneity.

RNA library construction, sequencing, and quality control

Briefly, after total mRNA was extracted from whole blood, the RNA-Seq library was constructed and sequenced to get the sequenced data. The data was then mapped to the human genome by TopHat2 [19]. The gene expression level was estimated by FPKM value through Cufflinks software [20].

Statistical analysis

Methylation analysis

Epigenome-wide association analysis

The association between the methylation level of each CpG site and depression score was tested by a linear mixed effect model, which was equivalent to the regression model as proposed by Tan et al. [6]. The fixed effect variables of age, gender, and cell-type composition as well as random effect variable of twin pairing were adjusted for in the model.

Predicting functions of cis-regulatory regions and ontology enrichments analysis

The identified epigenome CpG sites (P < 0.05) were submitted to the Genomic Regions Enrichment of Annotations Tool (GREAT) online to analyze the functional significance of cis-regulatory regions and ontology enrichments [21]. The default “basal plus extension” association rule was chosen. In this rule, a “basal regulatory region” irrespective of the presence of neighboring genes which extended 5 kb upstream and 1 kb downstream of the transcription start site (TSS) were firstly assigned. Then each gene’s regulatory domain was extended up to the basal regulatory region of the nearest upstream and downstream genes, but no longer than 1 Mb in each direction. FDR < 0.05 was set as statistically significant in ontology enrichments analysis.

Differentially methylated region (DMR) analysis

Based on the bisulfite sequencing data and corresponding EWAS results, the DMRs associated with depression score were detected by using the comb-p approach [22]. Significant enriched DMRs were evaluated by Stouffer-Liptak-Kechris (slk) corrected P-value < 0.05.

Gene expression analysis

Weighted gene co-expression network analysis (WGCNA)

The WGCNA package is a comprehensive collection of R functions for performing various aspects of weighted correlation network analysis [23, 24]. Briefly, we first established a weighted adjacency matrix. Then the topological overlap matrix (TOM) was constructed [25-27] and used as input for hierarchical clustering analysis [28]. Gene modules were detected by using a dynamic tree cutting algorithm. The module eigengenes (MEs) were correlated with the trait of depression score. Relationships among modules were illustrated by a hierarchical clustering dendrogram of MEs [29], and a heatmap plot of the corresponding eigengene network. Intramodular hub genes were defined following criteria of depression score based gene significance (GS) > 0.70 and module membership (MM) > 0.90 with a threshold of P-value < 0.01 [30]. For the genes clustered in the module associated with depression score, GO enrichment analysis and BIOCARTA, KEGG, and REACTOME pathway enrichment analysis were conducted by the DAVID tool [31, 32]. The modified fisher exact P-value < 0.05 was considered as enrichment cut-off criterion.

Differentially expressed genes analysis

Five depression cases (depression score > 10) and eight health controls were included. The gene expression levels of 46 genes (including the genes where the top CpG sites and the DMRs were located) between the two groups were compared by the Wilcoxon rank sum test. The P-value < 0.05 was considered as statistically significant.

Results

Methylation analysis

A total of 58 monozygotic twin pairs with a mean age of 52 years (SD: 7.5) were included. The median of depression score was 8 (95% range: 0–21). Most of the clinical indicators were statistically intrapair correlated, indicating that the co-twin design could be beneficial (Table 1).
Table 1

Basic characteristics of the participants.

CharacteristicsValuesIntrapair correlation
rP-value
Number of twin pairs58
Gender, pairs (%)
Male29 (50)
Female29 (50)
Age, mean (SD) (years)52 (7.5)
Depression score, M (P2.5, P97.5)8 (0, 21)0.360.006
Cognitive function score, M (P2.5, P97.5)21 (8, 30)0.39*0.002
BMI, mean (SD), (kg/m2)25.18 (3.63)0.63**<0.001
Systolic, M (P2.5, P97.5) (mmHg)130 (104, 179)0.44*0.001
Diastolic, M (P2.5, P97.5) (mmHg)82 (62, 100)0.31*0.020
SUA, mean (SD) (μmol/L)302 (95)0.52**<0.001
GLU, M (P2.5, P97.5) (mmol/L)5.4 (3.60, 10.86)0.57**<0.001
CHOL, mean (SD) (mmol/L)4.97 (1.18)0.54**<0.001
TG, M (P2.5, P97.5) (mmol/L)1.19 (0.20, 5.67)0.58**<0.001
HDLC, M (P2.5, P97.5) (mmol/L)1.34 (0.66, 2.71)0.83**<0.001
LDLC, mean (SD) (mmol/L)2.89 (0.88)0.45**<0.001

Continuous variables were presented as mean (standard deviation (SD)) or median (P2.5, P97.5); Categorical variables were presented as numbers with percentages.

BMI body mass index, CHOL total cholesterol, GLU fasting glucose, HDLC high-density lipoprotein cholesterol, LDLC low-density lipoprotein cholesterol, SUA serum uric acid, TG triglyceride.

Basic characteristics of the participants. Continuous variables were presented as mean (standard deviation (SD)) or median (P2.5, P97.5); Categorical variables were presented as numbers with percentages. BMI body mass index, CHOL total cholesterol, GLU fasting glucose, HDLC high-density lipoprotein cholesterol, LDLC low-density lipoprotein cholesterol, SUA serum uric acid, TG triglyceride.

Epigenome-wide association analysis

As shown in the Manhattan plot (Fig. 1) and Table 2, the association of DNA methylation levels of 66 top CpG sites with depression score reached the level of P < 1 × 10−4. The numbers of these top CpG sites located at chr1, chr4, chr16, chr19, chr20, chr11, chr7, chr6, chr12, and chr17 were 9, 8, 7, 6, 6, 5, 5, 4, 3, and 3, respectively. The strongest association (β = 0.403, P = 2.951 × 10−8) was detected for the CpG site (chr16: 90,143,728 bp) located near PRDM7. All of the top CpG sites (P < 1 × 10−4) were located at 34 genes, and there were 5, 4, 4, 4, 3, 3, 3, 3, and 3 CpG sites located at/near PTPRN2, HES5, PRDM7, RIOK1, FCGBP, HELZ2, HPF1, LAMA5, and TRIM69, respectively.
Fig. 1

Circular Manhattan plot for epigenome-wide association study of depression.

The numbers of chromosome and the −log10 of P-values for statistical significance are shown. The dots represent the observed CpG sites.

Table 2

The results of epigenome-wide association study on depression score (P-value <1 × 10−4).

ChromosomePosition (bp)CoefficientP-valueEnsembl gene IDHGNC symbol
chr1690,143,7280.4032.951E−08ENSG00000126856PRDM7
chr1690,143,7200.4024.592E−08ENSG00000126856PRDM7
chr1690,143,7340.3614.714E−08ENSG00000126856PRDM7
chr12,460,4310.0544.083E−07ENSG00000197921HES5
chr1690,143,7520.3111.753E−06ENSG00000126856PRDM7
chr4170,696,0320.1322.809E−06NANA
chr4170,696,0370.1313.123E−06NANA
chr7157,794,468−0.0553.609E−06ENSG00000155093PTPRN2
chr4170,696,0180.1314.001E−06NANA
chr4170,696,0530.1304.603E−06NANA
chr4170,696,0140.1314.857E−06ENSG00000056050HPF1
chr67,441,8170.0617.444E−06ENSG00000124784RIOK1
chr1716,540,145−0.2749.447E−06ENSG00000197566ZNF624
chr7157,378,3240.0521.138E−05ENSG00000155093PTPRN2
chr4170,696,0620.1281.172E−05ENSG00000056050HPF1
chr1231,296,6700.0891.202E−05ENSG00000119283TRIM67
chr67,441,8380.0651.218E−05ENSG00000124784RIOK1
chr11114,480,7480.0411.337E−05ENSG00000137634NXPE4
chr1231,296,6630.0891.358E−05ENSG00000167333TRIM68
chr7157,378,3130.0461.398E−05ENSG00000155093PTPRN2
chr1231,296,6600.0891.555E−05ENSG00000185880TRIM69
chr11114,480,7680.0441.590E−05ENSG00000137634NXPE4
chr11130,343,9430.3361.617E−05ENSG00000166106ADAMTS15
chr122,903,455−0.0581.622E−05ENSG00000256150ITFG2-AS1
chr7157,378,3070.0441.699E−05ENSG00000155093PTPRN2
chr5140,045,2680.0261.765E−05ENSG00000120314WDR55
chr1818,627,551−0.0401.910E−05ENSG00000124784ROCK1
chr11130,343,9540.3332.152E-05ENSG00000166106ADAMTS15
chr19999,961−0.0212.233E−05ENSG00000116032GRIN3B
chr7157,378,3390.0522.274E−05ENSG00000155093PTPRN2
chr1716,540,132−0.2702.365E−05ENSG00000197566ZNF624
chr12,460,4660.0312.831E−05ENSG00000197921HES5
chr1459,113,3710.2532.918E−05ENSG00000165617DACT1
chr2062,194,2200.2063.388E−05ENSG00000130589HELZ2
chr67,441,8460.0683.433E−05ENSG00000124784RIOK1
chr1231,296,6480.0853.830E−05ENSG00000185880TRIM69
chr12,475,1770.0624.081E−05ENSG00000197921HES5
chr4170,696,0670.1274.084E−05ENSG00000056050HPF1
chr1231,296,6410.0845.177E−05ENSG00000185880TRIM69
chr3128,205,4180.1425.408E−05ENSG00000179348GATA2
chr19999,937−0.0195.557E−05ENSG00000116032GRIN3B
chr9a95,376,1520.2725.571E−05ENSG00000188312CENPP
ENSG00000127080IPPK
chr1940,366,3230.1635.666E−05ENSG00000275395FCGBP
chr2060,886,3540.0315.690E−05ENSG00000130702LAMA5
chr2060,886,3570.0315.912E−05ENSG00000130702LAMA5
chr1940,366,3080.1635.921E−05ENSG00000275395FCGBP
chr2062,194,2130.2036.059E−05ENSG00000130589HELZ2
chr122,903,464−0.0626.221E−05ENSG00000256150ITFG2-AS1
chr67,441,8500.0736.404E−05ENSG00000124784RIOK1
chr2250,985,4080.0466.716E−05ENSG00000130487KLHDC7B
chr424,423,0940.0406.749E−05ENSG00000109819PPARGC1A
chr5170,068,7010.0447.195E−05ENSG00000182132KCNIP1
chr1669,969,3290.2497.352E−05ENSG00000198373WWP2
chr2060,886,3480.0317.884E−05ENSG00000130702LAMA5
chr1656,998,1910.0408.090E−05ENSG00000087237CETP
chr9a95,376,1370.2838.368E−05ENSG00000188312CENPP
ENSG00000127080IPPK
chr12133,341,5010.0398.446E−05ENSG00000176915ANKLE2
chr1656,998,1860.0388.749E−05ENSG00000087238CETP
chr2062,194,2020.2028.765E−05ENSG00000130589HELZ2
chr1454,419,6150.0288.799E−05ENSG00000125378BMP4
chr193,823,0800.0419.100E−05ENSG00000105278ZFR2
chr12,460,4720.0289.116E−05ENSG00000197921HES5
chr1940,366,3410.1629.234E−05ENSG00000275395FCGBP
chr10102,822,6750.0329.349E−05ENSG00000107821KAZALD1
chr1740,825,6880.0309.473E−05ENSG00000068137PLEKHH3
chr112,037,0420.0559.810E−05ENSG00000130600H19

NA not available.

aThe CpG sites annotated to two genes.

Circular Manhattan plot for epigenome-wide association study of depression.

The numbers of chromosome and the −log10 of P-values for statistical significance are shown. The dots represent the observed CpG sites. The results of epigenome-wide association study on depression score (P-value <1 × 10−4). NA not available. aThe CpG sites annotated to two genes.

Predicting functions of cis-regulatory regions and ontology enrichments analysis

A total of 15,978 genomic cis-regulatory regions were identified to be associated with one or more genes. (Supplementary Fig. 1) Many important pathway terms probably related to depression were significantly enriched, such as Notch signaling pathway, nicotine pharmacodynamics pathway, Huntington disease, p53 pathway by glucose deprivation, Parkinson disease, and hedgehog signaling pathways. Moreover, the GO enriched terms mainly highlighted DNA binding and nucleic acid metabolic process (Table 3).
Table 3

The top GREAT ontology enrichments for regions potentially related to depression by using binomial test.

Ontology databaseTerm nameBinom FDR Q-valueBinom region fold enrichment
Pathways
PANTHERTranscription regulation by bZIP transcription factor1.94E−243.18
PANTHERNotch signaling pathway2.38E−071.81
PANTHERNicotine pharmacodynamics pathway5.88E−061.85
PANTHERHuntington disease5.95E−061.38
PANTHERp53 pathway by glucose deprivation7.56E−052.03
PANTHERATP synthesis7.44E−0510.97
PANTHERParkinson disease7.32E−051.45
PANTHERHedgehog signaling pathway1.72E−041.76
PANTHERMuscarinic acetylcholine receptor 1 and 3 signaling pathway1.31E−031.37
PANTHERAdrenaline and noradrenaline biosynthesis1.21E−021.59
BioCycHistidine degradation III8.26E−097.34
BioCycPalmitate biosynthesis I (animals)1.27E−083.92
BioCycCatecholamine biosynthesis1.09E−086.49
BioCycAdenine and adenosine salvage I2.15E−0718.12
BioCycAspartate biosynthesis2.86E−078.98
BioCycMethylglyoxal degradation VI4.68E−073.41
BioCycOxidized GTP and dGTP detoxification6.76E−0718.79
BioCycHypusine biosynthesis8.58E−0724.23
BioCycSerine and glycine biosynthesis2.47E−063.16
BioCycDolichyl-diphosphooligosaccharide biosynthesis2.45E−063.50
MSigDBElongation arrest and recovery1.03E−436.52
MSigDBFormation of tubulin folding intermediates by CCT/TriC6.80E−305.44
MSigDBNotch signaling pathway5.63E−282.85
MSigDBPrefoldin mediated transfer of substrate to CCT/TriC2.68E−264.43
MSigDBThrombin signaling through proteinase activated receptors (PARs)3.13E−253.45
MSigDBFormation of RNA Pol II elongation complex3.55E−253.64
MSigDBGlucagon signaling in metabolic regulation1.69E−242.84
MSigDBPKC-catalyzed phosphorylation of inhibitory phosphoprotein of myosin phosphatase2.20E−232.67
MSigDBProstacyclin signaling through prostacyclin receptor4.70E−224.40
MSigDBG alpha (12/13) signaling events3.12E−201.98
GO function
GO-MFDNA binding1.26E−1141.36
GO-MFNucleic acid binding1.06E−971.27
GO-MFSequence-specific DNA binding transcription factor activity7.48E−981.46
GO-MFNucleic acid binding transcription factor activity9.13E−961.46
GO-MFSequence-specific DNA binding1.57E−901.54
GO-MFOrganic cyclic compound binding1.75E−871.18
GO-MFHeterocyclic compound binding5.68E−871.18
GO-MFTranscription regulatory region DNA binding4.84E−581.61
GO-MFRegulatory region DNA binding1.72E−571.60
GO-MFTranscription regulatory region sequence-specific DNA binding1.64E−531.85
GO-BPRNA metabolic process1.71E−891.27
GO-BPGene expression8.91E−881.26
GO-BPRNA biosynthetic process7.09E−871.30
GO-BPTranscription, DNA-dependent7.90E−861.30
GO-BPRegulation of macromolecule biosynthetic process2.51E−821.23
GO-BPNucleobase-containing compound biosynthetic process5.60E−821.28
GO-BPRegulation of RNA biosynthetic process1.08E−811.24
GO-BPNucleic acid metabolic process1.56E−801.23
GO-BPOrganic cyclic compound biosynthetic process2.66E−791.26
GO-BPRegulation of RNA metabolic process7.78E−791.23

MF molecular function, BP biological process.

The top GREAT ontology enrichments for regions potentially related to depression by using binomial test. MF molecular function, BP biological process.

Differentially methylated region (DMR) analysis

Among the 19 DMRs identified (Fig. 2 and Table 4), the methylation levels of 14 DMRs (1, 2, 3, 5–12, 15, 17, 19) were positively and two DMRs (13, 14) negatively correlated with depression score. But it was difficult to determine the trend of associations between three DMRs (4, 16, 18) and depression score.
Fig. 2

Differential methylation patterns for the identified differentially methylated regions (DMRs).

The horizontal axis shows the chromosome positions with the black point indicating each CpG, and the vertical axis shows the coefficient for the association of each CpG sites with depression score. The blue line indicates the methylation pattern for each DMR. BP, base pair; chr, chromosome.

Table 4

The results of annotation to significant differentially methylated regions (DMRs) (slk corrected P-value < 0.05).

DMR IDChromosomeStartEndLengthStouffer-Liptak-Kechris (slk) corrected P-valueGene symbolLocation
1chr67,441,7727,441,894110.001CAGE1Near
2chr2110,989,96210,991,112590.002TPTEAt
3chr7157,378,078157,378,360130.002PTPRN2At
4chr12132,329,893132,330,413180.003MMP17At
5chr5140,045,103140,045,416130.004WDR55At
6chr1256,329,61856,329,905170.004DGKAAt
7chr1940,366,27640,366,427140.008FCGBPAt
8chr2250,985,40850,985,540150.009KLHDC7BAt
9chr1690,143,72090,143,886110.020PRDM7At
10chr1523,034,97123,035,42360.021NIPA2At
11chr5138,729,711138,730,335360.023PROB1At
12chr12,475,0752,475,337140.024TNFRSF14-AS1At
13chr295,539,95995,540,22390.032TEKT4At
14chr937,002,61937,002,829240.036PAX5At
15chr1687,901,44387,901,581110.040SLC7A5At
16chr6101,850,301101,850,869220.043GRIK2At
17chr1454,419,54554,419,845160.044BMP4At
18chr2223,163,019223,163,942290.045PAX3At
19chr1871,143871,655230.048SAMD11At

Differential methylation patterns for the identified differentially methylated regions (DMRs).

The horizontal axis shows the chromosome positions with the black point indicating each CpG, and the vertical axis shows the coefficient for the association of each CpG sites with depression score. The blue line indicates the methylation pattern for each DMR. BP, base pair; chr, chromosome. The results of annotation to significant differentially methylated regions (DMRs) (slk corrected P-value < 0.05). The DMRs were located at/near 19 genes, among which DGKA and NIPA2 might play an important roles in regulating depression. Interestingly, several DMRs covered the top CpG sites listed in Table 2. The DMR1 (located at CAGE1), DMR3 (PTPRN2), and DMR9 (PRDM7) covered 4 CpG sites, and the DMR7 (located at FCGBP) covered three CpG sites.

Gene expression analysis

There were 12 twin pairs (including seven male pairs) with a median age of 53 years (95% range: 43–65) and a median depression score of 7.5 (range: 1–27) included in the gene expression analysis.

Weighted gene co-expression network analysis (WGCNA)

As Fig. 3 illustrated, genes clustered in pink module (including 3629 genes) were positively correlated with both depression score (r = 0.62, P = 0.002) and disease status (r = 0.49, P = 0.02). For this module, neuroactive ligand–receptor interaction, nicotine addiction, calcium signaling pathway, glutam4atergic synapse, and nervous system development were significantly enriched. (Table 5) MM and depression score-based GS exhibited a very significant positive correlation in pink module (r = 0.67, P < 0.001) (Supplementary Fig. 2), and 27 hub genes were identified (Supplementary Table 1).
Fig. 3

Relationships of consensus module eigengenes and external trait.

Each row in the table corresponds to a consensus module, and each column to a trait. Numbers in the table report the correlations of the corresponding module eigengenes and trait with the P-values printed below the correlations in parentheses. The table is color coded by correlation according to the color legend.

Table 5

The results of GO function and KEGG pathway enrichment analysis for genes clustered in pink module by DAVID tool.

CategoryTermCountP-value
Pathways
KEGGNeuroactive ligand–receptor interaction933.03E−14
KEGGOlfactory transduction995.04E−07
KEGGNicotine addiction201.53E−06
KEGGCalcium signaling pathway494.49E−05
KEGGMorphine addiction305.49E−05
KEGGGlutamatergic synapse355.99E−05
KEGGECM-receptor interaction281.62E−04
KEGGSerotonergic synapse331.98E−04
KEGGMaturity onset diabetes of the young128.83E−04
KEGGRetrograde endocannabinoid signaling299.61E−04
BIOCARTAIntrinsic prothrombin activation pathway101.61E−03
KEGGBile secretion212.78E−03
KEGGGABAergic synapse237.88E−03
KEGGLinoleic acid metabolism118.60E−03
BIOCARTAPlatelet amyloid precursor protein pathway78.88E−03
KEGGSteroid hormone biosynthesis171.17E−02
KEGGComplement and coagulation cascades191.42E−02
GO function
GO-BPChemical synaptic transmission883.54E−13
GO-BPPotassium ion transmembrane transport452.07E−07
GO-BPRegulation of ion transmembrane transport419.55E−07
GO-MFG-protein coupled receptor activity1671.73E−06
GO-MFCalcium ion binding1657.84E−06
GO-MFVoltage-gated potassium channel activity242.58E−05
GO-BPG-protein coupled receptor signaling pathway1993.06E−05
GO-BPSodium ion import across plasma membrane109.99E−05
GO-BPNervous system development741.54E−04
GO-BPG-protein coupled receptor signaling pathway192.83E−04
GO-MFGlutamate receptor activity86.30E−04
GO-MFGABA-A receptor activity101.83E−03
GO-BPCentral nervous system development342.46E−03

MF molecular function, BP biological process.

Relationships of consensus module eigengenes and external trait.

Each row in the table corresponds to a consensus module, and each column to a trait. Numbers in the table report the correlations of the corresponding module eigengenes and trait with the P-values printed below the correlations in parentheses. The table is color coded by correlation according to the color legend. The results of GO function and KEGG pathway enrichment analysis for genes clustered in pink module by DAVID tool. MF molecular function, BP biological process.

The common genes and enrichment terms between methylation analysis and WGCNA

The CpG sites (P < 0.05) were annotated to 2808 genes, of which 404 genes were also clustered in the pink module in WGCNA. Among these common genes, DENND5B, KBTBD13, TENM3, and BMP2 were also identified as hub genes following our strict criteria. And genes including PRDM7, KCNIP1, PLEKHH3, GRIK2, PROB1, PAX3 were where the top CpG sites or DMRs were located. (Supplementary Table 2) Many common enrichment terms were also found, including extra cellular matrix (ECM)-receptor interaction pathway, maturity onset diabetes of the young pathway, complement and coagulation cascades pathway, DNA binding, neuron fate specification, glial cell differentiation, thyroid gland development, and cellular response to hormone stimulus.

Differentially expressed genes related to depression

Genes including HELZ2 (P = 0.013), PTPRN2 (P = 0.040), GATA2 (P = 0.013), ZNF624 (P = 0.019) were found differentially expressed between the two independent groups.

Discussion

In this study based on monozygotic twins, we detected some important epigenetic variants underlying depression by EWAS. A total of 66 interesting CpG sites (P < 1 × 10−4) and 19 DMRs were identified. Moreover, many crucial GREAT ontology enrichments were identified. Genes clustered in the pink module were positively correlated with depression score in WGCNA, and many genes and enrichment terms were overlapped between methylation analysis and WGCNA. Finally, four genes were found to be differentially expressed in depression cases and health controls. In EWAS, some genes where the top CpG sites and DMRs were located (Tables 2 and 4) may play essential roles in regulating depression status: (1) PTPRN2: DNA methylation variation of PTPRN2 was found to be associated with mood state disturbances across [33]; (2) HES5: HES5 could negatively regulate 5-HT1A receptor gene, which was related to MDD and suicide [34]; (3) GATA2: It was reported that overexpression of human GATA2 interfered with spine formation and produced depressive behavior in rats [35]; (4) DGKA: Blood transcript levels of DGKA differed significantly between participants with MDDs and nondepressed controls [36]; (5) NIPA2: It was suggested that rare copy number variants (CNVs) in NIPA2 could increase the risk of MDD by disrupting regulatory regions [37]; (6) PRDM7: The protein encoded by this gene was involved in lysine degradation pathway, and lysine methylation was a physiological post-translational modification of tau protein which played an important role in aging and Alzheimer’s disease [38]; (7) KCNIP1: The protein encoded by this gene was a member of the family of cytosolic voltage-gated potassium (Kv) channel-interacting proteins (KCNIPs), and could regulate rapidly inactivating (A-type) currents and hence neuronal membrane excitability; (8) GRIK2: GRIK2, as one glutamate-related gene, might be related to risk for mood disorders [39], and the gene polymorphism of GRIK2 was associated with depressive symptoms [40]. The other genes have an unknown function in terms of depression now, whereas they may also be interesting potential candidates to be future researched and validated. As additional validation, we integrated the methylation data with gene expression data. Genes clustered in the pink module were positively correlated with depression score in WGCNA. And some genes were in common with EWAS findings, like BMP2, PRDM7, KCNIP1, and GRIK2. It was indicated that histone deacetylases could control neurogenesis in embryonic brain by inhibiting BMP2/4 signaling [41]. The other three genes had been discussed above. Additionally, four genes HELZ2, PTPRN2, GATA2, and ZNF624 were differentially expressed between depression cases and health controls. PTPRN2 and GATA2 have been discussed above, whereas the biological of HELZ2 and ZNF624 involved in depression remained to be studied further. Two strengths can be noticed in our study. Since the case co-twin design using monozygotic twins discordant for a trait or disease was a powerful tool for EWAS [6, 7], our results based on the twin data would be credible. Meanwhile, considering the various genetic constitutions, environmental exposures, and a multitude of life styles in different ethnic populations worldwide, our findings will specifically help elucidate the underlying pathogenesis of depression in the Chinese population. Nevertheless, some limitations of our study should also be considered. First, the sample size of the present study was relatively limited due to the challenges of recruiting and identifying qualified twin pairs. We’ll further validate the top CpG sites, essential genes, and biological pathways in a community population. And we’ll also evaluate if the physical distribution of top CpG sites at different chromosomes is over-represented in the regulatory domain of one specific biological pathway. Additionally, we’ll conduct a causal effect analysis based on one specific biological pathway by integrating data of genetic variation, epigenetic variation, and environmental factors. Second, the Townsend deprivation index (TDI) was indicated to be associated with depression [42, 43]. However, we couldn’t add this factor as a covariate in the linear mixed effects model, because we didn’t investigate the corresponding information of TDI during the epidemiological survey. We’ll consider the TDI factor in the validation analysis in the future. In summary, we confirm that epigenetic factors are significant in explaining depression through twin-based analysis. We detected multiple CpG sites, genes, DMRs, and pathways that were potentially associated with depression. The findings provided an important clues to further elucidate the pathogenesis of depression and helped to identify new diagnostic biomarkers and therapeutic targets for this disease. Supplementary legends Supplementary table 1 Supplementary table 2 Supplementary figure 1 Supplementary figure 2
  43 in total

1.  Comb-p: software for combining, analyzing, grouping and correcting spatially correlated P-values.

Authors:  Brent S Pedersen; David A Schwartz; Ivana V Yang; Katerina J Kechris
Journal:  Bioinformatics       Date:  2012-09-05       Impact factor: 6.937

2.  Quantification of Tau Protein Lysine Methylation in Aging and Alzheimer's Disease.

Authors:  Carol J Huseby; Claire N Hoffman; Grace L Cooper; Jean-Christophe Cocuron; Ana P Alonso; Stefani N Thomas; Austin J Yang; Jeff Kuret
Journal:  J Alzheimers Dis       Date:  2019       Impact factor: 4.472

Review 3.  Progress in Epigenetics of Depression.

Authors:  Catherine J Peña; Eric J Nestler
Journal:  Prog Mol Biol Transl Sci       Date:  2018-04-10       Impact factor: 3.622

Review 4.  Genetic Studies on the Tripartite Glutamate Synapse in the Pathophysiology and Therapeutics of Mood Disorders.

Authors:  Rafael T de Sousa; Alexandre A Loch; André F Carvalho; André R Brunoni; Marie Reine Haddad; Ioline D Henter; Carlos A Zarate; Rodrigo Machado-Vieira
Journal:  Neuropsychopharmacology       Date:  2016-08-11       Impact factor: 7.853

5.  Weighted gene co-expression network analysis of expression data of monozygotic twins identifies specific modules and hub genes related to BMI.

Authors:  Weijing Wang; Wenjie Jiang; Lin Hou; Haiping Duan; Yili Wu; Chunsheng Xu; Qihua Tan; Shuxia Li; Dongfeng Zhang
Journal:  BMC Genomics       Date:  2017-11-13       Impact factor: 3.969

6.  Increased DNA methylation variability in rheumatoid arthritis-discordant monozygotic twins.

Authors:  Amy P Webster; Darren Plant; Simone Ecker; Flore Zufferey; Jordana T Bell; Andrew Feber; Dirk S Paul; Stephan Beck; Anne Barton; Frances M K Williams; Jane Worthington
Journal:  Genome Med       Date:  2018-09-04       Impact factor: 11.117

7.  Personalized Epigenome Remodeling Under Biochemical and Psychological Changes During Long-Term Isolation Environment.

Authors:  Fengji Liang; Ke Lv; Yue Wang; Yanhong Yuan; Liang Lu; Qiang Feng; Xiaolu Jing; Honghui Wang; Changning Liu; Simon Rayner; Shukuan Ling; Hailong Chen; Yumin Wan; Wanlong Zhou; Li He; Bin Wu; Lina Qu; Shanguang Chen; Jianghui Xiong; Yinghui Li
Journal:  Front Physiol       Date:  2019-07-31       Impact factor: 4.566

8.  WGCNA: an R package for weighted correlation network analysis.

Authors:  Peter Langfelder; Steve Horvath
Journal:  BMC Bioinformatics       Date:  2008-12-29       Impact factor: 3.169

9.  Histone deacetylases control neurogenesis in embryonic brain by inhibition of BMP2/4 signaling.

Authors:  Maya Shakèd; Kathrin Weissmüller; Hanno Svoboda; Peter Hortschansky; Norikazu Nishino; Stefan Wölfl; Kerry L Tucker
Journal:  PLoS One       Date:  2008-07-16       Impact factor: 3.240

10.  Accounting for cellular heterogeneity is critical in epigenome-wide association studies.

Authors:  Andrew E Jaffe; Rafael A Irizarry
Journal:  Genome Biol       Date:  2014-02-04       Impact factor: 13.583

View more
  7 in total

1.  DNA methylation and waist-to-hip ratio: an epigenome-wide association study in Chinese monozygotic twins.

Authors:  Y Wu; H Tian; W Wang; W Li; H Duan; D Zhang
Journal:  J Endocrinol Invest       Date:  2022-07-26       Impact factor: 5.467

2.  Epigenome-wide DNA methylation in obsessive-compulsive disorder.

Authors:  Miriam A Schiele; Jan Lipovsek; Pascal Schlosser; Michael Soutschek; Gerhard Schratt; Michael Zaudig; Götz Berberich; Anna Köttgen; Katharina Domschke
Journal:  Transl Psychiatry       Date:  2022-06-01       Impact factor: 7.989

3.  Protocol for a pilot randomized controlled trial of a mobile health exercise intervention for older patients with myeloid neoplasms (GO-EXCAP 2).

Authors:  Kah Poh Loh; Chandrika Sanapala; Michelle Janelsins; Heidi D Klepin; Rebecca Schnall; Eva Culakova; Michael B Sohn; Paula Vertino; Martha Susiarjo; Marielle Jensen-Battaglia; Michael W Becker; Jane Liesveld; Jason H Mendler; Eric Huselton; Po-Ju Lin; Karen Mustian
Journal:  J Geriatr Oncol       Date:  2021-12-21       Impact factor: 3.929

4.  Elevated BICD2 DNA methylation in blood of major depressive disorder patients and reduction of depressive-like behaviors in hippocampal Bicd2-knockdown mice.

Authors:  Jianbo Xiu; Jiayu Li; Zeyue Liu; Hui Wei; Caiyun Zhu; Rongrong Han; Zijing Liu; Wanwan Zhu; Yan Shen; Qi Xu
Journal:  Proc Natl Acad Sci U S A       Date:  2022-07-18       Impact factor: 12.779

5.  Bioinformatic analysis and experimental validation identified DNA methylation-Related biomarkers and immune-cell infiltration of atherosclerosis.

Authors:  Congjian Xu; Di Sun; Changmin Wei; Hao Chang
Journal:  Front Genet       Date:  2022-09-08       Impact factor: 4.772

6.  Association of Urinary Iodine Concentration with Depressive Symptoms among Adults: NHANES 2007-2018.

Authors:  Shumin Chen; Kaiwen Cui; Jia Luo; Dongfeng Zhang
Journal:  Nutrients       Date:  2022-10-07       Impact factor: 6.706

7.  Dietary Vitamin C and Vitamin C Derived from Vegetables Are Inversely Associated with the Risk of Depressive Symptoms among the General Population.

Authors:  Anni Wang; Jia Luo; Tianhao Zhang; Dongfeng Zhang
Journal:  Antioxidants (Basel)       Date:  2021-12-13
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.