| Literature DB >> 27777418 |
M-L Wong1,2, M Arcos-Burgos3,4, S Liu1,2, J I Vélez3,5, C Yu1,2, B T Baune6, M C Jawahar6, V Arolt7, U Dannlowski7,8, A Chuah3, G A Huttley3, R Fogarty1, M D Lewis1,2, S R Bornstein7,9, J Licinio1,2.
Abstract
Major depressive disorder (MDD) affects around 350 million people worldwide; however, the underlying genetic basis remains largely unknown. In this study, we took into account that MDD is a gene-environment disorder, in which stress is a critical component, and used whole-genome screening of functional variants to investigate the 'missing heritability' in MDD. Genome-wide association studies (GWAS) using single- and multi-locus linear mixed-effect models were performed in a Los Angeles Mexican-American cohort (196 controls, 203 MDD) and in a replication European-ancestry cohort (499 controls, 473 MDD). Our analyses took into consideration the stress levels in the control populations. The Mexican-American controls, comprised primarily of recent immigrants, had high levels of stress due to acculturation issues and the European-ancestry controls with high stress levels were given higher weights in our analysis. We identified 44 common and rare functional variants associated with mild to moderate MDD in the Mexican-American cohort (genome-wide false discovery rate, FDR, <0.05), and their pathway analysis revealed that the three top overrepresented Gene Ontology (GO) processes were innate immune response, glutamate receptor signaling and detection of chemical stimulus in smell sensory perception. Rare variant analysis replicated the association of the PHF21B gene in the ethnically unrelated European-ancestry cohort. The TRPM2 gene, previously implicated in mood disorders, may also be considered replicated by our analyses. Whole-genome sequencing analyses of a subset of the cohorts revealed that European-ancestry individuals have a significantly reduced (50%) number of single nucleotide variants compared with Mexican-American individuals, and for this reason the role of rare variants may vary across populations. PHF21b variants contribute significantly to differences in the levels of expression of this gene in several brain areas, including the hippocampus. Furthermore, using an animal model of stress, we found that Phf21b hippocampal gene expression is significantly decreased in animals resilient to chronic restraint stress when compared with non-chronically stressed animals. Together, our results reveal that including stress level data enables the identification of novel rare functional variants associated with MDD.Entities:
Mesh:
Year: 2016 PMID: 27777418 PMCID: PMC5461220 DOI: 10.1038/mp.2016.174
Source DB: PubMed Journal: Mol Psychiatry ISSN: 1359-4184 Impact factor: 15.992
Genome-wide variants significantly associated with major depressive disorder
|
|
|
|
|
| P | P |
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|
| exm167893 | 1 | rs41310573 |
| T | 4.39E−20 | 3.59E−15 | 0.30 | 0.07 | c.115A>G | p.Ser39Gly |
| exm1508600 | 19 | rs201935337 |
| C | 4.77E−11 | 1.95E−6 | 0.10 | 0.00 | c.136G>A | p.Gly46Arg |
| exm1616604 | 22 | rs140395831 |
| A | 3.01E−10 | 6.16E−6 | 0.09 | 0.00 | c.1252A>C | p.Ser418Arg |
| exm875366 | 11 | rs56293203 |
| G | 1.57E−9 | 2.57E−5 | 0.13 | 0.02 | c.442G>A | p.Val148Ile |
| exm283068 | 2 | rs78562453 |
| C | 6.23E−8 | 7.28E−4 | 0.06 | 0.00 | c.36G>C | p.Arg12Ser |
| exm445797 | 5 | rs115054458 |
| C | 3.43E−7 | 3.51E−3 | 0.08 | 0.01 | c.7391C>T | p.Ala2464Val |
| exm1441979 | 19 | rs143696449 |
| G | 4.55E−7 | 4.13E−3 | 0.06 | 0.00 | c.3206C>T | p.Ala1069Val |
| exm669085 | 7 | rs748441912 |
| A | 4.94E−7 | 4.04E−3 | 0.11 | 0.02 | c.2535G>A | Synonymous |
| exm75804 | 1 | rs62001028 |
| G | 5.32E−7 | 3.96E−3 | 0.07 | 0.00 | c.820C>T | p.Arg274Trp |
| exm1355772 | 17 | rs150952348 |
| T | 6.71E−7 | 4.57E−3 | 0.06 | 0.00 | c.3160A>G | p.Ile1054Val |
| exm1044842 | 12 | Rs782472239 |
| C | 9.12E−7 | 5.74E−3 | 0.08 | 0.00 | c.638C>A | p.Pro213His |
| exm1435859 | 19 | rs112610420 |
| T | 1.22E−6 | 7.15E−3 | 0.07 | 0.00 | c.875A>C | p.Tyr292Ser |
| exm1325307 | 17 | rs142029931 |
| G | 1.31E−6 | 7.16E−3 | 0.06 | 0.00 | c.2501G>A | p.Arg834His |
| exm1505393 | 19 | rs201483250 |
| C | 1.48E−6 | 7.55E−3 | 0.17 | 0.06 | c.535C>T | p.Arg179Trp |
| exm1369092 | 17 | rs200897153 |
| C | 2.49E−6 | 0.01 | 0.10 | 0.01 | c.5801G>A | p.Arg1934His |
| exm1293569 | 17 | rs3744550 |
| T | 3.57E−6 | 0.02 | 0.30 | 0.15 | c.5585A>G | p.His1862Arg |
| exm782507 | 9 | rs115668237 |
| C | 5.87E−6 | 0.03 | 0.04 | 0.00 | c.1239C>G | p.Cys413Trp |
| exm2249659 | 11 | rs56344012 |
| A | 1.04E−5 | 0.04 | 0.07 | 0.01 | c.15861A>G | Synonymous |
| exm2275308 | 19 | rs200520741 |
| C | 1.05E−5 | 0.04 | 0.05 | 0.00 | Splicing | Splicing |
Abbreviations: Chr, chromosome; FDR, false discovery rate; Freq., frequency; HGVS, Human Genome Variation Society; rs ID, reference SNP identification number; SNP, single nucleotide polymorphism.
Figure 1Flow diagram describing the process of filtering out low quality and functionally irrelevant variants. From 247 909 variants, 164 011 variants were discarded because they were monoallelic, had more than two alleles, had a call rate of <90%, or because their genotype proportions deviated from the expected ones as defined by the Hardy-–Weinberg (H–W) equilibrium law in both cases (patients with major depression) and controls at a P-value of 2 × 10−7. The remaining 83 898 variants were used for exome-wide association analysis. After additional functional filtering using several tools (SIFT, PolyPhen-2, MutationTaster, Gerp++ and phyloP) to exclude tolerated and conserved variants, and after the exclusion of non-exonic variants and variants in the control subjects with allelic frequency >0.01, the remaining 47 296 variants were used in rare variant analysis. MDD, major depressive disorder.
Figure 2Manhattan plots for a GWAS of major depressive disorder (MDD) in a cohort of Mexican-American patients (n=203) and matched controls (n=196). Results of the GWAS using an additive model after correcting stratification by principal component analysis. In the inset, the first and second principal components that shows absence of genetic stratification (left box), and the Q–Q plot of observed vs expected of the −log10 (P-values, right box). The estimated Genomic Control ‘inflation factor’ λ was 1. GWAS, genome-wide association study.
Rare variants analysis using the KBAC method after filtering out: tolerated and non-conserved, non-exonic and variants that in controls had a MAF >0.01
|
|
|
|
| P |
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| 19 | 17 434 032 | 17 445 638 |
| 9.99E−04 | 0.08 | 0.03 | 399 | 2 | 4 |
| 19 | 11 485 383 | 11 487 627 |
| 9.99E−04 | 0.06 | 0.03 | 399 | 2 | 4 |
| 19 | 3 136 191 | 3 163 766 |
| 9.99E−04 | 0.07 | 0.04 | 399 | 3 | 4 |
| 15 | 78 287 327 | 78 369 994 |
| 9.99E−04 | 0.07 | 0.04 | 399 | 3 | 5 |
| 12 | 6 419 602 | 6 437 672 |
| 9.99E−04 | 0.07 | 0.05 | 399 | 5 | 6 |
| 19 | 19 040 010 | 19 052 041 |
| 2.00E−03 | 0.06 | 0.04 | 399 | 3 | 4 |
| 19 | 14 843 205 | 14 889 353 |
| 2.00E−03 | 0.10 | 0.04 | 399 | 10 | 13 |
| 17 | 73 823 308 | 73 840 798 |
| 2.00E−03 | 0.09 | 0.04 | 399 | 8 | 14 |
| 12 | 55 945 011 | 55 945 940 |
| 2.00E−03 | 0.07 | 0.04 | 399 | 6 | 7 |
| 11 | 67 776 048 | 67 796 743 |
| 2.00E−03 | 0.07 | 0.05 | 399 | 4 | 5 |
| 10 | 135 051 408 | 135 055 433 |
| 3.00E−03 | 0.06 | 0.04 | 399 | 2 | 4 |
| 9 | 139 607 024 | 139 619 170 |
| 3.00E−03 | 0.08 | 0.04 | 399 | 1 | 3 |
| 8 | 145 747 761 | 145 752 416 |
| 3.00E−03 | 0.07 | 0.05 | 399 | 3 | 5 |
| 7 | 16 639 401 | 16 685 442 |
| 3.00E−03 | 0.09 | 0.05 | 399 | 3 | 4 |
| 4 | 2 965 343 | 3 042 474 |
| 3.00E−03 | 0.14 | 0.05 | 399 | 7 | 10 |
| 17 | 40 950 854 | 40 963 605 |
| 4.00E−03 | 0.07 | 0.04 | 399 | 4 | 5 |
| 17 | 40 834 632 | 40 852 011 |
| 4.00E−03 | 0.07 | 0.04 | 399 | 4 | 5 |
| 16 | 1 664 641 | 1 727 909 |
| 4.00E−03 | 0.07 | 0.04 | 399 | 6 | 7 |
| 12 | 110 220 892 | 110 271 212 |
| 4.00E−03 | 0.05 | 0.04 | 399 | 4 | 6 |
| 11 | 4 615 269 | 4 616 243 |
| 4.00E−03 | 0.06 | 0.05 | 399 | 1 | 2 |
| 11 | 1 012 824 | 1 036 706 |
| 4.00E−03 | 0.11 | 0.05 | 399 | 14 | 18 |
| 9 | 125 486 269 | 125 487 204 |
| 4.00E−03 | 0.08 | 0.05 | 399 | 3 | 5 |
| 17 | 48 638 449 | 48 704 542 |
| 5.00E−03 | 0.07 | 0.04 | 399 | 5 | 7 |
| 14 | 44 973 354 | 44 976 499 |
| 5.00E−03 | 0.05 | 0.05 | 399 | 5 | 6 |
| 11 | 66 059 373 | 66 064 135 |
| 5.00E−03 | 0.05 | 0.05 | 399 | 1 | 2 |
| 10 | 72 530 995 | 72 545 157 |
| 5.00E−03 | 0.07 | 0.05 | 399 | 2 | 4 |
| 9 | 130 928 344 | 130 966 662 |
| 5.00E−03 | 0.07 | 0.05 | 399 | 6 | 8 |
Abbreviations: Chr, chromosome; FDR, false discovery rate; KBAC, kernel-based adaptive cluster; MAF, minor allele frequency.
Genome-wide variants significantly associated with major depressive disorder after using a sequential strategy
|
|
|
|
|
| P | P |
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|
| exm167893 | 1 | rs41310573 |
| T | 8.06E−10 | 1.74E−06 | 0.30 | 0.06 | c.115A>G | p.Ser39Gly |
| exm1472172 | 19 | rs201138159 |
| 9.87E−08 | 1.07E−04 | 0.17 | 0.00 | c.107A>G | p.Glu36Gly | |
| exm875366 | 11 | rs56293203 |
| G | 7.48E−07 | 5.38E−04 | 0.15 | 0.00 | c.442G>A | p.Val148Ile |
| exm1508600 | 19 | rs201935337 |
| C | 2.39E−06 | 1.29E−03 | 0.13 | 0.00 | c.136G>A | p.Gly46Arg |
| exm1616604 | 22 | rs140395831 |
| A | 1.15E−05 | 4.97E−03 | 0.10 | 0.00 | c.1252A>C | p.Ser418Arg |
| exm445797 | 5 | rs115054458 |
| C | 4.73E−05 | 1.70E−02 | 0.11 | 0.01 | c.7391C>T | p.Ala2464Val |
| exm75804 | 1 | rs62001028 |
| G | 5.70E−05 | 1.76E−02 | 0.11 | 0.00 | c.820C>T | p.Arg274Trp |
| exm1577187 | 21 | rs142151549 |
| T | 8.95E−05 | 2.42E−02 | 0.12 | 0.00 | c.4364T>G | p.Val1455Gly |
| exm646600 | 7 | rs112002983 |
| A | 9.60E−05 | 2.30E−02 | 0.11 | 0.00 | c.794A>C | p.Asp265Ala |
| exm1435859 | 19 | rs112610420 |
| T | 1.08E−04 | 2.33E−02 | 0.10 | 0.00 | c.875A>C | p.Tyr292Ser |
| exm1355772 | 17 | rs150952348 |
| T | 1.15E−04 | 2.25E−02 | 0.08 | 0.00 | c.3160A>G | p.Ile1054Val |
| exm1505393 | 19 | rs201483250 |
| C | 2.19E−04 | 3.93E−02 | 0.20 | 0.05 | c.535C>T | p.Arg179Trp |
| exm669085 | 7 | rs748441912 |
| A | 2.53E−04 | 4.19E−02 | 0.13 | 0.01 | c.2535G>A | Synonymous |
Abbreviations: Chr, chromosome; FDR, false discovery rate; Freq., frequency; HGVS, Human Genome Variation Society; rs ID, reference SNP identification number; SNP, single nucleotide polymorphism.
Fifty percent of the sample was randomly selected for discovery and the remaining 50% for join analyses.
Rare variants analysis using the KBAC method in the European-ancestry cohort: tolerated and non-conserved, non-exonic and variants that in controls had a MAF >0.01
|
|
|
|
|
| P |
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|
| 22 | 45248966 | 45435430 |
| CCDS46727.1 | 0.001 | 0.027 | 0.068 | 972 | 6 | 6 |
| 22 | 45152408 | 45288475 |
| CCDS33664.1 | 0.003 | 0.027 | 0.102 | 972 | 16 | 8 |
| 22 | 45248966 | 45434692 |
| CCDS56234.1 | 0.001 | 0.027 | 0.102 | 972 | 6 | 6 |
| 22 | 45248966 | 45435430 |
| CCDS14061.1 | 0.002 | 0.027 | 0.102 | 972 | 6 | 6 |
| 22 | 45152408 | 45288475 |
| CCDS14060.2 | 0.003 | 0.027 | 0.123 | 972 | 16 | 8 |
Abbreviations: Chr, chromosome; FDR, false discovery rate; KBAC, kernel-based adaptive cluster; MAF, minor allele frequency.
Whole-genome sequencing analysis of 25 human samples
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| AU mean ( | 390 1078 | 546 756.6 | 4 447 834.6 | 3 900 468.9 | 87.7 | 23 942.4 | 12 613.4 | 11 329 | 1 318 630 |
| AU SD ( | 37083.5 | 35237.0 | 67382.1 | 51576.2 | 0.4 | 291.1 | 142.3 | 177.9 | 8851.6 |
| MA mean ( | 772 9021.3 | 517 554.1 | 8 246 575.4 | 3 995 923.3 | 48.6 | 114 430.9 | 31 641.7 | 82 789.1 | 2 927 921 |
| MA SD ( | 413 339 | 5041.5 | 412 190.9 | 43 721.7 | 2.7 | 7998.5 | 1800.7 | 6263 | 179 407.6 |
| 2.09E−15 | 0.028 | 7.05E−16 | 0.00016 | 7.22E−19 | 1.99E−16 | 3.64E−16 | 1.83E−16 | 4.70E−15 | |
| Mann–Whitney U, | 3.59E−05 | 9.65E−04 | 3.59E−05 | 5.27E−04 | 3.59E−05 | 3.59E−05 | 3.59E−05 | 3.59E−05 | 3.59E−05 |
Abbreviations: AU, Australian; dbSNP, the single nucleotide polymorphism database; INDELS, small insertions and deletions; MA, Mexican-American; ncRNA, non-coding RNAs; SD, standard deviation; SNVs, single nucleotide variants; 3′ UTR, three prime untranslated region; 5′ UTR, five prime untranslated region.
Hippocampal quantitative gene expression analyses for PHF21B gene variants extracted from the Braineac web server (UK Brain Expression Consortium, UKBEC)
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| rs1003851 | chr22:44782265 | 3963529 | 3.90E−05 | HIPP | CC=5; CT=46; TT=76 | C=20.9% T=79.1% |
| rs117160508 | chr22:45442461 | 3963529 | 1.60E−04 | HIPP | CC=1; CG=12; GG=119 | C=5.2% G=94.8% |
| rs118154683 | chr22:45442465 | 3963529 | 1.60E−04 | HIPP | TT=1; TA=12; AA=119 | T=5.2% A=94.8% |
| rs75806917 | chr22:45442547 | 3963529 | 1.60E−04 | HIPP | GG=1; GT=11; TT=119 | G=4.9% T=95.1% |
| rs8138025 | chr22:45974757 | 3963548 | 1.90E−04 | HIPP | AA=19; AG=63; GG=52 | A=37.7% G=62.3% |
| rs5765490 | chr22:45974913 | 3963548 | 2.10E−04 | HIPP | TT=19; TC=63; CC=52 | T=37.7% C=62.3% |
| rs62228464 | chr22:45113754 | 3963550 | 6.60E−05 | HIPP | CC=18; CA=51; AA=61 | C=32.5% A=67.5% |
Abbreviations: chr, chromosome; Expr ID, expression identification; HIPP, hippocampus; rs ID, reference single nucleotide polymorphism identification number.
Figure 3Animal studies. (a) Paired t-test showed significantly increased floating time between the first (T1, baseline) and the second (T2, post-CRS) forced swim tests (FST) in the chronic restraint stress (CRS) non-resilient (nonresil, n=8) group but not in the CRS resilient (resil, n=11) and the non-CRS (n=6) groups. (b) One-way ANOVA showed that hippocampal Phf21b gene expression was significantly decreased in the CRS resil (resilient) group (n=11) but not in the CRS nonresil (non-resilient) group (n=8) comparing with the non-CRS group (n=6); columns=mean, bars=standard deviation of the mean. (s), seconds; AU, arbitrary units; **P<0.01; ***P<0.001.