| Literature DB >> 19065144 |
P F Sullivan1, E J C de Geus, G Willemsen, M R James, J H Smit, T Zandbelt, V Arolt, B T Baune, D Blackwood, S Cichon, W L Coventry, K Domschke, A Farmer, M Fava, S D Gordon, Q He, A C Heath, P Heutink, F Holsboer, W J Hoogendijk, J J Hottenga, Y Hu, M Kohli, D Lin, S Lucae, D J Macintyre, W Maier, K A McGhee, P McGuffin, G W Montgomery, W J Muir, W A Nolen, M M Nöthen, R H Perlis, K Pirlo, D Posthuma, M Rietschel, P Rizzu, A Schosser, A B Smit, J W Smoller, J-Y Tzeng, R van Dyck, M Verhage, F G Zitman, N G Martin, N R Wray, D I Boomsma, B W J H Penninx.
Abstract
Major depressive disorder (MDD) is a common complex trait with enormous public health significance. As part of the Genetic Association Information Network initiative of the US Foundation for the National Institutes of Health, we conducted a genome-wide association study of 435 291 single nucleotide polymorphisms (SNPs) genotyped in 1738 MDD cases and 1802 controls selected to be at low liability for MDD. Of the top 200, 11 signals localized to a 167 kb region overlapping the gene piccolo (PCLO, whose protein product localizes to the cytomatrix of the presynaptic active zone and is important in monoaminergic neurotransmission in the brain) with P-values of 7.7 x 10(-7) for rs2715148 and 1.2 x 10(-6) for rs2522833. We undertook replication of SNPs in this region in five independent samples (6079 MDD independent cases and 5893 controls) but no SNP exceeded the replication significance threshold when all replication samples were analyzed together. However, there was heterogeneity in the replication samples, and secondary analysis of the original sample with the sample of greatest similarity yielded P=6.4 x 10(-8) for the nonsynonymous SNP rs2522833 that gives rise to a serine to alanine substitution near a C2 calcium-binding domain of the PCLO protein. With the integrated replication effort, we present a specific hypothesis for further studies.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19065144 PMCID: PMC2717726 DOI: 10.1038/mp.2008.125
Source DB: PubMed Journal: Mol Psychiatry ISSN: 1359-4184 Impact factor: 15.992
Descriptive data for cases with MDD and controls at low liability for MDD included in the GWAS.
| Descriptor | Cases | Controls | Test |
|---|---|---|---|
| Number of subjects genotyped | 1,738 | 1,802 | - |
| Mean age in years (SD) | 42.6 (12.6) | 45.1 (14.1) | |
| Female, % | 69.6 | 62.0 | |
| Educational level (% low/middle/high) | 7.8 / 62.0 / 32.2 | 5.7 / 56.3 / 38.1 | |
| Partner status, % with partner | 68.9 | 87.0 | |
| Smoking (current), % | 42.0 | 20.2 | |
| Mean neuroticism (NEO, SD) | 39.3 (8.0) | 28.2 (5.5) | |
| MDD, age of onset in years (SD) | 27.7 (12.4) | - | |
| Family history of depression, % | 85.5 | - | |
| Recurrent MDD | 50.9 | - | |
| Family history, recurrent MDD, or early | 94.8 | - |
Figure 1GWAS results for MDD in cases versus controls. Panel a shows the quantile-quantile plots and lambda estimates for the primary analysis using the Cochran-Armitage trend test and confirmatory analyses using logistic regressions and Cochran-Mantel-Haenszel stratified tests. The dashed lines show the expected 95% probability interval for ordered p-values, and the circles show the observed versus expected values for all SNPs. The λ values are the median Χ2 from all association tests divided by the expected value under the null hypothesis of no association. If λ is large (e.g., >1.2), there is evidence that the observed test statistics deviate from the expected. This could be due to true associations but is more likely due to a systematic bias (e.g., population stratification effects). The λ values in Panel a are not consistent with the presence of stematic biases in the results. Panel b depicts −log10(p) by genomic location for chr1-chr22 plus chrX. Odd-numbered chromosomes are in light blue and even-numbered chromosomes in orange. The 25 smallest p-values are shown with green crosses.
Information on the SNPs with the smallest association p-values in the GWAS.
| Basic SNP data | Bioinformatics | Results | MAF | Quality control - | Quality control - | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SNP | Chr | Position | Alleles | Strand | Gene | TAMAL | SLEP | Rank | OR | P- | P- | q- | P-gwemp | All | Cases | Controls | HapMap_EUR | All | Cases | Controls | p- | additional |
| rs12471796 | 2 | 20,177,820 | A/G | + | 10 | 1.26 | 0.000014 | 0.000014 | 0.58 | 0.99 | 0.298 | 0.322 | 0.275 | 0.271 | 0.012 | 0.010 | 0.014 | 0.36 | ||||
| rs7565124 | 2 | 20,183,313 | A/G | + | Reg pot | 7 | 1.26 | 0.000012 | 0.000011 | 0.58 | 0.98 | 0.296 | 0.321 | 0.272 | 0.271 | 0.030 | 0.034 | 0.026 | 0.20 | |||
| rs3923028 | 2 | 29,597,247 | T/C | − | ALK | CNV | CNV, mutated in colon CA | 12 | 1.34 | 0.000024 | 0.000020 | 0.66 | 1.00 | 0.135 | 0.153 | 0.119 | 0.175 | 0.001 | 0.002 | 0.000 | 0.06 | |
| rs12621441 | 2 | 201,794,446 | A/G | + | Near CNV | 13 | 1.31 | 0.000024 | 0.000027 | 0.66 | 1.00 | 0.166 | 0.185 | 0.147 | 0.150 | 0.008 | 0.008 | 0.009 | 0.86 | |||
| rs11132168 | 4 | 184,428,336 | T/C | + | MDD linkage peak | 16 | 0.75 | 0.000029 | 0.000035 | 0.66 | 1.00 | 0.133 | 0.116 | 0.150 | 0.117 | 0.001 | 0.001 | 0.002 | 0.63 | |||
| rs17074631 | 4 | 184,652,456 | G/A | + | MDD linkage peak | 23 | 0.75 | 0.000043 | 0.000040 | 0.66 | 1.00 | 0.137 | 0.120 | 0.154 | 0.076 | 0.003 | 0.005 | 0.002 | 0.26 | |||
| rs2094923 | 6 | 14,397,061 | T/G | − | SCZ linkage meta-analsyis | 20 | 0.82 | 0.000042 | 0.000046 | 0.66 | 1.00 | 0.417 | 0.393 | 0.441 | 0.475 | 0.001 | 0.001 | 0.002 | 1.00 | |||
| rs2274822 | 6 | 14,399,068 | C/T | − | SCZ linkage meta-analsyis | 6 | 0.79 | 0.000009 | 0.000007 | 0.58 | 0.96 | 0.268 | 0.245 | 0.291 | 0.283 | 0.003 | 0.002 | 0.003 | 1.00 | |||
| rs1558477 | 7 | 30,928,587 | C/T | + | MDD linkage peak | 1 | 1.27 | 0.000001 | 0.000002 | 0.28 | 0.37 | 0.430 | 0.460 | 0.401 | 0.442 | 0.003 | 0.003 | 0.004 | 0.77 | HWD cases | ||
| rs7791986 | 7 | 30,930,719 | G/C | + | MDD linkage peak | 14 | 1.22 | 0.000026 | 0.000038 | 0.66 | 1.00 | 0.451 | 0.477 | 0.427 | 0.425 | 0.001 | 0.001 | 0.002 | 0.38 | |||
| rs2715148 | 7 | 82,094,686 | A/C | + | PCLO | Cons, reg pot | BIP GWAS rs2715148 | 2 | 0.79 | 0.000001 | 0.000003 | 0.28 | 0.42 | 0.482 | 0.452 | 0.510 | 0.525 | 0.002 | 0.002 | 0.002 | 0.72 | |
| rs2522833 | 7 | 82,098,359 | C/A | + | PCLO | Cons, reg pot, cSNP | BIP GWAS rs7781142 | 3 | 1.26 | 0.000002 | 0.000002 | 0.28 | 0.52 | 0.455 | 0.485 | 0.427 | 0.425 | 0.002 | 0.000 | 0.003 | 0.03 | |
| rs2522840 | 7 | 82,123,066 | G/T | + | PCLO | Cons, reg pot | BIP GWAS rs7799260 | 4 | 1.25 | 0.000004 | 0.000003 | 0.40 | 0.74 | 0.456 | 0.484 | 0.428 | 0.425 | 0.004 | 0.002 | 0.006 | 0.18 | |
| rs2107828 | 7 | 82,200,320 | A/T | + | PCLO | Reg pot | 8 | 0.81 | 0.000013 | 0.000007 | 0.58 | 0.99 | 0.460 | 0.433 | 0.486 | 0.500 | 0.037 | 0.036 | 0.038 | 0.79 | ||
| rs1457266 | 8 | 24,825,757 | A/G | − | Reg pot | MDD linkage peak | 17 | 0.81 | 0.000029 | 0.000034 | 0.66 | 1.00 | 0.319 | 0.295 | 0.342 | 0.300 | 0.002 | 0.002 | 0.001 | 0.44 | ||
| rs7005189 | 8 | 81,663,211 | T/C | + | Cons, reg pot | BIP GWAS rs11778905 | 15 | 0.76 | 0.000028 | 0.000036 | 0.66 | 1.00 | 0.153 | 0.134 | 0.170 | 0.150 | 0.001 | 0.000 | 0.002 | 0.25 | ||
| rs1780436 | 10 | 34,297,618 | A/G | − | 5 | 0.80 | 0.000008 | 0.000013 | 0.58 | 0.95 | 0.374 | 0.348 | 0.400 | 0.325 | 0.018 | 0.016 | 0.021 | 0.31 | ||||
| rs11031676 | 11 | 32,242,721 | T/C | + | Reg pot | 21 | 1.26 | 0.000043 | 0.000035 | 0.66 | 1.00 | 0.232 | 0.253 | 0.212 | 0.139 | 0.006 | 0.007 | 0.004 | 0.28 | |||
| rs12579771 | 12 | 44,019,689 | T/C | + | TMEM16F | Cons, reg pot | MDD linkage peak | 11 | 0.78 | 0.000022 | 0.000023 | 0.66 | 1.00 | 0.205 | 0.184 | 0.225 | 0.271 | 0.000 | 0.000 | 0.001 | 1.00 | |
| rs4765078 | 12 | 123,171,707 | C/T | + | 25 | 0.82 | 0.000044 | 0.000035 | 0.66 | 1.00 | 0.374 | 0.350 | 0.397 | 0.408 | 0.004 | 0.003 | 0.004 | 1.00 | ||||
| rs8023445 | 15 | 46,980,083 | C/T | + | SHC4 | Reg pot | 9 | 0.72 | 0.000014 | 0.000009 | 0.58 | 0.99 | 0.119 | 0.101 | 0.135 | 0.108 | 0.011 | 0.012 | 0.010 | 0.63 | ||
| rs3885179 | 19 | 14,688,830 | A/C | − | ZNF333 | Reg pot, cSNP | 18 | 0.61 | 0.000032 | 0.000031 | 0.66 | 1.00 | 0.046 | 0.035 | 0.056 | 0.033 | 0.021 | 0.024 | 0.018 | 0.24 | ||
| rs941796 | 20 | 39,724,220 | A/G | + | Reg pot | 22 | 1.22 | 0.000043 | 0.000037 | 0.66 | 1.00 | 0.398 | 0.422 | 0.374 | 0.408 | 0.013 | 0.013 | 0.012 | 0.88 | |||
| rs12480143 | 20 | 39,741,240 | G/A | + | 24 | 1.25 | 0.000044 | 0.000035 | 0.66 | 1.00 | 0.265 | 0.288 | 0.244 | 0.233 | 0.001 | 0.001 | 0.002 | 1.00 | ||||
| rs928862 | 21 | 20,559,590 | G/A | + | Reg pot | Near CNV | 19 | 0.78 | 0.000040 | 0.000044 | 0.66 | 1.00 | 0.190 | 0.170 | 0.209 | 0.167 | 0.013 | 0.012 | 0.013 | 0.77 | ||
Notes. Sorted by location. All locations per NCBI Build 35 (UCSC hg17). Alleles are given as minor/major. OR (CI)=odds ratio (95% confidence interval). P-asymptomic=p-value from Trend test. P-empirical=pointwise p-value from adaptive permuation method in PLINK. q-value=see text. P-gwemp=genomewide empirical p-value via traditional permutation testing (5000 replicates). MAF=minor allele frequency. HapMap MAFs have been converted to the reference allele of the MDD sample. P-missing tests the difference in missingness between cases and controls. For noteworthy associations, the four flags refer to acceptable cluster plots, conformation to Hardy-Weinberg equilibrium, absence of plate-specific association outliers, and the presence of a “proxy” SNP in high linkage disequilibrium with the primary SNP.
TAMAL codes. cSNP=coding SNP. Bioinformatic flag possibilities: coding SNP (cSNP), SNP in segmental duplication, known copy number variant (CNV), conserved base (Cons), miRNA target site, region of regulatory potential (reg pot), predicted promoter, transfactor binding site, enhancer, exon, splice site, mRNA expression QTL (lymphocytes or cortex). Only positive flags are shown.
SLEP=Sullivan Lab Evidence Project (http://slep.unc.edu) a compendium of genetic findings from the literature. Sources (PubMed IDs): CNVs from Database of Genomic Variation (15286789), breast and colon cancer mutations (17932254), MDD genomewide linkage studies (12612864, 14582139, 17427203), SCZ genomewide linkage meta-analysis (12802786), and bipolar disorder (BIP) GWAS (17554300).
Clustering of SNPs with low p-values.
| Rank | Chr | Start | End | Nsnps | Pmin | N | N | N | Expressed | Genes | Gene | SLEP |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 7 | 30,928,587 | 30,931,521 | 3 | 1.25E-06 | 2 | 0 | 1 | Yes | ADCYAP1R1 | adenylate cyclase activating polypeptide 1 | Neuroactive ligand-receptor interaction |
| 2 | 7 | 82,041,576 | 82,208,167 | 10 | 1.50E-06 | 6 | 4 | 0 | Yes | PCLO | piccolo | |
| 4 | 6 | 14,388,932 | 14,399,068 | 2 | 9.09E-06 | 1 | 1 | 0 | ||||
| 5 | 2 | 20,177,820 | 20,183,313 | 2 | 1.18E-05 | 2 | 0 | 0 | Yes | LAPTM4A | lysosomal-associated protein transmembrane 4 alpha | |
| 6 | 15 | 46,979,618 | 46,980,083 | 2 | 1.36E-05 | 1 | 1 | 0 | Yes | CREBBP/EP300 inhibitor 1/EP300 interacting inhibitor of | ||
| 9 | 2 | 201,794,446 | 201,880,818 | 2 | 2.44E-05 | 1 | 1 | 0 | Yes | Caspase 10./Caspase 10./PRO3098./Caspase 10 splice | CASP10 causes multiple neoplasms | |
| 14 | 20 | 39,724,220 | 39,742,644 | 5 | 4.27E-05 | 5 | 0 | 0 | ||||
| 15 | 6 | 14,386,148 | 14,397,061 | 3 | 4.23E-05 | 1 | 1 | 1 | ||||
| 16 | 4 | 184,652,456 | 184,658,003 | 3 | 4.28E-05 | 1 | 0 | 2 | ||||
| 17 | 5 | 117,174,763 | 117,282,887 | 4 | 4.84E-05 | 1 | 1 | 2 | ||||
| 19 | 10 | 127,071,672 | 127,087,021 | 3 | 0.000046 | 1 | 2 | 0 | ||||
| 20 | 5 | 22,752,605 | 22,792,155 | 3 | 4.65E-05 | 1 | 0 | 2 | Yes | CDH12 | cadherin 12, type 2 | |
| 22 | 15 | 88,130,196 | 88,136,792 | 2 | 5.52E-05 | 1 | 1 | 0 | Yes | ANPEP MESP2 | alanyl | MESP2 causes spondylocostal dysostosis |
| 23 | 8 | 54,098,247 | 54,102,064 | 2 | 4.83E-05 | 2 | 0 | 0 | ||||
| 24 | 4 | 145,875,183 | 145,878,794 | 2 | 5.47E-05 | 1 | 0 | 1 | ||||
| 27 | 11 | 32,242,721 | 32,244,520 | 2 | 4.25E-05 | 2 | 0 | 0 | ||||
| 28 | 8 | 27,249,840 | 27,379,524 | 6 | 5.38E-05 | 1 | 2 | 3 | Yes | Hypothetical protein FLJ46514./cholinergic receptor, | CHRNA2 causes nocturnal frontal lobe | |
| 29 | 3 | 12,453,817 | 12,459,985 | 2 | 0.00005 | 1 | 1 | 0 | PPARG | peroxisome proliferator-activated receptor gamma | Type 2 diabetes mellitus risk gene | |
| 32 | 3 | 99,975,821 | 100,183,009 | 2 | 7.26E-05 | 1 | 0 | 1 | Yes | DCBLD2 ST3GAL6 | discoidin, CUB and LCCL domain containing 2/ST3 beta- | |
| 34 | 3 | 70,451,852 | 70,476,913 | 2 | 8.22E-05 | 1 | 0 | 1 | ||||
| 38 | 2 | 7,424,098 | 7,440,754 | 3 | 8.95E-05 | 1 | 1 | 1 | ||||
| 41 | 5 | 54,352,635 | 54,363,712 | 3 | 0.000071 | 1 | 1 | 1 | Yes | GZMK | granzyme K | |
| 43 | 13 | 111,889,281 | 111,902,203 | 2 | 7.87E-05 | 1 | 1 | 0 | ||||
| 44 | 1 | 211,470,329 | 211,508,991 | 4 | 0.000072 | 1 | 3 | 0 | ||||
| 46 | 8 | 24,784,576 | 24,825,085 | 2 | 0.000359 | 0 | 2 | 0 | NEF3 NEFM | neurofilament 3 |
SLEP=Sullivan Lab Evidence Project (http://slep.unc.edu) a compendium of genetic findings from the literature.
Figure 2Plot of the PCLO region (NCBI build 35, UCSC hg17, chr7:82,000,000-82,500,000). P-values in this figure are all from SNPMstat. The x-axis is chromosomal position, the left y-axis −log10(p) for genotyped SNPs (colored diamonds) and imputed SNPs (grey diamonds), and the right y-axis the recombination rate from the HapMap EUR panel (light blue curve). The color of the genotyped SNPs corresponds to LD with the SNP with smallest p-value (rs2715148): red 0.8 ≤ r2 ≤ 1.0, orange 0.5 ≤ r2 < 0.8, yellow 0.2 ≤ r2 < 0.5, and white r2 < 0.2. The significant and extent of all 3-SNP haplotypes with p<0.0001 in this region are colored light green. The transcripts for two PCLO isoforms are shown in dark green at the bottom. Graph adapted from an R function by the Broad DGI group.
Figure 3PCLO region replication results for MDD showing genomic context and forest plots for the top 12 SNPs in the original sample. The backbone of the graph is the region of PCLO targeted for follow-up. SNP locations are given by the grey triangles. There are 12 forest plots for the SNPs with p<0.001 in the original sample. Each forest plot is for one SNP and shows the odds ratio (square) and 95% confidence intervals (horizontal line) for a particular sample with the area of the square proportional to sample size.
PCLO replication results.
Abbreviations: N=total sample size for an analysis, Z=logistic regression beta divided by its standard error, P=asymptotic p-value from Wald chi-square test (1 df) uncorrected for multiple comparisons, P_corr=empirical p-value from accounting for multiple comparisons and LD structure (50K permutations), and I^2=i-squared, an index of heterogeneity of logistic regression parameter estimates.