| Literature DB >> 21738487 |
Chuong B Do1, Joyce Y Tung, Elizabeth Dorfman, Amy K Kiefer, Emily M Drabant, Uta Francke, Joanna L Mountain, Samuel M Goldman, Caroline M Tanner, J William Langston, Anne Wojcicki, Nicholas Eriksson.
Abstract
Although the causes of Parkinson's disease (PD) are thought to be primarily environmental, recent studies suggest that a number of genes influence susceptibility. Using targeted case recruitment and online survey instruments, we conducted the largest case-control genome-wide association study (GWAS) of PD based on a single collection of individuals to date (3,426 cases and 29,624 controls). We discovered two novel, genome-wide significant associations with PD-rs6812193 near SCARB2 (p = 7.6 × 10(-10), OR = 0.84) and rs11868035 near SREBF1/RAI1 (p = 5.6 × 10(-8), OR = 0.85)-both replicated in an independent cohort. We also replicated 20 previously discovered genetic associations (including LRRK2, GBA, SNCA, MAPT, GAK, and the HLA region), providing support for our novel study design. Relying on a recently proposed method based on genome-wide sharing estimates between distantly related individuals, we estimated the heritability of PD to be at least 0.27. Finally, using sparse regression techniques, we constructed predictive models that account for 6%-7% of the total variance in liability and that suggest the presence of true associations just beyond genome-wide significance, as confirmed through both internal and external cross-validation. These results indicate a substantial, but by no means total, contribution of genetics underlying susceptibility to both early-onset and late-onset PD, suggesting that, despite the novel associations discovered here and elsewhere, the majority of the genetic component for Parkinson's disease remains to be discovered.Entities:
Mesh:
Year: 2011 PMID: 21738487 PMCID: PMC3121750 DOI: 10.1371/journal.pgen.1002141
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Description of cohorts.
| Description | Number | Age | % Male | Age of onset |
| 23andMe controls | 29624 | 48.2 (16.0) | 58.5% | – |
| 23andMe cases | 3426 | 64.3 (10.6) | 60.3% | 57.4 (10.7) |
| IPDGC controls | 15470 | – | – | – |
| IPDGC cases | 6584 | – | – | – |
| NINDS controls | 799 | 58.6 (16.4) | 41.9% | – |
| NINDS cases | 932 | 66.2 (11.0) | 59.8% | 58.5 (13.1) |
Age is the average current age for the 23andMe cohort and the average age at collection for the NINDS cohort. Standard deviations are given in parentheses.
GWAS results for all SNPs with in the 23andMe cohort.
| SNP | Chr | Position | Region | Alleles | MAF | Cohort | OR |
|
| rs34637584 | 12 | 39020469 |
| G/A | 0.002 | 23andMe | 9.615 (6.43–14.37) |
|
| IPDGC | – | – | ||||||
| i4000416 | 1 | 153472258 |
| T/C | 0.005 | 23andMe | 4.048 (3.08–5.32) |
|
| IPDGC | – | – | ||||||
| rs356220 | 4 | 90860363 |
| C/T | 0.375 | 23andMe | 1.285 (1.22–1.36) |
|
| IPDGC | – | – | ||||||
| rs12185268 | 17 | 41279463 |
| A/G | 0.211 | 23andMe | 0.769 (0.72–0.82) |
|
| IPDGC | – | – | ||||||
| rs10513789 | 3 | 184242767 |
| T/G | 0.201 | 23andMe | 0.803 (0.75–0.86) |
|
| IPDGC | 0.873 (0.83–0.92) |
| ||||||
| rs6812193 | 4 | 77418010 |
| C/T | 0.365 | 23andMe | 0.839 (0.79–0.89) |
|
| IPDGC | 0.90 (0.86–0.94) |
| ||||||
| rs6599389 | 4 | 929113 |
| G/A | 0.075 | 23andMe | 1.311 (1.19–1.44) |
|
| IPDGC | – | – | ||||||
| rs11868035 | 17 | 17655826 |
| G/A | 0.309 | 23andMe | 0.851 (0.80–0.90) |
|
| IPDGC | 0.95 (0.91–0.996) | 0.033 | ||||||
| rs823156 | 1 | 204031263 |
| A/G | 0.183 | 23andMe | 0.827 (0.77–0.89) |
|
| IPDGC | – | – | ||||||
| rs4130047 | 18 | 38932233 |
| T/C | 0.313 | 23andMe | 1.161 (1.10–1.23) |
|
| IPDGC | 1.077 (1.03–1.13) | 0.0014 | ||||||
| rs2823357 | 21 | 15836776 |
| G/A | 0.376 | 23andMe | 1.149 (1.09–1.21) |
|
| IPDGC | 0.971 (0.93–1.02) | 0.187 |
All genomic positions are given with respect to NCBI build 36.3. Alleles are listed as major/minor and are specified for the forward strand. Odds ratios per copy of the minor allele and -values are provided for the 23andMe cohort and, where requested, the IPDGC replication set. Minor allele frequencies are provided for the 23andMe cohort.
Figure 1Plot of -values around rs6812193 and SCARB2.
In the plot, circles represent unannotated SNPs, upside-down triangles represent non-synonymous variants, and boxes with an “x” are SNPs in regions that are highly conserved across 44 placental mammals. Colors depict the squared correlation () of each SNP with the most associated SNP (i.e., rs6812193). Purple designates the SNP with the strongest association, and gray indicates SNPs for which information was missing. Plots were produced using the LocusZoom program [71].
Figure 2Plot of -values around rs11868035 and SREBF1/RAI1.
Colors depict the squared correlation () of each SNP with rs11868035. For details, see Figure 1.
Replication of previously reported associations.
| Published SNP (Proxy) | Region | Alleles |
| 23andMe OR (CI) | Pub. OR (CI) | Pop | Ref. | Grade |
|
|
| T/C |
| 4.048 (3.08–5.32) | 3.28 (2.41–4.47) | All |
| B |
| rs356220 |
| C/T |
| 1.285 (1.22–1.36) | 1.32 (1.26–1.38) | All |
| A |
| MAPT-H1H2 (rs1876828) |
| C/T |
| 0.764 (0.71–0.82) | 0.76 (0.72–0.80) | Euro |
| A |
| rs6812193 |
| C/T |
| 0.839 (0.79–0.89) | 0.89 (NR) | Euro |
| – |
| rs823156 |
| A/G |
| 0.827 (0.77–0.89) | 0.82 (0.75–0.89) | Asian |
| A |
| rs11711441 (rs11716740) |
| C/T |
| 0.821 (0.76–0.89) | 0.82 (0.74–0.90) | Euro |
| – |
| rs11248060 |
| C/T |
| 1.202 (1.11–1.30) | 1.24 (1.10–1.40) | All |
| C |
| rs2102808 (rs9917256) |
| G/A |
| 1.199 (1.11–1.30) | 1.28 (1.20–1.36) | Euro |
| – |
| rs4698412 |
| A/G |
| 0.891 (0.84–0.94) | 0.87 (0.82–0.91) | Asian |
| A |
| rs1491942 (rs11175655) |
| G/A | 0.00016 | 1.167 (1.08–1.26) | 1.19 (1.13–1.25) | Asian |
| – |
| rs823128 |
| A/G | 0.00019 | 0.758 (0.65–0.88) | 0.70 (0.64–0.76) | Asian |
| A |
| chr1:154105678 (rs10737170) |
| A/C | 0.00047 | 1.165 (1.07–1.27) | 1.67 (1.50–1.84) | Euro |
| – |
| rs12817488 (rs11060112) |
| A/C | 0.0072 | 0.920 (0.86–0.98) | 0.86 (0.82–0.91) | Euro |
| – |
| rs2282048 (rs872606) |
| A/C | 0.0097 | 0.932 (0.88–0.98) | 0.91 (0.84–0.99) | All |
| C |
| rs12718379 |
| A/G | 0.011 | 1.072 (1.02–1.13) | 1.09 (1.01–1.18) | All |
| C |
| rs7077361 |
| T/C | 0.0114 | 0.900 (0.83–0.98) | 0.84 (NR) | Euro |
| – |
| rs10200894 | 2q36.3 | C/G | 0.0136 | 0.889 (0.81–0.98) | 0.92 (0.83–1.01) | Euro |
| C |
| rs3129882 |
| A/G | 0.0194 | 1.066 (1.01–1.13) | 1.16 (1.02–1.32) | All |
| C |
| rs4880 |
| A/G | 0.0304 | 0.943 (0.89–0.99) | 0.88 (0.74–1.04) | Asian |
| B |
| rs797906 |
| C/A | 0.0578 | 1.055 (1.00–1.11) | 1.08 (1.01–1.15) | All |
| C |
| rs7617877 | 3p24.1 | G/A | 0.0859 | 1.050 (0.99–1.11) | 1.23 (1.13–1.33) | Euro |
| – |
| rs6280 |
| T/C | 0.129 | 0.957 (0.90–1.01) | 1.08 (1.02–1.15) | All |
| C |
| rs1079597 |
| C/T | 0.157 | 1.056 (0.98–1.14) | 1.17 (1.00–1.36) | All |
| C |
| rs6710823 (rs4954218) |
| T/G | 0.194 | 1.042 (0.98–1.11) | 1.38 (1.29–1.47) | Euro |
| – |
| rs17115100 |
| G/T | 0.198 | 1.061 (0.97–1.16) | 0.80 (NR) | Euro |
| – |
| rs7412 |
| C/T | 0.2 | 1.067 (0.97–1.18) | 1.15 (1.03–1.28) | All |
| C |
| rs12063142 |
| C/T | 0.252 | 1.035 (0.98–1.10) | NR (NR) | Euro |
| – |
| rs2010795 |
| G/A | 0.277 | 1.032 (0.98–1.09) | 1.09 (1.02–1.16) | All |
| C |
| rs10464059 (rs1862326) | 5q35.3 | G/T | 0.308 | 0.962 (0.89–1.04) | 1.33 (1.19–1.52) | Euro |
| – |
| rs1799836 |
| T/C | 0.414 | 0.983 (0.94–1.03) | 1.10 (1.01–1.20) | All |
| C |
| rs11030104 |
| A/G | 0.485 | 0.977 (0.92–1.04) | 1.12 (1.04–1.22) | All |
| C |
| rs1043424 |
| A/C | 0.492 | 1.021 (0.96–1.08) | 0.91 (0.81–1.01) | Euro |
| C |
| rs1994090 |
| T/G | 0.496 | 1.023 (0.96–1.09) | 1.39 (1.24–1.56) | Asian |
| – |
| rs1801133 |
| G/A | 0.506 | 0.981 (0.93–1.04) | 1.12 (1.02–1.22) | Asian |
| B |
| rs1223271 |
| G/A | 0.553 | 0.976 (0.90–1.06) | 0.85 (NR) | Euro |
| – |
| rs17329669 | 7p14.2 | A/G | 0.56 | 0.978 (0.91–1.05) | 1.13 (1.03–1.24) | All |
| C |
| rs12431733 |
| C/T | 0.678 | 0.989 (0.94–1.04) | 1.13 (NR) | Euro |
| – |
| rs5174 |
| C/T | 0.841 | 0.994 (0.94–1.05) | 0.93 (0.87–1.00) | All |
| C |
| rs13312 |
| C/G | 0.874 | 0.995 (0.93–1.06) | 0.76 (0.66–0.86) | All |
| A |
| rs1801582 |
| C/G | 0.881 | 0.995 (0.93–1.07) | 0.79 (0.64–0.97) | All |
| C |
| rs1805874 (rs2205108) |
| G/T | 0.882 | 1.004 (0.95–1.06) | 1.12 (1.01–1.25) | All |
| C |
| rs4837628 |
| T/C | 0.925 | 0.997 (0.95–1.05) | 0.79 (0.72–0.87) | Euro |
| – |
SNPs were taken from the PDGene “Top Results” list of meta-analyses [27] and the NHGRI list of associations [57]. Alleles are given with respect to the forward genomic strand for NCBI build 36.3 and are listed as major/minor. Where a proxy SNP was used, alleles refer to the proxy SNP. Published OR is the published odds ratio per copy of the minor allele for the association, as reported in the corresponding reference. Overall grades for SNPs based on the Venice criteria [72] were taken from the PDGene list, where available, and omitted otherwise. “NR” means that an OR or CI was not reported. -values and power calculations were calculated assuming a two-sided test. Our power to replicate a given association conditional on the published odds ratio and the minor allele frequency using a 0.05 threshold for significance was above 86% for all associations except for rs10200894, for which it was 59%. “Pop” refers to the ancestry in which the association was discovered, as taken from the PDGene list or the original paper where applicable. “All” indicates that multiple studies were used in the PDGene meta-analysis, irrespective of ancestry.
Heritability estimates.
| Source | Description |
|
|
|
|
| NINDS | All PD | - | 0.229 (0.000–0.543) | 0.833 (0.500–0.935) | 0.077 (0.036- |
| 23andMe | All PD | - | 0.272 (0.229–0.315) | 0.857 (0.833–0.877) | 0.065 (0.056–0.077) |
| 23andMe | Early-onset PD ( | - | 0.306 (0.136–0.476) | 0.873 (0.766–0.933) | 0.057 (0.037–0.129) |
| 23andMe | Late-onset PD ( | - | 0.285 (0.224–0.346) | 0.863 (0.830–0.890) | 0.062 (0.051–0.078) |
|
| Twin study; broad-definition PD | - | 0.30 (0.00–0.47) | 0.87 (0.50–0.93) | 0.059 (0.037- |
|
| Twin study; all PD | - | 0.274 (0.000–0.708) | 0.858 (0.000–0.976) | 0.064 (0.025- |
|
| Early-onset PD ( | - |
| 0.996 (0.884–0.996) | 0.018 (0.018–0.053) |
|
| Late-onset PD ( | - | 0.068 (0.00–0.59) | 0.693 (0.000–0.958) | 0.258 (0.030- |
|
| Family study; all PD | - | 0.401 | 0.910 | 0.044 |
|
| Early-onset PD ( | - | 0.169 | 0.793 | 0.104 |
|
| Late-onset PD ( | - | 0.453 | 0.926 | 0.039 |
|
| Family study; all PD | - | 0.60 (0.40–0.80) | 0.96 (0.91–0.99) | 0.029 (0.022–0.044) |
|
| Offsprings; all PD | 3.0 | 0.35 | 0.89 | 0.050 |
|
| Late-onset PD ( | 3.2 | 0.38 | 0.90 | 0.046 |
|
| Parents and siblings; all PD | 3.92 | 0.456 | 0.927 | 0.038 |
|
| Early-onset PD ( | 7.76 | 0.747 | 0.980 | 0.024 |
|
| Late-onset PD ( | 2.95 | 0.348 | 0.891 | 0.050 |
denotes the heritability of liability for PD, with a 95% CI provided where available. In the case of [10], confidence intervals were estimated via a bootstrap procedure based on numbers provided in the original paper. For studies that did not provide direct estimates of heritability, the relative recurrence risk ratio was used to estimate under the assumption of no shared environmental covariance (see Materials and Methods). denotes the maximum theoretical AUC corresponding to the given heritability of liability, assuming a disease prevalence of 0.01. denotes the proportion of additive genetic variance explained by a genetic profile that achieves an AUC of 0.6 (see Materials and Methods).
Internal and external cross-validation experiments using sparse logistic regression.
| Internal Validation | External Validation | |||||
| Signif. Threshold | Avg SNPs | Avg Regions | AUC | SNPs | Regions | AUC |
|
| 9.0 | 6.6 |
| 11 | 9 |
|
|
| 18.4 | 15.4 |
| 22 | 19 |
|
|
| 41.6 | 35.0 |
| 60 | 51 |
|
|
| 156.0 | 138.8 |
| 220 | 195 |
|
|
| 698.4 | 639.2 |
| 803 | 727 |
|
The internal five-fold cross-validation experiment was performed using only the 23andMe cohort. The external cross-validation experiment was performed by training on the 23andMe cohort and testing on the NINDS cohort. “SNPs” denotes the number of SNPs included in the fitted model. “Regions” denotes the number of distinct LD blocks represented by the SNPs in the fitted model. Each AUC value represents a covariate-adjusted AUC. For the internal validation experiment, average values are provided for SNPs and Regions, providing an average over all five cross-validation folds, and AUCs were computed by pooling predictions over the five cross-validation folds. For each row of the table, the sparsity inducing prior was chosen to achieve the approximate upper bound on the expected false positive rate indicated in the first column; here, corresponds to a model containing only genome-wide significant associations, whereas corresponds to suggestive associations. In each of the internal and external validation experiments, models with AUCs in bold are significantly better than non-bold models (see Table S3).