Literature DB >> 19772629

Genomewide association study for onset age in Parkinson disease.

Jeanne C Latourelle1, Nathan Pankratz, Alexandra Dumitriu, Jemma B Wilk, Stefano Goldwurm, Gianni Pezzoli, Claudio B Mariani, Anita L DeStefano, Cheryl Halter, James F Gusella, William C Nichols, Richard H Myers, Tatiana Foroud.   

Abstract

BACKGROUND: Age at onset in Parkinson disease (PD) is a highly heritable quantitative trait for which a significant genetic influence is supported by multiple segregation analyses. Because genes associated with onset age may represent invaluable therapeutic targets to delay the disease, we sought to identify such genetic modifiers using a genomewide association study in familial PD. There have been previous genomewide association studies (GWAS) to identify genes influencing PD susceptibility, but this is the first to identify genes contributing to the variation in onset age.
METHODS: Initial analyses were performed using genotypes generated with the Illumina HumanCNV370Duo array in a sample of 857 unrelated, familial PD cases. Subsequently, a meta-analysis of imputed SNPs was performed combining the familial PD data with that from a previous GWAS of 440 idiopathic PD cases. The SNPs from the meta-analysis with the lowest p-values and consistency in the direction of effect for onset age were then genotyped in a replication sample of 747 idiopathic PD cases from the Parkinson Institute Biobank of Milan, Italy.
RESULTS: Meta-analysis across the three studies detected consistent association (p < 1 x 10(-5)) with five SNPs, none of which reached genomewide significance. On chromosome 11, the SNP with the lowest p-value (rs10767971; p = 5.4 x 10(-7)) lies between the genes QSER1 and PRRG4. Near the PARK3 linkage region on chromosome 2p13, association was observed with a SNP (rs7577851; p = 8.7 x 10(-6)) which lies in an intron of the AAK1 gene. This gene is closely related to GAK, identified as a possible PD susceptibility gene in the GWAS of the familial PD cases.
CONCLUSION: Taken together, these results suggest an influence of genes involved in endocytosis and lysosomal sorting in PD pathogenesis.

Entities:  

Mesh:

Year:  2009        PMID: 19772629      PMCID: PMC2758866          DOI: 10.1186/1471-2350-10-98

Source DB:  PubMed          Journal:  BMC Med Genet        ISSN: 1471-2350            Impact factor:   2.103


Background

Parkinson disease (PD), the second most common neurodegenerative disorder, is characterized by debilitating symptoms of tremor, rigidity, and bradykinesia, usually occurring late in life. PD incidence increases with age from 1.7/10,000 person-years between ages 50 to 59 to 9.3/10,000 person-years between ages 70 to 79 and has a prevalence of approximately 1.8% among people over the age of 65 [1]. While the average age of onset of PD is approximately 60 years, there is wide variation, with some individuals having onset before age 20 and others not until after age 90 [2,3]. Five monogenic forms of PD have been identified and characterized by mode of transmission, onset age and protein affected by mutation. These include α-synuclein (SNCA or PARK1) [4], parkin (PARK2) [5], PTEN-induced putative kinase 1 (PINK1 or PARK6) [6], DJ-1 (PARK7) [7], and leucine-rich repeat kinase 2 (LRRK2 or PARK8) [8]. Onset for PARK1 is younger than that seen for idiopathic PD [4]. PARK2 (parkin) is a recessive form with young onset, commonly before age 40. Heterozygous mutations in parkin are also associated with earlier onset of PD, typically in the early to mid sixth decade [9,10]. By contrast, PD associated with LRRK2 mutations presents an onset distribution very similar to that seen in idiopathic PD, as well as clear age-dependent penetrance [11-13]. Onset of PD has been shown to be correlated between siblings with PD [14] suggesting that genetic modifiers influence onset age. Segregation analyses in three independent studies showed evidence of a genetic effect influencing age of onset of PD [15-17]. Notably, all three of these segregation analyses showed stronger evidence for the presence of "major genes" influencing onset age or penetrance, than for genes influencing susceptibility. Furthermore, age is one of the strongest risk factors for PD, suggesting that age related penetrance is strongly associated with disease expression. By identifying genes related to onset age, it may be possible to identify pathogenic mechanisms and therapeutic targets capable of delaying onset of disease symptoms. Effectively postponing disease onset will reduce disease prevalence and ease the burden of PD in our aging population. All prior PD genome wide association studies (GWAS) have focused exclusively on the detection of susceptibility genes and none has investigated association to genes influencing onset age [18-20]. In this study, we describe the first GWAS of onset age. This GWAS included 857 PD cases with a positive family history of PD. In addition, we performed a second GWAS with onset age as the phenotype using publicly available data from 440 randomly ascertained PD cases [19]. We conducted a meta-analysis of the two studies comprising approximately 2 million SNPs imputed using HapMap data. Finally, a replication study of the top findings from the meta-analysis was performed in an independent sample of 747 randomly ascertained PD cases from Milan, Italy.

Methods

PD Cases

One PD case (n = 935) from each family recruited from two ongoing studies of familial PD, the GenePD study and the PROGENI study, was selected for the GWAS. Both studies recruited families consisting of at least two members meeting diagnostic criteria for PD. PD cases underwent a uniform neurological evaluation that employed PD diagnostic criteria based on a modified version of the United Kingdom PD Society Brain Bank Criteria [21]. Detailed descriptions of the inclusion and exclusion criteria are described elsewhere for the PROGENI [22] and GenePD [14] studies. The analyzed sample was exclusively white, non-Hispanic. No subject known to have a disease-producing mutation was included in this analysis. All cases were known to be negative for the LRRK2 G2019S mutation, and many, but not all, were also screened for PARK1(SNCA)(N screened = 702), PARK2 (parkin)(N = 593), PARK7 (DJ1)(N = 328), and NR4A2 (N = 550) [9,10,12,23-27]. PD onset age was determined by interview and reflected the age of first symptom of PD, which commonly preceded age of physician diagnosis. The reliability of ascertaining age of onset though interview compared to medical records has been estimated as 0.94 [28]. In addition, control samples (n = 895) obtained from the NINDS Human Genetics Resource Center DNA and Cell Line Repository (Camden, NJ) were intermixed on the same plates as the cases when genotyped. All selected control samples were reported to be white, non-Hispanic. While not used in the analysis of onset age of PD, these samples were used for SNP and sample quality assessment and genotype imputation as described below. Appropriate written informed consent was obtained for all samples included in this study.

Microarray Genotyping and Quality Assessment

Genotyping was performed by the Center for Inherited Disease Research (CIDR) using the Illumina HumanCNV370 version1_C BeadChips (Illumina, San Diego, CA, USA) and the Illumina Infinium II assay protocol [29]. As previously described [20], 78 cases and 28 controls were removed due to a low genotype call rate (<98% of SNPs), cryptic relatives in the sample, or population stratification. The final sample included 857 PD cases (all derived from whole blood) and 867 control DNA samples (all derived from lymphoblastoid cell lines). Genotype calls and quality scores were determined from allele cluster definitions for each SNP as determined by the Illumina BeadStudio Genotyping Module version 3.1.14 and the combined intensity data from 96% of study samples as previously described [20]. Genotype calls with a quality score (Gencall value) of 0.25 or higher were considered acceptable. Blind duplicate reproducibility was 99.98%. Although we performed additional SNP filtering described below, the CIDR released dataset contained 344,301 SNPs and is available on dbGaP (; Accession number: phs000126.v1.p1). SNPs were removed if: 1) the call rate of the SNPs was lower than 98% (n = 7,764), 2) the minor allele frequency was less than 0.01 in the combined case and control dataset (n = 7,667), 3) there were differential rates of missing genotypes in the cases and controls (n = 75) or males and females (n = 271), or 4) significant deviation from Hardy Weinberg equilibrium was observed in the control sample (n = 906). Many markers failed multiple tests. The final dataset consisted of 328,189 SNPs that passed all quality control measures (94.6% of all attempted SNPs).

Mayo-Perlegen LEAPS sample

Publicly available SNP genotype data for 443 PD cases was accessed from dbGaP for the GWAS "Mayo-Perlegen LEAPS (Linked Efforts to Accelerate Parkinson's Solutions) Collaboration" (; Accession number: phs000048.v1.p1; [19]. This data set included genotyping for the 198,345 SNPs meeting the quality control standards described by Maraganore et al. [19]. We examined these data for population stratification and for cryptic relatedness. While no population outliers were observed, three individuals were removed due to apparent second-degree relationships identified during our review of the data, leaving a sample of 440 PD cases included in these analyses. While data are also publicly available for 267 PD cases from the Fung et al. GWAS [18], that study only included cases whose age of PD onset was 55 years or greater resulting in an age distribution that was significantly different from the other two samples (p < 0.0001). Therefore, we did not include the cases from that study in our meta-analysis due to limited variability in the onset distribution.

Population stratification

Both the GenePD-PROGENI and Mayo-Perlegen LEAPS sample sets were screened for population outliers during initial QC. In addition, after the final study sample sets were determined, principle components were recalculated independently for each study using only the samples included in the final analyses. Association to onset age was then tested for the first six principle components for each study. No association (p < 0.05) between the first six principle components and onset age was seen in either study sample.

Imputation

Imputation was performed to increase the power of the meta-analysis and to facilitate the joint analysis of results generated from Mayo-Perlegen LEAPS and our GWAS, which were genotyped on different platforms. The program MACH 1.0 (compiled using Intel's optimized compiler) was used to impute genotypes for 2,543,887 autosomal SNPs characterized in the HapMap project [30-32]. The GenePD-PROGENI sample and the Mayo-Perlegen LEAPS cohort were imputed separately using phased haplotype data downloaded from the HapMap project website [33]. In the GenePD-PROGENI sample, imputation was performed using both cases and controls. A subset of 200 cases and controls with high call rates was selected to perform the initial model parameter calculation. Next, imputation was performed on all participants using all autosomal SNPs where the strand was not ambiguous (i.e. not an A/T or G/C SNP) and that passed all other quality control measures described previously. In the Mayo-Perlegen LEAPS cohort, all PD cases were used to compute the initial model parameters. All unambiguous autosomal SNPs passing quality review, as described in the original Mayo-Perlegen LEAPS GWAS [19], were used for this imputation. In addition, because the genotyping platform used in the Mayo-Perlegen LEAPS study included a high percentage of ambiguous SNPs, those ambiguous SNPs which could be confidently matched to HapMap strands through the comparison of minor allele frequencies were also used for imputation. For each study, initial model parameters were calculated using 100 iterations. SNP quality was assessed using the Rsq metric, which estimates the squared correlation between imputed and actual genotypes with SNPs having an Rsq <0.3 excluded from further study (N = 61,271). At the time of this study, only autosomal imputation is supported by MACH; therefore, the program IMPUTE was used to impute SNPs on the X chromosome [34,35]. Haplotype, legend, and recombination rate files based upon HapMap (rel#21-NCBI build 35) were downloaded from IMPUTE's website for use in imputing 64,621 SNPs in the non-pseudoautosomal region of the X chromosome. Imputation was run in all participants in the GenePD-PROGENI study and separately in all PD cases in the Mayo-Perlegen LEAPS GWAS, using all X-chromosome SNPs that passed quality assessment as described above and were found in the legend file provided by IMPUTE.

Statistical Analyses

In order to meta-analyze results from both studies, association to onset age was performed using exclusively imputed, not genotyped data, for 1,861,750 SNPs passing imputation QC and with minor allele frequencies greater than 10%. Imputed SNPs with minor allele frequencies less than 10% in the GenePD-PROGENI sample (N = 620,866) were excluded from the association analyses to avoid false positives that can occur in the analysis of low minor allele frequency SNPs. To evaluate SNP association under an additive mode of inheritance, the predicted allele dosage for each genotype estimated by MACH for autosomes or by IMPUTE for the X chromosome was used. To model the recessive and dominant modes of inheritance, the genotype probabilities calculated by MACH were used. While a combination of additive and recessive models can have high power to detect dominant effects, this power decreases as the minor allele frequency of the SNP nears 0.5 [36]. Therefore, we studied additive, dominant and recessive models. The total probability of having either one copy or two copies of the minor allele was used for the dominant model and the probability of having two copies of the minor allele was used for the recessive model. Recessive and dominant models were not studied for the X chromosome. Linear regression analyses were performed using SAS v9.1.

Meta-Analysis

Since the genotypic data was generated on different arrays (Illumina and Perlegen) with few SNPs in common, we employed a conservative meta-analytic approach to combine results from the two studies. Meta-analysis of the results of the linear regression of the imputed data for the GenePD-PROGENI sample and the Mayo-Perlegen LEAPS cohort was performed using METAL . As is common in GWAS meta-analyses, a fixed effects model with standard error weighting was used, as random-effects models may be too conservative in GWA studies with a small number of studies[37].

Replication Study

In order to validate the top findings from the GWAS meta-analysis, an additional sample of 896 PD cases with reported ages of PD onset was provided by the Parkinson Institute - Istituti Clinici di Perfezionamento, Milan, Italy from the "Human Genetic Bank of Patients Affected by PD and Parkinsonisms". These cases were recruited irrespective of family history or onset age, and similar to the GenePD-PROGENI, used the UK Parkinson's Disease Society Brain Bank criteria to confirm idiopathic PD [38] and defined onset as the age of first symptom of PD. Twenty-four SNPs were selected based upon the following criteria (1) a p-value less than 0.00001 in the meta-analysis of the two GWAS, with (2) a consistent direction of effect in both studies. Nine SNPs meeting these criteria were identified from the additive inheritance model, thirteen from the dominant model and eleven from the recessive model. Four SNPs were identified in both the additive and dominant model, one SNP was identified in both the additive and recessive model and one SNP was identified by all three models. Three gene regions were identified under two different genetic models with different SNPs. For each multiply-nominated gene region, the SNP from the model with the smaller p-value was selected for replication. These SNPs were genotyped using TaqMan technology implemented on the ABI PRISM® 7900HT Sequence Detection system (Applied Biosystems: Foster City, CA) at Boston University School of Medicine. Individual samples (149) with genotyping call rates of less than 95% were excluded from further analysis. Association of the 24 SNPs to onset age was evaluated in the 747 remaining Italian cases using linear regression performed with the software Plink v1.01 [39] using the corresponding genetic model (additive, recessive or dominant) by which each SNP was originally identified. A final fixed effects meta-analysis of all three studies was performed using METAL. To distinguish whether associations observed were to age in general, as opposed to age at onset of PD, linear regression to censoring age was performed in the 867 NINDS control samples genotyped with the GenePD-PROGENI cases.

Results

Demographic characteristics of the three samples studied are shown in Table 1. All three studies have a similar percentage of male participants. The GenePD-PROGENI and Mayo-Perlegen LEAPS samples have similar mean ages of PD onset, while the Italian sample has a somewhat younger average age at onset. The GenePD-PROGENI sample has the widest range of onset ages from 19 to 90 years while the Mayo-Perlegen LEAPS has no participants under 30 years of age and the Italian sample has no participants over 81 years of age. No significant differences in onset age are seen between men and women in any of the studies.
Table 1

Age and sex distribution of the GenePD- PROGENI, Mayo-Perlegen LEAPS, and Milan, Italian study samples.

N% MaleMean OnsetStd Dev OnsetRange Onset
GenePD-PROGENI85759.261.910.919-90

Mayo-Perlegen LEAPS44061.661.011.230-94

Milan, Italian74759.455.210.820-81
Age and sex distribution of the GenePD- PROGENI, Mayo-Perlegen LEAPS, and Milan, Italian study samples. Supplementary Tables S1 (additive model), S2 (dominant model) and S3 (recessive model) (see Additional file 1) present the top SNPs from each region with a meta-analysis p < 0.0001 for imputed SNP data in the GenePD-PROGENI and Mayo-Perlegen LEAPS studies. Twenty-four SNPs with meta-analysis p-values of less than 0.00001 and with a consistent direction of effect for both GWAS were genotyped in the Italian replication sample of 747 PD cases (Table 2). SNPs genotyped in either the GenePD-PROGENI or Mayo-Perlegen LEAPS GWAS platforms are distinguished by notation from those SNPs imputed by both studies. The results of the association analysis in the Italian sample, as well as the combined meta-analysis of all three samples, are shown in Tables 3 (additive), 4 (dominant), and 5 (recessive) and in Figure 1.
Table 2

Location and minor allele frequency (MAF) information for 24 SNPs selected for replication study in Italian sample

ChrSNPSNP positionaMinor AlleleMAFGenes in Regionb
1rs75564472,335,959G0.24PEX10

1rs1355637c148,849,062T0.19MCL1|ENSA

1rs10918270160,182,125A0.41ATF6

2rs1189912120,231,454C0.50SDC1

2rs7577851c69,577,214T0.17AAK1

2rs17817190134,068,118G0.11NAP5

6rs693638842,275,464T0.25CCND3|TAF8|GUCA1A|GUCA1B|MRPS10

6rs1572662154,853,289A0.36CNKSR3|CLDN20|RBM16|TIAM2|TFB1M

7rs1420143c29,514,849C0.44CHN2

7rs1766398376,243,177T0.12

10rs1226173652,738,610T0.28PRKG1

10rs7076519127,738,829C0.33ADAM12

11rs10767971c32,852,240T0.42PRRG4|QSER1

12rs1282969730,067,223G0.13

12rs10773917130,572,784T0.23

13rs4771006d26,180,684A0.49WASF3

13rs676495c29,561,754T0.24

14rs1014748647,054,769T0.27MDGA2

15rs1756584125,670,842A0.10OCA2

17rs990457212,803,084A0.34RICH2

17rs4791571c14,155,801A0.42HS3ST3B1

18rs194118427,292,121C0.30DSG3

19rs1042013440,953,768G0.18C19orf55|SNX26

20rs388794223,925,072G0.19GGTLC1

a From NCBI Build 36 reference; b Nearest genes within 100 kb of SNP;c SNP was genotyped in Mayo-Perlegen LEAPS GWAS;d SNP was genotyped in GenePD-PROGENI GWAS

Table 3

Additive model study-specific and meta-analysis results

GenePD-PROGENIMayo-Perlegen LEAPSGWAS Meta-analysisMilan, ItalianThree Sample Meta-analysis
SNPeffectp-valueeffectp-valueDirection of Effecteffectp-valueeffectp-valueDirection of Effecteffectp-value

rs17565841-4.662.7 × 10-6-3.641.7 × 10-1---4.539.1 × 10-7-1.575.0 × 10-2----2.842.6 × 10-6

rs10918270-2.121.1 × 10-4-1.901.6 × 10-2---2.054.8 × 10-6-0.622.7 × 10-1----1.492.0 × 10-5

rs12261736-3.611.8 × 10-7-1.174.0 × 10-1---3.133.5 × 10-7-0.424.5 × 10-1----1.656.6 × 10-5

rs47710062.351.6 × 10-32.591.6 × 10-3++2.467.3 × 10-60.503.8 × 10-1+++1.521.2 × 10-4

rs14201432.081.4 × 10-42.588.1 × 10-3++2.203.6 × 10-60.029.8 × 10-1+++1.274.1 × 10-4

rs69363882.454.2 × 10-42.987.2 × 10-3++2.609.2 × 10-6-0.317.0 × 10-1++-1.588.2 × 10-4

rs101474862.213.1 × 10-42.898.0 × 10-4++2.449.2 × 10-7-0.693.0 × 10-1++-1.319.5 × 10-4

rs7076519-2.719.3 × 10-6-2.532.5 × 10-2---2.676.0 × 10-70.681.8 × 10-1--+-0.921.3 × 10-2
Table 4

Dominant model study-specific and meta-analysis results

GenePD-PROGENIMayo-Perlegen LEAPSGWAS Meta-analysisMilan, ItalianThree Sample Meta-analysis
SNPeffectp-valueeffectp-valueDirection of Effecteffectp-valueeffectp-valueDirection of Effecteffectp-value

rs17565841-5.084.7 × 10-6-3.682.0 × 10-1---4.901.9 × 10-6-1.943.6 × 10-2----3.262.1 × 10-6

rs1941184-2.941.3 × 10-4-2.941.0 × 10-2---2.943.7 × 10-6-1.251.2 × 10-1----2.284.3 × 10-6

rs10918270-3.262.9 × 10-5-2.741.5 × 10-2---3.091.2 × 10-6-0.872.9 × 10-1----2.267.5 × 10-6

rs38879424.177.2 × 10-47.101.5 × 10-3++4.856.4 × 10-60.932.6 × 10-1+++2.372.7 × 10-4

rs12261736-4.039.5 × 10-6-2.042.8 × 10-1---3.667.3 × 10-6-0.465.7 × 10-1----2.073.5 × 10-4

rs107739173.153.8 × 10-52.583.5 × 10-2++2.993.7 × 10-6-0.138.7 × 10-1++-1.754.8 × 10-4

rs10420134-3.215.8 × 10-5-3.125.1 × 10-2---3.197.1 × 10-60.565.1 × 10-1--+-1.672.3 × 10-3

rs99045723.574.3 × 10-62.726.4 × 10-2++3.387.2 × 10-7-1.051.9 × 10-1++-1.553.1 × 10-3

rs17817190-3.873.9 × 10-5-5.272.1 × 10-2---4.072.5 × 10-61.302.1 × 10-1--+-1.855.3 × 10-3

rs15726623.574.1 × 10-54.897.5 × 10-2++3.697.8 × 10-6-0.674.1 × 10-1++-1.451.2 × 10-2

rs7076519-3.731.2 × 10-5-3.813.0 × 10-2---3.749.3 × 10-71.486.2 × 10-2--+-1.232.6 × 10-2
Table 5

Recessive model study-specific and meta-analysis results

GenePD-PROGENIMayo-Perlegen LEAPSGWAS Meta-analysisMilan, ItalianThree Sample Meta-analysis
SNPeffectp-valueeffectp-valueDirection of Effecteffectp-valueeffectp-valueDirection of Effecteffectp-value

rs107679714.239.3 × 10-62.351.3 × 10-1++3.714.3 × 10-62.402.6 × 10-2+++3.245.4 × 10-7

rs7577851-8.641.2 × 10-4-8.033.1 × 10-2---8.489.8 × 10-6-3.891.3 × 10-1----6.858.7 × 10-6

rs1355637-5.102.3 × 10-3-14.254.1 × 10-5---6.835.1 × 10-61.106.0 × 10-1--+-4.196.2 × 10-4

rs11899121-3.418.0 × 10-5-3.331.6 × 10-2---3.393.5 × 10-60.436.5 × 10-1--+-1.996.3 × 10-4

rs12829697-10.518.5 × 10-5-10.333.0 × 10-2---10.477.0 × 10-60.688.1 × 10-1--+-6.157.2 × 10-4

rs17663983-16.721.2 × 10-5-25.534.6 × 10-1---16.838.1 × 10-65.772.9 × 10-1--+-9.462.3 × 10-3

rs12261736-8.481.9 × 10-6-0.568.9 × 10-1---7.149.6 × 10-6-0.734.9 × 10-1----2.652.7 × 10-3

rs14201433.317.9 × 10-46.051.3 × 10-3++3.907.3 × 10-6-0.525.9 × 10-1++-1.893.3 × 10-3

rs676495-6.901.9 × 10-5-11.272.4 × 10-1---7.029.1 × 10-61.862.7 × 10-1--+-2.841.4 × 10-2

rs7556447-13.342.1 × 10-5-11.111.9 × 10-1---13.087.9 × 10-6-0.447.7 × 10-1----3.012.3 × 10-2

rs4791571-5.181.3 × 10-5-6.392.4 × 10-1---5.235.9 × 10-62.573.5 × 10-2--+-1.526.9 × 10-2
Figure 1

Forest plots showing study-specific and pooled effects for top six results in final meta-analysis. The effect (in years) on age at PD onset and 95% confidence intervals for the indicated genetic model are shown for each of the three independent samples and the combined analysis. A. Additive model of the A allele in SNP rs17565841 B. Dominant model of the A allele in SNP rs17565841 C. Dominant model of the C allele in SNP rs1941184 D. Dominant model of the A allele in SNP rs10918270 E. Recessive model of the T allele in SNP rs10767971 F. Recessive model of the T allele in SNP rs7577851.

Location and minor allele frequency (MAF) information for 24 SNPs selected for replication study in Italian sample a From NCBI Build 36 reference; b Nearest genes within 100 kb of SNP;c SNP was genotyped in Mayo-Perlegen LEAPS GWAS;d SNP was genotyped in GenePD-PROGENI GWAS Additive model study-specific and meta-analysis results Dominant model study-specific and meta-analysis results Recessive model study-specific and meta-analysis results Forest plots showing study-specific and pooled effects for top six results in final meta-analysis. The effect (in years) on age at PD onset and 95% confidence intervals for the indicated genetic model are shown for each of the three independent samples and the combined analysis. A. Additive model of the A allele in SNP rs17565841 B. Dominant model of the A allele in SNP rs17565841 C. Dominant model of the C allele in SNP rs1941184 D. Dominant model of the A allele in SNP rs10918270 E. Recessive model of the T allele in SNP rs10767971 F. Recessive model of the T allele in SNP rs7577851. Although ten of the replication SNPs showed a consistent direction of effect across all three studies, only two SNPs (both of which were genotyped on the Mayo-Perlegen LEAPS GWAS platforms) resulted in increased statistical evidence of association when combining the three studies, and both of these were in a recessive model. The most strongly associated SNP, rs10767971, located on chromosome 11 between the genes QSER1 and PRRG4, was associated with a 3.2 year older PD onset in individuals with 2 copies of the minor allele (p = 5.4 × 10-7 in the 3 sample meta-analysis, compared to 4.3 × 10-6 in the 2 sample meta-analysis). Conversely, an estimated 6.9 year earlier age of onset (p = 8.7 × 10-6 in the 3 sample meta-analysis compared to 9.8 × 10-6 in the 2 sample meta-analysis) was observed for the SNP rs7577851, located in the 16th intron of the gene AAK1 on chromosome 2. The most highly associated SNP in both the additive and dominant three sample meta-analyses, rs17565841, is located approximately 3 kb from the 3' end of the gene OCA2 on chromosome 15. This SNP was associated with an average 2.8 years younger onset age (p = 2.6 × 10-6) under an additive model and a 3.3 years younger onset age (p = 2.1 × 10-6) under a dominant model. Inclusion of the Italian cases in the meta-analysis did not strengthen the evidence of association as compared to the results seen in the two sample meta-analyses (9.1 × 10-7 for additive and 1.9 × 10-6 for dominant). However, in the Italian replication sample, this SNP did provide modest statistical association to onset age (p = 0.05 for additive and p = 0.04 for dominant) with the same direction of effect seen in the two other studies. Also showing consistent directions of effect and p-values in the three-sample meta-analysis at the level of p < 1 × 10-5 were two SNPs located in the genes DSG3 and ATF6. The SNP rs1941184, had the second best p-value identified under the dominant model and is located in the third intron of the gene DSG3 on chromosome 18. This SNP was associated with an average 2.3 year younger age of onset of PD across the three studies (p = 4.3 × 10-6). The SNP rs10918270, located in the 15th intron of the gene ATF6 on chromosome 1, was identified under both additive and dominant modes of inheritance, but showed stronger association in the three sample meta-analysis under the dominant model (p = 7.5 × 10-6) with an average 2.3 year younger onset of PD. No association to age with direction of effect consistent with that observed for onset age of PD was seen in the control sample for any of the 24 SNPs at a significance level equal to 0.05.

Discussion

We present results from the first GWAS for age at onset of PD, including a meta-analysis with the publicly available Mayo-Perlegen LEAPS GWAS data (dbGaP Study Accession: phs000048.v1.p1) and a follow-up replication study in an independent PD sample recruited in Milan, Italy. Differences were observed in the age distributions in the three populations used in this study (Table 1). However, there were no imposed age restrictions in any of the studies and a wide distribution of ages was represented in all three populations, each with a range of greater than 60 years. No SNP reached the commonly accepted criterion for genome-wide significance of p < 5 × 10-8 [40], which is based on recent estimates of independent genomewide sequence variation to maintain 5% genomewide type I error rate [41,42]. While this criterion provides an appropriate cutoff for determining significance for the large number of SNPs provided by imputation, this measure does not account for the testing of multiple genetic models, as was performed in this study. Despite the lack of genomewide significance, the meta-analysis in this study showed evidence of several interesting loci with consistent effects on onset age of PD across the three independent populations studied. The SNP with the strongest evidence for association to onset age, rs10767971, is associated with a later age of onset under a recessive model. This SNP is nearly equidistant between the genes PRRG4 and QSER1 on chromosome 11, at just under 20 kb from each. The association results for nearby SNPs studied in the meta-analysis of the two GWAS as well as the LD structure and recombination rates for the region are shown in Figure 2A and 2B under recessive and dominant models. While the strongest evidence of association appeared with a recessive model, many nearby SNPs in strong LD showed similar association under the dominant model. The pattern suggests a single association, spanning several SNPs that may be more evident around the PRRG4 gene and may extend toward the CCDC73 and EIF3M genes.
Figure 2

Evidence of association in Chromosome 11 region. Along the x axis is the physical position in the region (in kB) with known genes shown in their orientation The left Y-axis denotes the association test result as -log(p-value) corresponding to diamonds in the figure. The blue diamond identifies the primary SNP result labeled with an rs# and p-value. The color of additional diamonds depicts the pairwise linkage disequilibrium with the primary SNP: red indicates r2 > 0.8, orange 0.5 < r2 < 0.8, yellow 0.2 < r2 < 0.5, white r2 < 0.2, grey unknown LD. R2 values were obtained from the CEPH HapMap data. The right Y-axis indicates the recombination rate, also obtained from CEPH HapMap data, and shown within the figure by the solid light-blue line. A. Recessive model B. Dominant model.

Evidence of association in Chromosome 11 region. Along the x axis is the physical position in the region (in kB) with known genes shown in their orientation The left Y-axis denotes the association test result as -log(p-value) corresponding to diamonds in the figure. The blue diamond identifies the primary SNP result labeled with an rs# and p-value. The color of additional diamonds depicts the pairwise linkage disequilibrium with the primary SNP: red indicates r2 > 0.8, orange 0.5 < r2 < 0.8, yellow 0.2 < r2 < 0.5, white r2 < 0.2, grey unknown LD. R2 values were obtained from the CEPH HapMap data. The right Y-axis indicates the recombination rate, also obtained from CEPH HapMap data, and shown within the figure by the solid light-blue line. A. Recessive model B. Dominant model. The SNP showing the strongest association to PD onset across the three populations under both the additive and dominant models lies 3 kb from the 3' end of the gene OCA2 (oculocutaneous albinism II). Mutations in OCA2 are responsible for the most common form of albinism, oculocutaneous albinism type 2 [43] and several studies have shown this gene to be associated with common skin, hair and eye color variation found in European populations [43-48]. Variations in OCA2 have been associated with susceptibility to melanoma [49], which has also been reported to occur with increased frequency among PD cases [50-52]. Because of the targeted degradation of pigmented neurons in PD brains, these associations have lead to a hypothesized link between genes involved in pigmentation, such as OCA2, and PD, likely mediated through common elements in melanin and neuromelanin synthesis[53]. The protein encoded by OCA2, 'P' protein, is involved in the transport of tyrosine, a precursor to melanin, as well as in the regulation of melanosomal pH which may be key to the initiation of the enzyme controlling melanin synthesis in melanocytes [54]. It is not clear whether synthesis of neuromelanin is also regulated through melanosomal pH or is affected by the 'P' protein in a similar way [55]. Nevertheless, the association between OCA2 and younger age of onset are suggestive of a neuromelanin-related mechanism of effect. The association of an intronic SNP in the gene DSG3 (desmoglein 3 (pemphigus vulgaris antigen)) may also be indicative of a neuromelanin related effect on onset age of PD. The protein encoded by DSG3 is the autoantigen for the autoimmune skin disease pemphigus vulgaris and this gene is expressed primarily in skin. Interestingly, some reports have demonstrated increased expression of DSG3 in melanocytes compared to keratinocytes (the most common cell type in the epidermis) [56]. An intronic SNP in the gene ATF6 also showed strong association to earlier age of onset of PD. ATF6 (activating transcription factor 6) transcribes a transcription factor localized to the endoplasmic reticulum (ER). The ATF6 protein is a critical regulator of the unfolded protein response (UPR), a highly conserved pathway activated in response to ER stress, and is a protective cellular response to the accumulation of misfolded proteins [57-59]. The UPR has been implicated in neurotoxin based cellular models of PD [60] and has also been shown to be activated by the over-expression of α-synuclein in yeast cells [61]. More recently, postmortem studies of PD case and control brains have shown activation of the UPR in cases, but not controls and that this activation is associated with the aggregation of α-synuclein [62]. The finding of an intronic SNP in AAK1 associated with PD onset age is intriguing because of the gene's genomic location, its function, and its close relation to a gene identified for PD susceptibility. The AAK1 (AP2 associated kinase 1) gene is located on chromosome 2p14, near 2p13, which has previously been implicated as the PARK3 locus[63], and for which linkage to onset age was demonstrated in both the GenePD [64] and PROGENI [65] studies. The AAK1 gene itself has not been previously implicated by positional mapping, but a microarray study of PD brain compared to controls demonstrated differential expression of AAK1 [66]. In our past GWAS of PD susceptibility in the GenePD-PROGENI cohort [20], the region containing the gene GAK (cyclin G-associated kinase) had the strongest evidence for association (chromosome 4p). The AAK1 and GAK genes both function at multiple steps in clathrin-mediated vesicular transport and the two kinases likely have some redundant functions [67] related to their homologous serine/threonine-kinase domain [68]. Recently, cathepsin D was implicated as the main lysosomal enzyme involved in α-synuclein degredation [69], and depletion of GAK was shown to impair the lysosomal sorting of cathepsin D [68]. Thus, the finding that AAK1 influences onset age and GAK influences risk in familial PD suggest that pathways involving lysosomal activity influence PD risk. Several previous genome scans have provided evidence of loci linked to onset age, including 2p13 seen in both the GenePD and PROGENI studies, as mentioned above. Evidence of linkage has also been reported on chromosomes 1p, 1q, 8q, 9q, 10q, 20 and 21[64,65,70], but there is little overlap between these linkage regions and our top meta-analysis association results. Aside from the identification of the AAK1 gene near 2p13, the strongest association result observed under one of the previously reported linkage regions occurs near the gene sortilin-related VPS10 domain containing receptor 3 (SORCS3) located near the LOD score peak at 10q, originally identified in a combined linkage scan of onset age in PD and Alzheimer's disease [70]. This association is seen most strongly under a dominant model (p = 2.5 × 10-5) with minor allele carriers having an estimated older onset age by 3.3 years (see Additional file 1: Table S2). A final region of interest is 15q26.2 that includes the gene MCTP2. Several SNPs in this region showed association to earlier onset age of PD in the meta-analysis of the two GWAS under the recessive model (see Additional file 1: Table S3). These SNPs did not reach the criteria for inclusion in the replication study, as the strongest p-value seen was 2.2 × 10-5 with rs17504636. This SNP was associated with a 9.2 year earlier average PD onset. This region overlaps with a SNP reported in the susceptibility GWAS including these cases [20]. In that study the SNP rs4476132 was associated with PD susceptibility with an odds ratio of 1.3 (p = 7.7 × 10-5- meta-analysis with additive model). This overlap is consistent with a locus associated with a risk for younger-onset PD or with an effect modifying age dependent penetrance. The gene in this region, MCTP2 (multiple C2 domains, transmembrane 2), is expressed in the brain and has been implicated in linkage and association studies of abdominal fat [71] and major depression [72]. Important distinctions can be made between those genes that influence susceptibility for developing disease, and the genetic modifiers that influence penetrance or, as studied here, onset age. Perhaps the best examples for genetic modifiers are seen for Huntington's disease (HD) where an expanded CAG trinucleotide repeat on chromosome 4p16.3 causes the disease, but wide variation in onset age is evident for individuals with identical repeat lengths. The identification of the genes that presumably interact with huntingtin to produce relatively younger or older onset for a given repeat size provide insight into the pathogenic mechanisms for HD, as well as therapeutic targets for intervention [73-75]. Similarly, identifying those genes and their products that are associated with older onset in PD may provide insight into the disease mechanisms and processes for delaying onset with implications for novel treatments.

Conclusion

The identification of the 15q26.2 region as well as the related genes AAK1 and GAK in the studies of PD onset age and susceptibility highlights the importance of the continued study of both of these traits, both separately and in combination. The direct overlap in affection and onset age association results in the 15q26.2 region shows this to be a candidate region that would benefit from further examination with consideration of important age-related effects, for example in studies correlating expression of genes in this region with onset age. The identification of association to onset age with the gene AAK1, in the same pathway as a previously identified susceptibility-associated gene GAK highlights the importance of genetic pathways in PD etiology, showing that the genes along the same pathway may have redundant effects or may modify disease pathology different ways, observed by differences in disease onset and progression. Studying PD in the context of onset age provides fundamental insight into the disease process and is essential to understanding mechanisms that modify disease penetrance and therefore may be key in identifying therapeutic targets.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

JCL participated in conception and design of the study, conducted statistical analyses, participated in the interpretation of data, and drafted the manuscript. NP, AD and JBW participated in the conception and design of the study, conducted statistical analyses, participated in the interpretation of data, and revised the article critically for important intellectual content. SG, GP, CBM, ALD, CH, JFG, WCN, RHM, and TF participated in the conception and design of the study, participated in the interpretation of data, and revised article critically for important intellectual content. All authors read and approved the final manuscript.

Pre-publication history

The pre-publication history for this paper can be accessed here:

Additional file 1

Supplementary Tables. The excel file includes three supplementary tables. Supplementary Table S1 includes regions showing associations at p < 0.0001 for the additive model GWAS meta-analysis and study-specific results. Supplementary Table S2 includes regions showing associations at p < 0.0001 for the dominant model GWAS meta-analysis and study-specific results. Supplementary Table S3 includes Regions showing associations at p < 0.0001 for the recessive model GWAS meta-analysis and study-specific results Click here for file
  74 in total

1.  Endoplasmic reticulum stress and the unfolded protein response in cellular models of Parkinson's disease.

Authors:  Elizabeth J Ryu; Heather P Harding; James M Angelastro; Ottavio V Vitolo; David Ron; Lloyd A Greene
Journal:  J Neurosci       Date:  2002-12-15       Impact factor: 6.167

2.  Practical aspects of imputation-driven meta-analysis of genome-wide association studies.

Authors:  Paul I W de Bakker; Manuel A R Ferreira; Xiaoming Jia; Benjamin M Neale; Soumya Raychaudhuri; Benjamin F Voight
Journal:  Hum Mol Genet       Date:  2008-10-15       Impact factor: 6.150

3.  Reliability of reported age at onset for Parkinson's disease.

Authors:  Carson R Reider; Cheryl A Halter; Peter F Castelluccio; David Oakes; William C Nichols; Tatiana Foroud
Journal:  Mov Disord       Date:  2003-03       Impact factor: 10.338

4.  Heterozygosity for a mutation in the parkin gene leads to later onset Parkinson disease.

Authors:  T Foroud; S K Uniacke; L Liu; N Pankratz; A Rudolph; C Halter; C Shults; K Marder; P M Conneally; W C Nichols
Journal:  Neurology       Date:  2003-03-11       Impact factor: 9.910

5.  Cathepsin D is the main lysosomal enzyme involved in the degradation of alpha-synuclein and generation of its carboxy-terminally truncated species.

Authors:  Daniel Sevlever; Peizhou Jiang; Shu-Hui C Yen
Journal:  Biochemistry       Date:  2008-08-15       Impact factor: 3.162

6.  Alpha-synuclein and familial Parkinson's disease.

Authors:  Nathan Pankratz; William C Nichols; Veronika E Elsaesser; Michael W Pauciulo; Diane K Marek; Cheryl A Halter; Joanne Wojcieszek; Alice Rudolph; Ronald F Pfeiffer; Tatiana Foroud
Journal:  Mov Disord       Date:  2009-06-15       Impact factor: 10.338

7.  HFE gene mutations in a population of Italian Parkinson's disease patients.

Authors:  Giorgio Biasiotto; Stefano Goldwurm; Dario Finazzi; Sara Tunesi; Anna Zecchinelli; Francesca Sironi; Gianni Pezzoli; Paolo Arosio
Journal:  Parkinsonism Relat Disord       Date:  2008-03-05       Impact factor: 4.891

8.  Pigmentation genes link Parkinson's disease to melanoma, opening a window on both etiologies.

Authors:  Elena Herrero Hernández
Journal:  Med Hypotheses       Date:  2008-11-21       Impact factor: 1.538

9.  Genomewide association study for susceptibility genes contributing to familial Parkinson disease.

Authors:  Nathan Pankratz; Jemma B Wilk; Jeanne C Latourelle; Anita L DeStefano; Cheryl Halter; Elizabeth W Pugh; Kimberly F Doheny; James F Gusella; William C Nichols; Tatiana Foroud; Richard H Myers
Journal:  Hum Genet       Date:  2008-11-06       Impact factor: 4.132

10.  The Gly2019Ser mutation in LRRK2 is not fully penetrant in familial Parkinson's disease: the GenePD study.

Authors:  Jeanne C Latourelle; Mei Sun; Mark F Lew; Oksana Suchowersky; Christine Klein; Lawrence I Golbe; Margery H Mark; John H Growdon; G Frederick Wooten; Ray L Watts; Mark Guttman; Brad A Racette; Joel S Perlmutter; Anwar Ahmed; Holly A Shill; Carlos Singer; Stefano Goldwurm; Gianni Pezzoli; Michela Zini; Marie H Saint-Hilaire; Audrey E Hendricks; Sally Williamson; Michael W Nagle; Jemma B Wilk; Tiffany Massood; Karen W Huskey; Jason M Laramie; Anita L DeStefano; Kenneth B Baker; Ilia Itin; Irene Litvan; Garth Nicholson; Alastair Corbett; Martha Nance; Edward Drasby; Stuart Isaacson; David J Burn; Patrick F Chinnery; Peter P Pramstaller; Jomana Al-hinti; Anette T Moller; Karen Ostergaard; Scott J Sherman; Richard Roxburgh; Barry Snow; John T Slevin; Franca Cambi; James F Gusella; Richard H Myers
Journal:  BMC Med       Date:  2008-11-05       Impact factor: 8.775

View more
  51 in total

1.  AAK1 identified as an inhibitor of neuregulin-1/ErbB4-dependent neurotrophic factor signaling using integrative chemical genomics and proteomics.

Authors:  Letian Kuai; Shao-En Ong; Jon M Madison; Xiang Wang; Jeremy R Duvall; Timothy A Lewis; Catherine J Luce; Sean D Conner; David A Pearlman; John L Wood; Stuart L Schreiber; Steven A Carr; Edward M Scolnick; Stephen J Haggarty
Journal:  Chem Biol       Date:  2011-07-29

Review 2.  Advances in the discovery of genetic risk factors for complex forms of neurodegenerative disorders: contemporary approaches, success, challenges and prospects.

Authors:  Sumeet Kumar; Navneesh Yadav; Sanjay Pandey; B K Thelma
Journal:  J Genet       Date:  2018-07       Impact factor: 1.166

Review 3.  Insights into the Molecular Mechanisms of Alzheimer's and Parkinson's Diseases with Molecular Simulations: Understanding the Roles of Artificial and Pathological Missense Mutations in Intrinsically Disordered Proteins Related to Pathology.

Authors:  Orkid Coskuner-Weber; Vladimir N Uversky
Journal:  Int J Mol Sci       Date:  2018-01-24       Impact factor: 5.923

Review 4.  Multi-system disorders of glycosphingolipid and ganglioside metabolism.

Authors:  You-Hai Xu; Sonya Barnes; Ying Sun; Gregory A Grabowski
Journal:  J Lipid Res       Date:  2010-03-08       Impact factor: 5.922

5.  Genomewide linkage study of modifiers of LRRK2-related Parkinson's disease.

Authors:  Jeanne C Latourelle; Audrey E Hendricks; Nathan Pankratz; Jemma B Wilk; Cheryl Halter; William C Nichols; James F Gusella; Anita L Destefano; Richard H Myers; Tatiana Foroud
Journal:  Mov Disord       Date:  2011-06-09       Impact factor: 10.338

6.  An algorithm for direct causal learning of influences on patient outcomes.

Authors:  Chandramouli Rathnam; Sanghoon Lee; Xia Jiang
Journal:  Artif Intell Med       Date:  2016-11-05       Impact factor: 5.326

Review 7.  The genetics of Parkinson's disease: progress and therapeutic implications.

Authors:  Andrew B Singleton; Matthew J Farrer; Vincenzo Bonifati
Journal:  Mov Disord       Date:  2013-01       Impact factor: 10.338

Review 8.  Pathways to neurodegeneration: mechanistic insights from GWAS in Alzheimer's disease, Parkinson's disease, and related disorders.

Authors:  Vijay K Ramanan; Andrew J Saykin
Journal:  Am J Neurodegener Dis       Date:  2013-09-18

9.  Shrunken methodology to genome-wide SNPs selection and construction of SNPs networks.

Authors:  Yang Liu; Michael Ng
Journal:  BMC Syst Biol       Date:  2010-09-13

10.  The analysis of association between SNCA, HUSEYO and CSMD1 gene variants and Parkinson's disease in Iranian population.

Authors:  Neda Shahmohammadibeni; Simin Rahimi-Aliabadi; Javad Jamshidi; Babak Emamalizadeh; Hossein Ali Shahmohammadibeni; Alireza Zare Bidoki; Haleh Akhavan-Niaki; Hajar Eftekhari; Shokoufeh Abdollahi; Mahmoud Shekari Khaniani; Mahnaz Shahmohammadibeni; Atena Fazeli; Marzieh Motallebi; Shaghayegh Taghavi; Azadeh Ahmadifard; Amir Ehtesham Shafiei Zarneh; Monavvar Andarva; Tahereh Dadkhah; Ehteram Khademi; Elham Alehabib; Mahnoosh Rahimi; Abbas Tafakhori; Minoo Atakhorrami; Hossein Darvish
Journal:  Neurol Sci       Date:  2016-01-05       Impact factor: 3.307

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.