| Literature DB >> 25668194 |
Laura N D'Aoust1, Anna C Cummings1, Renee Laux2, Denise Fuzzell2, Laura Caywood3, Lori Reinhart-Mercer3, William K Scott3, Margaret A Pericak-Vance3, Jonathan L Haines4.
Abstract
Alzheimer disease (AD) is the most common cause of dementia. As with many complex diseases, the identified variants do not explain the total expected genetic risk that is based on heritability estimates for AD. Isolated founder populations, such as the Amish, are advantageous for genetic studies as they overcome heterogeneity limitations associated with complex population studies. We determined that Amish AD cases harbored a significantly higher burden of the known risk alleles compared to Amish cognitively normal controls, but a significantly lower burden when compared to cases from a dataset of unrelated individuals. Whole-exome sequencing of a selected subset of the overall study population was used as a screening tool to identify variants located in the regions of the genome that are most likely to contribute risk. By then genotyping the top candidate variants from the known AD genes and from linkage regions implicated previous studies in the full dataset, new associations could be confirmed. The most significant result (p = 0.0012) was for rs73938538, a synonymous variant in LAMA1 within the previously identified linkage peak on chromosome 18. However, this association is specific to the Amish and did not generalize when tested in a dataset of unrelated individuals. These results suggest that additional risk variation in the Amish remains to be identified and likely resides outside of the classical protein coding gene regions.Entities:
Mesh:
Year: 2015 PMID: 25668194 PMCID: PMC4323242 DOI: 10.1371/journal.pone.0118043
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Flow Diagram of this Study.
Individuals were selected from the full Amish dataset for whole-exome sequencing. The variants identified from these data were used to screen three classes of variants, genes that are very near or contain GWAS hits or carry early-onset mutations, genes within four candidate linkage regions implicated by previous studies and genes that harbor variants that occur uniquely in cases or in controls. The top variants from these three classes were then genotyped in the full dataset and case-control association was performed.
Details of Risk Loci from Meta-Analysis Used to Calculate Total Genetic Risk Score.
| Marker | Chr | Position | Gene | Overall | Amish | Unrelated | Weights |
|---|---|---|---|---|---|---|---|
| OR | MAF | MAF | |||||
|
| 1 | 207692049 | CR1 | 1.18 | 0.24 | 0.18 | 0.052 |
|
| 2 | 127892810 | BIN1 | 1.22 | 0.45 | 0.4 | 0.062 |
|
| 2 | 234068476 | INPP5D | 1.08 | 0.45 | 0.5 | 0.024 |
|
| 5 | 88223420 | MEF2C | 0.93 | - | - | - |
|
| 6 | 32578530 | HLA-DRB5/ | 1.11 | 0.18 | 0.28 | 0.033 |
| HLA-DRB1 | |||||||
|
| 6 | 47487762 | CD2AP | 1.1 | - | - | - |
|
| 7 | 37841534 | NME8 | 0.93 | 0.29 | 0.35 | 0.023 |
|
| 7 | 100004446 | ZCWPW1 | 0.91 | 0.28 | 0.29 | 0.03 |
|
| 7 | 143110762 | EPHA1 | 0.9 | 0.27 | 0.32 | 0.033 |
|
| 8 | 27195121 | PTK2B | 1.1 | 0.32 | 0.35 | 0.03 |
|
| 8 | 27467686 | CLU | 0.86 | 0.36 | 0.41 | 0.047 |
|
| 11 | 47557871 | CELF1 | 1.08 | 0.35 | 0.31 | 0.024 |
|
| 11 | 59923508 | MS4A6A | 0.9 | - | - | - |
|
| 11 | 85867875 | PICALM | 0.87 | 0.45 | 0.35 | 0.044 |
|
| 11 | 121435587 | SORL1 | 0.77 | 0.05 | 0.04 | 0.082 |
|
| 14 | 53400629 | FERMT2 | 1.14 | 0.05 | 0.11 | 0.041 |
|
| 14 | 92926952 | SLC24A4/RIN3 | 0.91 | 0.2 | 0.22 | 0.03 |
|
| 18 | 29088958 | DSG2 | 0.73 | 0.01 | 0.01 | 0.099 |
|
| 19 | 1063443 | ABCA7 | 1.15 | - | - | - |
|
| 19 | 51727962 | CD33 | 0.94 | 0.29 | 0.3 | 0.019 |
|
| 20 | 55018260 | CASS4 | 0.88 | 0.1 | 0.08 | 0.04 |
|
| 19 | 19q13.2 | APOE | 2.5 | 0.14 | 0.26 | 0.287 |
Alleles, MAF, and overall OR are published values. Chr = chromosome. Pos = position in bp. MAF = minor allele frequency. OR = odds ratio. Adapted from Lambert, et al, 2013 [16]. Allele frequency was calculated using the 921 Amish samples and the 971 samples from the unrelated dataset that passed QC in the follow-up genotyping phase.
Demographics of Genetic Risk Score Samples.
| Dataset | Affection status | Female | Total | Average age of exam/onset |
|---|---|---|---|---|
| (standard deviation) | ||||
| Amish |
| 63% | 126 | 78 (7.75) |
|
| 58% | 503 | 79 (6.72) | |
| Unrelated |
| 63% | 473 | 74 (8) |
|
| 60% | 498 | 74 (8) |
Percent female, age of exam and onset averages and standard deviations were calculated for 629 Amish samples and the 971 samples from the unrelated dataset which passed QC for follow-up genotyping.
Demographics of Amish Exome Sequencing Samples.
| Affection status | Female | Total | Average age of exam/onset |
|---|---|---|---|
| (standard deviation) | |||
| LOAD case | 55% | 53 | 78 (6.92) |
| Cognitively normal | 62% | 65 | 76 (7.21) |
| Unclear or unknown | 41% | 44 | 78 (7.60) |
Percent female, age of exam and onset averages and standard deviations were calculated for the 162 samples which passed QC for whole-exome sequencing.
Summary of Variant QC from the Follow-up Genotyping.
| Sequence | GWAS hit | Complete | |
|---|---|---|---|
| Variant | Dataset | ||
|
| 56 | 21 | 77 |
|
| 5 | 2 | 7 |
|
| 2 | 0 | 2 |
|
| 0 | 2 | 2 |
|
| 1 | 0 | 1 |
|
| 48 | 17 | 65 |
Sequence variant = variant identified from whole-exome sequence data. GWAS hit = SNP implicated by two recent meta-analyses [16,18]. Complete dataset = all variants and markers genotyped in three Sequenom pools and two TaqMan assays.
Demographics of Amish Samples Used For Follow-up Genotyping.
| Affection status | Female | Total | Average age of exam/onset |
|---|---|---|---|
| (standard deviation) | |||
| LOAD case | 63% | 126 | 78 (7.75) |
| Cognitively normal | 58% | 503 | 79 (6.72) |
| Unclear or unknown | 49% | 292 | 80 (6.82) |
Percent female, age of exam and onset averages and standard deviations were calculated for the 921 samples which passed QC for follow-up genotyping.
Fig 2Distributions of Total Genetic Risk Scores.
Total genetic risk score averages and standard deviations were calculated for the 629 Amish LOAD cases and cognitively normal controls and the 971 LOAD cases and cognitively normal controls from the unrelated case-control dataset who passed QC for the follow-up genotyping phase. n = total number of individuals. μ = average total risk score for group.
Summary of variants identified that are within or very near known AD genes.
| Gene | Location | Variants | dbSNP | ESP | 1000G | Novel |
|---|---|---|---|---|---|---|
|
| 19p13.3 | 20 | 19 | 18 | 18 | 1 |
|
| 19q13.2 | 0 | 0 | 0 | 0 | 0 |
|
| 21q21.3 | 1 | 1 | 1 | 0 | 0 |
|
| 2q14 | 3 | 3 | 3 | 3 | 0 |
|
| 20q13.31 | 9 | 9 | 8 | 9 | 0 |
|
| 6p12 | 1 | 1 | 1 | 1 | 0 |
|
| 19q13.3 | 1 | 1 | 1 | 1 | 0 |
|
| 11p11 | 1 | 0 | 0 | 0 | 1 |
|
| 8p21-p12 | 2 | 2 | 2 | 2 | 0 |
|
| 1q32 | 9 | 9 | 9 | 9 | 0 |
|
| 18q12.1 | 8 | 7 | 7 | 5 | 1 |
|
| 7q34 | 3 | 3 | 3 | 3 | 0 |
|
| 14q22.1 | 5 | 4 | 4 | 4 | 1 |
|
| 6p21.3 | 0 | 0 | 0 | 0 | 0 |
|
| 2q37.1 | 2 | 2 | 2 | 2 | 0 |
|
| 5q14 | 1 | 1 | 1 | 1 | 0 |
|
| 11q12.2 | 45 | 42 | 41 | 35 | 1 |
|
| 7p14.1 | 0 | 0 | 0 | 0 | 0 |
|
| 11q14 | 2 | 2 | 2 | 2 | 0 |
|
| 14q24.3 | 1 | 1 | 1 | 1 | 0 |
|
| 1q31-q42 | 2 | 1 | 1 | 1 | 1 |
|
| 8p21.1 | 12 | 10 | 11 | 10 | 1 |
|
| 14q32.12 | 5 | 5 | 5 | 4 | 0 |
|
| 11q23.2-q24.2 | 15 | 14 | 14 | 14 | 1 |
|
| 6p21.1 | 1 | 1 | 1 | 1 | 0 |
|
| 7q22.1 | 6 | 5 | 4 | 4 | 1 |
Counts are displayed for the number of variants present in the human variation catalogs of dbSNP build 137 (dbSNP), ESP 6500 release (ESP), and 1000 Genomes April 2012 release (1000G). The number of novel variants identified in each implicated gene is also shown.
* Closest gene to GWAS hit.
Summary of variants identified within implicated linkage regions.
| Peak | Variants | dbSNP | ESP | 1000G | Novel |
|---|---|---|---|---|---|
|
| 282 | 277 | 273 | 276 | 4 |
|
| 54 | 54 | 54 | 54 | 0 |
|
| 158 | 157 | 156 | 156 | 1 |
|
| 63 | 62 | 62 | 62 | 0 |
Counts are displayed for the number of variants present in the human variation catalogs of dbSNP build 137 (dbSNP), ESP 6500 release (ESP), and 1000 Genomes April 2012 release (1000G). The number of novel variants identified in each implicated linkage region is also shown. Chr = chromosome. Mbp = megabase pair.
MQLS-corrected allele frequencies and case-control association p-values for the top sequencing variants in the sequencing dataset.
| Marker | Chr | Position | Case | Control | Overall | p value | Gene | Function |
|---|---|---|---|---|---|---|---|---|
| MAF | MAF | MAF | ||||||
|
| 10 | 102762544 | 0.0104 | 0.0019 | 0.0013 | 1.25E-06 |
| synonymous |
|
| 15 | 91503598 | 0.0104 | 0.0019 | 0.0013 | 1.25E-06 |
| synonymous |
|
| 3 | 128525253 | 0.0987 | 0.0259 | 0.0525 | 3.31E-06 |
| synonymous |
|
| 12 | 6601475 | 0.0156 | 0.0078 | 0.0072 | 4.68E-06 |
| missense |
|
| 5 | 176008380 | 0.0156 | 0.0078 | 0.0072 | 5.00E-06 |
| missense |
|
| 20 | 3785672 | 0.0414 | 0.0036 | 0.0108 | 9.28E-06 |
| UTR-3 |
|
| 6 | 137234733 | 0.0403 | 0.0171 | 0.0199 | 1.01E-05 |
| UTR-3 |
|
| 2 | 25141538 | 0.5938 | 0.4151 | 0.4619 | 1.28E-05 |
| missense |
|
| 6 | 136710582 | 0.0511 | 0.0203 | 0.0277 | 1.53E-05 |
| synonymous |
|
| 15 | 91496233 | 0.0278 | 0.005 | 0.0085 | 3.74E-05 |
| missense |
|
| 4 | 175158508 | 0.0651 | 0.0195 | 0.0318 | 4.67E-05 |
| UTR-3 |
|
| 20 | 3785297 | 0.041 | 0.0076 | 0.0129 | 4.89E-05 |
| synonymous |
|
| 10 | 105762909 | 0.0104 | 0 | 0.001 | 7.00E-05 |
| missense |
|
| 20 | 3687141 | 0.0894 | 0.0374 | 0.0455 | 7.94E-05 |
| stop-gained |
|
| 10 | 90673047 | 0 | 0.0434 | 0.0352 | 9.17E-05 |
| missense |
Chr = chromosome. MAF = minor allele frequency. Nucleotide position is based upon the UCSC hg19 human reference genome. Gene and Function annotated by SeattleSeq134.
MQLS-corrected allele frequencies and case-control association p-values for the top variants in the full dataset.
| Marker | Chr | Position | Case MAF | Control MAF | p value | Gene |
|---|---|---|---|---|---|---|
|
| 18 | 7008583 | 0.1513 | 0.0766 | 0.0012 |
|
|
| 11 | 47505996 | 0.0069 | 0.0001 | 0.0543 |
|
|
| 18 | 13116432 | 0.3063 | 0.3384 | 0.0550 |
|
|
| 18 | 12984144 | 0.3162 | 0.3423 | 0.0758 |
|
|
| 2 | 86371883 | 0.4026 | 0.4523 | 0.0775 |
|
|
| 3 | 170078232 | 0.1287 | 0.1635 | 0.0788 |
|
|
| 20 | 55033856 | 0.3946 | 0.3700 | 0.0920 |
|
Chr = chromosome. MAF = minor allele frequency. Nucleotide position is based upon the UCSC hg19 human reference genome. Gene annotated by SeattleSeq134.
* Variant in implicated linkage regions.
+ Variant in implicated AD gene.