| Literature DB >> 30108311 |
Joshua C Bis1, Xueqiu Jian2, Brian W Kunkle3, Yuning Chen4, Adam C Naj5, Myriam Fornage2,6, Lindsay A Farrer7,8,9,10,11, Kara L Hamilton-Nelson3, William S Bush12, William J Salerno13, Daniel Lancour14, Yiyi Ma14, Alan E Renton15, Edoardo Marcora15,16, John J Farrell14, Yi Zhao5, Liming Qu5, Shahzad Ahmad17, Najaf Amin18, Philippe Amouyel18,19,20, Gary W Beecham3, Jennifer E Below21, Dominique Campion22,23, Laura Cantwell5, Camille Charbonnier22, Jaeyoon Chung14, Paul K Crane1, Carlos Cruchaga24, L Adrienne Cupples4,25, Jean-François Dartigues26, Stéphanie Debette26,27, Jean-François Deleuze28, Lucinda Fulton29, Stacey B Gabriel30, Emmanuelle Genin31, Richard A Gibbs13, Alison Goate15,16, Benjamin Grenier-Boley18, Namrata Gupta30, Jonathan L Haines12, Aki S Havulinna32,33, Seppo Helisalmi34, Mikko Hiltunen35, Daniel P Howrigan36,37, M Arfan Ikram17, Jaakko Kaprio32, Jan Konrad24, Amanda Kuzma5, Eric S Lander30, Mark Lathrop38, Terho Lehtimäki39, Honghuang Lin40, Kari Mattila39, Richard Mayeux41, Donna M Muzny13, Waleed Nasser13, Benjamin Neale36,37, Kwangsik Nho42, Gaël Nicolas22, Devanshi Patel14, Margaret A Pericak-Vance3, Markus Perola32,33,43, Bruce M Psaty1,44,45,46, Olivier Quenez22, Farid Rajabli3, Richard Redon47, Christiane Reitz41, Anne M Remes34,48, Veikko Salomaa33, Chloe Sarnowski4, Helena Schmidt49, Michael Schmidt3, Reinhold Schmidt49, Hilkka Soininen34,50, Timothy A Thornton51, Giuseppe Tosto41, Christophe Tzourio26, Sven J van der Lee17, Cornelia M van Duijn17, Otto Valladares5, Badri Vardarajan41, Li-San Wang5, Weixin Wang5, Ellen Wijsman52,53, Richard K Wilson29, Daniela Witten51,53, Kim C Worley13, Xiaoling Zhang4,14, Celine Bellenguez18, Jean-Charles Lambert18, Mitja I Kurki32,36,37, Aarno Palotie32,36,37, Mark Daly30,32,37, Eric Boerwinkle13,6, Kathryn L Lunetta4, Anita L Destefano4,54, Josée Dupuis4, Eden R Martin3, Gerard D Schellenberg5, Sudha Seshadri25,54,55.
Abstract
The Alzheimer's Disease Sequencing Project (ADSP) undertook whole exome sequencing in 5,740 late-onset Alzheimer disease (AD) cases and 5,096 cognitively normal controls primarily of European ancestry (EA), among whom 218 cases and 177 controls were Caribbean Hispanic (CH). An age-, sex- and APOE based risk score and family history were used to select cases most likely to harbor novel AD risk variants and controls least likely to develop AD by age 85 years. We tested ~1.5 million single nucleotide variants (SNVs) and 50,000 insertion-deletion polymorphisms (indels) for association to AD, using multiple models considering individual variants as well as gene-based tests aggregating rare, predicted functional, and loss of function variants. Sixteen single variants and 19 genes that met criteria for significant or suggestive associations after multiple-testing correction were evaluated for replication in four independent samples; three with whole exome sequencing (2,778 cases, 7,262 controls) and one with genome-wide genotyping imputed to the Haplotype Reference Consortium panel (9,343 cases, 11,527 controls). The top findings in the discovery sample were also followed-up in the ADSP whole-genome sequenced family-based dataset (197 members of 42 EA families and 501 members of 157 CH families). We identified novel and predicted functional genetic variants in genes previously associated with AD. We also detected associations in three novel genes: IGHG3 (p = 9.8 × 10-7), an immunoglobulin gene whose antibodies interact with β-amyloid, a long non-coding RNA AC099552.4 (p = 1.2 × 10-7), and a zinc-finger protein ZNF655 (gene-based p = 5.0 × 10-6). The latter two suggest an important role for transcriptional regulation in AD pathogenesis.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30108311 PMCID: PMC6375806 DOI: 10.1038/s41380-018-0112-7
Source DB: PubMed Journal: Mol Psychiatry ISSN: 1359-4184 Impact factor: 15.992
Participant Characteristics
| A. Discovery Sample | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| AD Cases (N = 5,740) | Cognitively Normal Controls (N = 5,096) | ||||||||
| Ancestry | Sampling | N | Age (mean) | Sex (%F) | APOE E4 (%carrier) | N | Age (mean) | Sex (%F) | APOE E4 (%carrier) |
| EA | Case-Control | 5,015 | 75.25 | 55.8% | 40.6% | 4,919 | 86.53 | 59.2% | 14.4% |
| EA | Enriched | 507 | 83.61 | 63.3% | 66.9% | NA | NA | NA | NA |
| Hispanic | Case-Control | 46 | 72.59 | 71.7% | 43.5% | 6 | 85.94 | 66.7% | 16.7% |
| Hispanic | Enriched | 172 | 75.45 | 61.6% | 39.5% | 171 | 73.46 | 60.8% | 39.2% |
Associations with Individual Variants outside the APOE region
| ADSP Discovery Meta | All Replication (ADGC+CHARGE+ADES-FR+FinnAD) | Discovery + All Replication | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Name | gene | MAC (EA/CH) | best P | Model (group) | MAC | P Model 1 | P Model 2 | P Model 0 | P Model 1 | P Model 2 |
| 6:41129252:C:T (R47H) | TREM2 | 120/0 | 0 (EA) | 224 | ||||||
| 7:154988675:G:A | AC099552.4 | 10/0 | 2 (EA) | 0 | NA | NA | ||||
| 7:99971313:T:C (rs2405442) | PILRA | 6,219/219 | 0 (EA) | 22,798 | 5.3E-05 | 2.3E-05 | 1.1E-06 | 5.0E-07 | ||
| 20:62729814:C:T (rs148484121) | OPRL1 | 61/4 | 5.8E-07 | 1 (all) | 111 | 3.4E-01 | 5.6E-01 | 3.7E-03 | 1.4E-04 | 4.5E-04 |
| 11:59940599:T:A (rs7232) | MS4A6A | 7,540/258 | 7.7E-07 | 0 (all) | 20,963 | |||||
| 17:44828931:G:A (rs199533) | NSF | 4,238/135 | 1.3E-06 | 0 (all) | 11,120 | 2.5E-01 | 1.4E-02 | 2.1E-04 | 1.6E-02 | 1.9E-04 |
| 14:106235767:C:T (rs77307099) | IGHG3 | 6,200/176 | 1.9E-06 | 0 (all) | 721 | 4.0E-01 | 3.5E-01 | 1.3E-04 | 7.9E-05 | |
| 14:106235766:G:A (rs78376194) | IGHG3 | 6,202/176 | 1.9E-06 | 0 (all) | 719 | 4.2E-01 | 3.6E-01 | 1.4E-04 | 8.5E-05 | |
| 6:15638035:C:T (rs77460377) | DTNBP1 | 16/3 | 1.9E-06 | 2 (all) | 35 | 8.5E-01 | 8.7E-01 | 8.7E-02 | 5.2E-03 | 3.0E-03 |
| 6:33041297:G:A (rs112178281) | HLA-DPA1 | 10/0 | 2.9E-06 | 1 (EA) | 6 | 7.5E-01 | 9.2E-01 | 1.4E-01 | 2.1E-05 | 2.0E-05 |
| 11:59945745:T:C (rs12453) | MS4A6A | 8,265/258 | 3.2E-06 | 0 (EA) | 23,420 | |||||
| 3:195506101:T:A | MUC4 | 38/6 | 3.8E-06 | 1 (all) | 0 | NA | NA | |||
| 10:88446985:T:C (rs76615432) | LDB3 | 760/62 | 5.0E-06 | 1 (CH) | 2,303 | 5.3E-01 | 5.9E-01 | 7.0E-01 | 6.3E-01 | 6.4E-01 |
| 19:1047507:AGGAGCAG:A | ABCA7 | 67/0 | 4.3E-06 | 0 (EA) | 11 | 8.8E-02 | 9.7E-02 | 2.4E-04 | 1.6E-02 | 1.7E-02 |
| 14:106236128:T:A (rs12890612) | IGHG3 | 6,395/369 | 4.5E-06 | 0 (all) | 1,473 | 8.5E-02 | 7.5E-02 | 8.0E-05 | 6.4E-05 | |
| 7:99799845:T:A (rs104395) | STAG3 | 5,248/248 | 5.5E-06 | 0 (EA) | 15,948 | 3.0E-03 | 1.2E-03 | 1.2E-04 | 4.0E-05 | |
Table shows variants with P < 6.1 × 10−6 in EA, CH, or combined strata in the discovery sample. Exome-wide significant results (P < 3.1 × 10−7) and suggestive results which improved in meta analysis of discovery + replication data are highlighted in bold. Results without variation data in the replication datasets are indicated in italics
Fig. 1Manhattan plot showing genome-wide association results for individual common variants. The plot shows the p-values from the Discovery meta-analysis against their genomic position for association with AD. Only variants with a combined minor allele count of ≥ 10 were included; the minimum p-value from the three adjustment models for either the meta-analysis, European Ancestry (EA), or Caribbean Hispanic (CH) is plotted for each variant. Genes containing the variant are indicated above points that surpassed our significance threshold for follow-up. The dotted line indicates the threshold for follow-up, p < 6.1 × 10−6, corresponding to (1 / #variants) tested. The dashed line indicates the threshold for exome-wide significance, p < 3.1 × 10−7, corresponding to (0.05 / #variants tested)
Gene-based Association Results
| ADSP Discovery Meta | All Replication | Discovery + All Replication | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene | Variants b | SNVs | best P | Model | SNVs | P Model 1 | P Model 2 | SNVs | P Model 0 | P Model 1 | P Model 2 |
| TREM2 | High-Mod | 50 | 0 | 33 | 65 | ||||||
| CBLC a | High-Mod | 44 | 0 | 35 | 61 | 4.9E-02 | |||||
| OPRL1 | High-Mod | 42 | 1 | 37 | 1.3E-01 | 3.0E-01 | 64 | 8.3E-03 | 5.4E-04 | 1.7E-03 | |
| CBX3 | High-Mod | 8 | 6.0E-05 | 0 | 10 | 1.3E-01 | 2.8E-01 | 17 | 4.9E-04 | 4.6E-02 | 6.1E-02 |
| BCAM a | High-Mod | 90 | 5.2E-04 | 1 | 88 | 3.7E-03 | 144 | 4.8E-02 | |||
| GAS2L2 | High | 7 | 2 | 5 | 5.1E-02 | 6.7E-02 | 10 | 4.5E-01 | 3.9E-02 | 2.9E-02 | |
| ZNF655 | High | 9 | 2.8E-05 | 0 | 6 | 3.2E-02 | 3.4E-02 | 13 | 8.4E-04 | 3.4E-04 | |
| RHBDD1 | High | 2 | 3.2E-05 | 2 | 4 | 8.8E-01 | 9.8E-01 | 5 | 3.5E-01 | 4.8E-01 | 2.7E-01 |
| SIRPB1 | High | 6 | 8.0E-05 | 2 | 3 | 9.2E-01 | 7.9E-01 | 8 | 6.4E-01 | 3.0E-01 | 2.6E-01 |
| RPS16 | High | 5 | 1.6E-04 | 2 | 2 | 7.4E-01 | 4.2E-01 | 5 | 4.4E-02 | 7.7E-03 | 6.5E-03 |
| ABCA7 | LoF | 43 | 0 | 16 | 1.5E-01 | 1.1E-01 | 51 | 1.2E-04 | 1.2E-03 | 3.4E-04 | |
| GAS2L2 | LoF | 7 | 2 | 3 | 3.9E-02 | 4.8E-02 | 8 | 5.2E-01 | 4.3E-02 | 2.5E-02 | |
| ZNF655 | LoF | 8 | 1.9E-05 | 0 | 4 | 3.9E-02 | 3.0E-02 | 10 | 4.6E-04 | 2.0E-04 | |
| RPS16 | LoF | 3 | 1.6E-04 | 2 | 2 | 7.4E-01 | 4.2E-01 | 3 | 4.1E-02 | 7.9E-03 | 6.4E-03 |
Table shows genes with P-value < 5.7 × 10−5 (High-Mod), 2.2 × 10−4 (High), or 2.8 × 10−4 (LoF) in the total discovery sample. Results surpassing discovery stage Bonferroni corrected significance thresholds -- P = 2.8 × 10−6 (High-Mod), 1.1 × 10−5 (High), and 1.4 × 10−5 (LoF) – are indicated in bold.
alocated in APOE region
btype of functional variants included in gene-based test
Fig. 2Manhattan plots showing exome-wide association results for gene-based tests of rare functional variants. The plots show the gene-based p-values from the Discovery meta-analysis against their genomic position for association with AD. Each point represents a p-value from SKAT-O test aggregating rare variants (MAF < 5%), by gene, on the basis of predicted functional impact. Only genes with a cumulative minor allele count of ≥ 10 were included; the minimum p-value from the three adjustment models for either the meta-analysis, European Ancestry (EA), or Caribbean Hispanic (CH) is plotted for each variant. Genes are indicated above points that surpassed our significance threshold for follow-up in tests aggregating only (a) moderate or high impact variants, (b) high impact variants; (c) loss-of-function variants. In each plot, the dotted line indicates the threshold for follow-up: (a) p<5.5 × 10−5, (b) p<6.3 × 10−5, (c) p<2.8 × 10−4, each corresponding to 1 / # genes tested. The dashed line indicates the threshold for exome-wide significance: (a) p<2.7 × 10−6, (b) p<3.1 × 10−6, (c) p < 1.4 × 10−5, each corresponding to 0.05 / # genes tested
Fig. 3Distribution of high impact, LoF and high-confidence LoF variants grouped by predicted consequence