
Whole exome sequencing study identifies novel rare and common Alzheimer's-Associated variants involved in immune response and transcriptional regulation.

Joshua C Bis¹, Xueqiu Jian², Brian W Kunkle³, Yuning Chen⁴, Adam C Naj⁵, Myriam Fornage^2,6, Lindsay A Farrer^7,8,9,10,11, Kara L Hamilton-Nelson³, William S Bush¹², William J Salerno¹³, Daniel Lancour¹⁴, Yiyi Ma¹⁴, Alan E Renton¹⁵, Edoardo Marcora^15,16, John J Farrell¹⁴, Yi Zhao⁵, Liming Qu⁵, Shahzad Ahmad¹⁷, Najaf Amin¹⁸, Philippe Amouyel^18,19,20, Gary W Beecham³, Jennifer E Below²¹, Dominique Campion^22,23, Laura Cantwell⁵, Camille Charbonnier²², Jaeyoon Chung¹⁴, Paul K Crane¹, Carlos Cruchaga²⁴, L Adrienne Cupples^4,25, Jean-François Dartigues²⁶, Stéphanie Debette^26,27, Jean-François Deleuze²⁸, Lucinda Fulton²⁹, Stacey B Gabriel³⁰, Emmanuelle Genin³¹, Richard A Gibbs¹³, Alison Goate^15,16, Benjamin Grenier-Boley¹⁸, Namrata Gupta³⁰, Jonathan L Haines¹², Aki S Havulinna^32,33, Seppo Helisalmi³⁴, Mikko Hiltunen³⁵, Daniel P Howrigan^36,37, M Arfan Ikram¹⁷, Jaakko Kaprio³², Jan Konrad²⁴, Amanda Kuzma⁵, Eric S Lander³⁰, Mark Lathrop³⁸, Terho Lehtimäki³⁹, Honghuang Lin⁴⁰, Kari Mattila³⁹, Richard Mayeux⁴¹, Donna M Muzny¹³, Waleed Nasser¹³, Benjamin Neale^36,37, Kwangsik Nho⁴², Gaël Nicolas²², Devanshi Patel¹⁴, Margaret A Pericak-Vance³, Markus Perola^32,33,43, Bruce M Psaty^1,44,45,46, Olivier Quenez²², Farid Rajabli³, Richard Redon⁴⁷, Christiane Reitz⁴¹, Anne M Remes^34,48, Veikko Salomaa³³, Chloe Sarnowski⁴, Helena Schmidt⁴⁹, Michael Schmidt³, Reinhold Schmidt⁴⁹, Hilkka Soininen^34,50, Timothy A Thornton⁵¹, Giuseppe Tosto⁴¹, Christophe Tzourio²⁶, Sven J van der Lee¹⁷, Cornelia M van Duijn¹⁷, Otto Valladares⁵, Badri Vardarajan⁴¹, Li-San Wang⁵, Weixin Wang⁵, Ellen Wijsman^52,53, Richard K Wilson²⁹, Daniela Witten^51,53, Kim C Worley¹³, Xiaoling Zhang^4,14, Celine Bellenguez¹⁸, Jean-Charles Lambert¹⁸, Mitja I Kurki^32,36,37, Aarno Palotie^32,36,37, Mark Daly^30,32,37, Eric Boerwinkle^13,6, Kathryn L Lunetta⁴, Anita L Destefano^4,54, Josée Dupuis⁴, Eden R Martin³, Gerard D Schellenberg⁵, Sudha Seshadri^25,54,55.

Abstract

The Alzheimer's Disease Sequencing Project (ADSP) undertook whole exome sequencing in 5,740 late-onset Alzheimer disease (AD) cases and 5,096 cognitively normal controls primarily of European ancestry (EA), among whom 218 cases and 177 controls were Caribbean Hispanic (CH). An age-, sex- and APOE-based risk score and family history were used to select cases most likely to harbor novel AD risk variants and controls least likely to develop AD by age 85 years. We tested ~1.5 million single nucleotide variants (SNVs) and 50,000 insertion-deletion polymorphisms (indels) for association with AD, using multiple models considering individual variants as well as gene-based tests aggregating rare, predicted functional, and loss of function variants. Sixteen single variants and 19 genes that met criteria for significant or suggestive associations after multiple-testing correction were evaluated for replication in four independent samples: three with whole exome sequencing (2,778 cases, 7,262 controls) and one with genome-wide genotyping imputed to the Haplotype Reference Consortium panel (9,343 cases, 11,527 controls). The top findings in the discovery sample were also followed up in the ADSP whole-genome sequenced family-based dataset (197 members of 42 EA families and 501 members of 157 CH families). We identified novel and predicted functional genetic variants in genes previously associated with AD. We also detected associations in three novel genes: IGHG3 (p = 9.8 × 10−7), an immunoglobulin gene whose antibodies interact with β-amyloid, a long non-coding RNA AC099552.4 (p = 1.2 × 10−7), and a zinc-finger protein ZNF655 (gene-based p = 5.0 × 10−6). The latter two suggest an important role for transcriptional regulation in AD pathogenesis.


Year: 2018 PMID: 30108311 PMCID: PMC6375806 DOI: 10.1038/s41380-018-0112-7

Source DB: PubMed Journal: Mol Psychiatry ISSN: 1359-4184 Impact factor: 15.992

Introduction

Genomic studies have revealed that late-onset Alzheimer disease (LOAD) is highly polygenic, with as many as 30 susceptibility loci identified through large-scale meta-analysis of genome-wide association studies (GWAS), targeted exome genotyping arrays, and several early whole exome sequencing (WES) studies [1-12]. Although AD susceptibility is highly heritable (h² = 0.58–0.79) [13], much of its genetic architecture is still unknown and few rare variants have been detected thus far [3, 6, 7, 14–19]. Discovery of rare variants in genomic studies, even those with large sample sizes and examining highly heritable diseases, remains challenging due to statistical power limitations in detecting all but the most strongly associated variants (odds ratio (OR) > 1.5) [20-23]. The protein coding regions of the genome, or exome, are the best characterized and most conserved portions of the genome and the source of most variants identified to date that are responsible for Mendelian diseases [24]; thus, the exome is a more attractive and less expensive target for identifying rare variants of large effect on disease than the non-protein coding portion of the genome. The Alzheimer’s Disease Sequencing Project (ADSP) was developed jointly by the National Institute on Aging (NIA) and National Human Genome Research Institute (NHGRI), in response to the National Alzheimer’s Project Act milestones (https://aspe.hhs.gov/national-alzheimers-project-act) to fight Alzheimer’s disease (AD), as an effort to analyze the genomes of well-characterized individuals with and without AD.
To detect rare variants and genes associated with LOAD, we performed single-variant and gene-based analyses, including annotated loss-of-function analyses, on the ADSP Discovery Phase Case-Control WES dataset, and attempted to replicate associations in three independent WES datasets, a GWAS dataset containing single nucleotide variants (SNVs) that were imputed using the Haplotype Reference Consortium (HRC) [25] reference panel, and the ADSP family-based whole genome sequence dataset.

Methods

Sample selection and data preparation

Study participants were of either European-American (EA) or Caribbean Hispanic (CH) ancestry and were sampled in two ways. To maximize contrast between cases and controls, and power to discover novel associations, the majority of participants were chosen using a risk score that included dosages of the APOE ε2/ε3/ε4 alleles, sex and either onset age (for cases) or age at last exam for controls (or pathology-based adjusted age at death for neuropathology controls) [26]. All cases were at least 60 years old and met NINCDS-ADRDA criteria for possible, probable or definite AD based on clinical assessment, or had presence of AD (moderate or high likelihood) upon neuropathology examination. To maximize our ability to discover novel genetic associations, we chose cases whose AD risk score indicated that their disease was not well explained by age, sex, or dosages of the APOE ε2/ε3/ε4 alleles. Conversely, cognitively healthy controls were selected with the goal of identifying alleles associated with the increased risk of or protection from late-onset AD. At the time of last exam, all potential controls were at least 60 years old and were either judged to be cognitively normal or did not meet pathological criteria [27, 28] for AD following brain autopsy. Controls were selected for this study on the basis of the risk score indicating that they were the least likely to develop AD by age 85 years. Applying the risk score resulted in a sample that contained 2,220 AD cases (40%) and 752 controls (14%) who were ε4 heterozygotes and 161 AD cases (3%) and 17 controls (<1%) who were ε4 homozygotes. In addition, we sampled a set of “enriched” cases from families having at least three affected members for whom the diagnosis of AD was verified by direct examination or review of cognitive testing data and medical records. Cases from early-onset AD families or families with a known PSEN1, PSEN2, or APP mutation were excluded.
Within each family, we selected only one AD case, typically the member with the lowest a priori AD risk (based on the risk score defined above), provided this person had sufficient genomic DNA. In addition, because 172 of the “enriched” cases described above were of CH ancestry, we also selected a set of 171 age- and sex-matched cognitively normal CH participants to serve as controls. Participant characteristics are shown in Table 1A.

Table 1

Participant Characteristics

A. Discovery Sample
		AD Cases (N = 5,740)				Cognitively Normal Controls (N = 5,096)
Ancestry	Sampling	N	Age (mean)	Sex (%F)	APOE E4 (%carrier)	N	Age (mean)	Sex (%F)	APOE E4 (%carrier)
EA	Case-Control	5,015	75.25	55.8%	40.6%	4,919	86.53	59.2%	14.4%
EA	Enriched	507	83.61	63.3%	66.9%	NA	NA	NA	NA
Hispanic	Case-Control	46	72.59	71.7%	43.5%	6	85.94	66.7%	16.7%
Hispanic	Enriched	172	75.45	61.6%	39.5%	171	73.46	60.8%	39.2%


Procedures

Genotype calling and data processing

WES data were generated at the Broad Institute, the Baylor College of Medicine’s Human Genome Sequencing Center, and Washington University’s McDonnell Genome Institute. An effort was made to assign all samples from a study of origin to the same center and there was a relatively balanced number of cases and controls at each center. Genotypes for bi-allelic SNVs and insertion-deletion polymorphisms (indels) were called using ATLAS2 against version GRCh37/hg19 of the reference genome. Variant calling and quality control (QC) were coordinated centrally in order to create one batch of data for analysis. Although there were differences in allele frequencies across sequencing centers for some variants, it was difficult to determine whether these represented technical artifacts of different capture kits, variability in genetic background among cohorts assembled for this study, or chance differences that will often occur for infrequent or rare variants. QC steps and methods for evaluating cryptic relatedness, population substructure, differential missingness, and variant annotation are described in the Supplementary Materials.

Single-variant and gene-based association analyses

Statistical models & rationale for covariate adjustments

All models included adjustment for sequencing center and population substructure. Before conducting the primary analyses, we evaluated up to 20 PCs for association with AD status. Only ancestry-specific PCs that significantly associated with AD status (P<0.005) in at least one of the three adjustment models were included as covariates (EA subgroup: PC1, PC5, PC8, PC9, PC10, PC11, PC18; Hispanic analyses included PC1 and PC2). Because most participants for the discovery study were sampled to maximize differences in cases and controls based on age, sex, and APOE genotypes, we included only PCs and sequencing center in our base adjustment model (Model 0). We evaluated two other models that included several covariates in addition to those in the base model: Model 1 adjusted for sex and age at diagnosis or last follow-up; and Model 2 adjusted for APOE ε4 & ε2 dosages in addition to those included in Model 1. All analyses were performed separately by ancestry (EA and CH) using seqMeta (version 1.6) [29]; the primary analysis is an inverse variance-weighted meta-analysis of these two groups. Single variant tests were limited to variants with at least 10 copies of the minor allele across the total QCed sample (MAF~0.0005).
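
As a rough illustration of how an inverse variance-weighted combination of the EA and CH strata works, the following Python sketch pools two per-stratum effect estimates under a fixed-effect model. The function name and the numeric inputs are hypothetical; the analysis itself was run in seqMeta, whose internals are not reproduced here.

```python
import math
from statistics import NormalDist

def ivw_meta(betas, ses):
    """Fixed-effect inverse variance-weighted meta-analysis.

    betas, ses: per-stratum effect estimates and their standard errors.
    Returns (pooled beta, pooled SE, two-sided p-value).
    """
    weights = [1.0 / se ** 2 for se in ses]          # weight = 1 / variance
    total = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total
    se = math.sqrt(1.0 / total)
    z = beta / se
    p = 2.0 * (1.0 - NormalDist().cdf(abs(z)))       # normal approximation
    return beta, se, p

# Hypothetical log-odds ratios for one variant in the EA and CH strata.
beta, se, p = ivw_meta(betas=[0.40, 0.10], ses=[0.10, 0.30])
```

The larger stratum, having the smaller standard error, dominates the pooled estimate, which is why results in a combined EA + CH analysis tend to track the EA results closely.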

Gene-based association testing

Gene-based tests examine the aggregate effect of risk and protective variants within a region defined by gene annotations. We performed gene-based tests using SKAT-O, which optimally combines SKAT and burden tests [30]. For these analyses, the SKAT portion of the test included variants with a MAF≤0.05; the burden component aggregated variants with MAF≤0.01. The SKAT test used ‘Wu weights’, defined by a beta density function with pre-specified parameters a1 = 1 and a2 = 25, evaluated at the sample minor allele frequency. The SKAT-O statistic, a linear combination between a SKAT statistic (Qskat) and a burden statistic (Qburden) equal to (1-ρ) Qskat + ρQburden, was optimized across 11 values of ρ (0.1 increments), and calculation of the significance took into consideration the multiple values of ρ evaluated. In order to improve power by removing variants predicted to have a low functional impact on the translated protein, we filtered variants in each gene on the basis of annotated function as described in the Supplementary Materials. We performed SKAT-O testing for genes with at least two qualifying variants contributing to the test. The minimum number of aggregated alleles (i.e., cumulative minor allele count or cMAC) for a gene-based test was set at 10.
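
The two ingredients described above, the beta-density weights and the grid over ρ, can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names and the example statistics are hypothetical, and the sketch does not compute the SKAT-O p-value, which must account for the multiple values of ρ.

```python
import math

def beta_density(x, a=1.0, b=25.0):
    """Beta(a, b) probability density evaluated at x in [0, 1]."""
    norm = math.gamma(a + b) / (math.gamma(a) * math.gamma(b))
    return norm * x ** (a - 1.0) * (1.0 - x) ** (b - 1.0)

def skat_o_grid(q_skat, q_burden, n_rho=11):
    """Return (rho, Q_rho) pairs with Q_rho = (1 - rho) * Q_skat + rho * Q_burden."""
    rhos = [i / (n_rho - 1) for i in range(n_rho)]   # 0.0, 0.1, ..., 1.0
    return [(rho, (1.0 - rho) * q_skat + rho * q_burden) for rho in rhos]

# Rarer variants receive larger weights under Beta(1, 25).
weights = {maf: beta_density(maf) for maf in (0.0005, 0.01, 0.05)}

# Hypothetical statistics, used only to show the interpolation across rho.
grid = skat_o_grid(q_skat=12.0, q_burden=4.0)
```

At ρ = 0 the combined statistic reduces to SKAT and at ρ = 1 to the burden statistic; intermediate values interpolate between the two.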

Statistical significance thresholds for discovery stage analyses

Within each analysis framework including individual variants and gene-based aggregation of variants evaluated under particular functional annotation criteria, suggestive associations (p < 1/ # tests) were selected for follow-up testing in independent samples and a Bonferroni-corrected threshold was used to define experiment wide statistical significance (p < 0.05 / # tests). We did not correct for the three models and meta-analyses of the combined results of the EA + CH populations because the results were highly correlated across the covariate adjustment strategies (Supplementary Figure S2).
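
The two thresholds are simple functions of the number of tests. The Python sketch below applies them to the test counts reported in the Results; the helper name is hypothetical, and the printed values may differ slightly from the rounded thresholds quoted in the tables and figure legends.

```python
def thresholds(n_tests, alpha=0.05):
    """Return (suggestive, bonferroni) p-value cut-offs for n_tests tests."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return 1.0 / n_tests, alpha / n_tests

# Test counts reported in the Results section of this article.
for label, n in [("single variants", 160_898),
                 ("genes, high/moderate impact", 17_613),
                 ("genes, high impact", 4_634),
                 ("genes, high-confidence LoF", 3_558)]:
    suggestive, bonferroni = thresholds(n)
    print(f"{label}: suggestive p < {suggestive:.2e}, significant p < {bonferroni:.2e}")
```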

Replication sample and analyses

Primary replication analyses for the SNVs / genes that we identified to be genome-wide significant or suggestive in any model were conducted in three independent WES datasets including CHARGE (612 cases and 1,836 controls), ADES-FR (1,142 cases and 1,104 controls) [31], and FinnAD (1,024 cases and 4,322 controls), as well as in the Alzheimer’s Disease Genetics Consortium (ADGC) HRC-imputed GWAS dataset (Table 1B, Supplementary Materials, Supplementary Table S1). The ADGC dataset included GWAS data on 9,343 cases and 11,527 cognitively normal elders from 32 datasets for whom genotypes were imputed using the Haplotype Reference Consortium (HRC) r1.1 reference panel (Supplementary Table S2) [25, 32]. CHARGE and ADGC participants selected for ADSP discovery analyses were not included in the replication study. Because we included all available cases and controls in the replication datasets instead of applying the participant selection criteria used for the discovery sample to maximize difference in cases and controls, model 0 is not appropriate in replication studies. Hence, single variant tests and gene-based SKAT-O tests were performed using seqMeta for models 1 and 2 only. Meta-analysis of summarized results from the four samples was performed using seqMeta. We also performed a meta-analysis of results across the ADSP discovery and four replication cohorts for findings that were at least “suggestive” (p < 1 / genes or variants) in the discovery phase. In addition to models 1 and 2, we conducted a meta-analysis of results obtained using model 0 in the ADSP discovery data and model 1 in the replication cohorts to verify our findings in ADSP model 0. Because the ADSP discovery dataset includes CH participants and all replication cohorts consist of EA participants only, we performed the meta-analysis with and without CH participants in ADSP. 
We considered any variants or genes with two-stage meta-analysis p-values < 0.05 / # tests to be significant per the recommendation by Skol et al. that joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies [33]. We acknowledge, however, that additional replication in independent samples is required. The top findings in the discovery sample were also followed up in the ADSP whole-genome sequenced (WGS) family-based dataset [34, 35]. This dataset includes 197 individuals sequenced in 42 EA families and 351 individuals in 67 CH families. Additional follow-up was also performed using WGS information from 150 members of another 90 CH families. Individual variants were assessed by examining their co-segregation with AD status within families. Gene-based association tests were performed using the FSKAT software [36].

Analysis of variants at previously established AD loci

To identify a set of variants related to AD risk in loci previously associated with AD, we compiled a list of genes containing variants with significant or suggestive associations (p < 1 × 10−3) in either the published IGAP or UKBB AD GWAS meta-analyses [9, 37]. Because many signal variants from GWAS are in intergenic regions, we used a combination of BEDOPS [38] and BEDTools [39] operations to enlarge the genomic coordinates of these associated variants by 50 kb on each side, merging adjacent regions that were overlapping and/or book-ended. Of the resulting genomic regions, segments greater than 100 kb were retained, shortened by 50 kb on each side, merged if separated by 200 kb or less, and used to find overlapping protein-coding genes, with gene boundaries as defined in version 19 of the GENCODE gene set [40] and a 50 kb buffer on each side. These parameters and sequence of operations were chosen because they resulted in an algorithm that satisfactorily captured the genomic interval of the association landscape at each locus, as confirmed by visual inspection of LocusZoom regional plots [41]. We queried variant and gene level association statistics for the resulting list of 299 putatively associated AD genes.
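
The sequence of interval operations described above can be sketched in plain Python for a single chromosome. This is not the BEDOPS/BEDTools pipeline itself: the function names and example coordinates are hypothetical, variants are treated as zero-length points well away from the chromosome start, and under that assumption an isolated variant yields a region of exactly 100 kb and is dropped by the length filter.

```python
PAD = 50_000        # 50 kb flank added on each side
MIN_LEN = 100_000   # segments must be longer than this to be kept
MAX_GAP = 200_000   # shortened segments this close together are merged

def merge(intervals, max_gap=0):
    """Merge (start, end) intervals whose gap is <= max_gap.

    With max_gap=0 this merges overlapping and book-ended intervals.
    """
    merged = []
    for start, end in sorted(intervals):
        if merged and start - merged[-1][1] <= max_gap:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

def associated_regions(positions):
    """Apply the pad / merge / filter / shorten / merge sequence."""
    padded = [(p - PAD, p + PAD) for p in positions]   # assumes p > PAD
    regions = merge(padded)
    kept = [(s, e) for s, e in regions if e - s > MIN_LEN]
    shortened = [(s + PAD, e - PAD) for s, e in kept]
    return merge(shortened, max_gap=MAX_GAP)

def overlapping_genes(regions, genes):
    """Return names of genes whose span, padded by PAD, overlaps any region."""
    hits = []
    for name, g_start, g_end in genes:
        lo, hi = max(0, g_start - PAD), g_end + PAD
        if any(s < hi and lo < e for s, e in regions):
            hits.append(name)
    return hits

# Hypothetical variant positions and gene spans on one chromosome.
regions = associated_regions([1_000_000, 1_020_000, 1_150_000, 5_000_000])
genes = [("GENE_A", 1_030_000, 1_060_000), ("GENE_B", 3_000_000, 3_010_000)]
hits = overlapping_genes(regions, genes)
```

In this example only the two variants that lie within 100 kb of each other survive the length filter, so the isolated variants at 1,150,000 and 5,000,000 contribute no region.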

Results

Description of study samples after QC and filtering

After exclusions, 10,836 participants were available for analysis (5,740 cases; 5,096 controls). This included 218 CH cases and 177 CH controls. The study included more women than men, and, due largely to the selection criteria, cases were younger on average than controls and were more likely to carry one copy of the APOE ε4 allele. In total, the data included 1,524,414 bi-allelic SNVs or short indels. Most variants were rare, with 1,493,926 (98%) of variants having minor allele frequency of less than 5% and 160,898 (11%) having a minor allele count (MAC) of at least 10 copies.

Single-variant SNV and short indels association analysis

We performed single variant analyses for the 160,898 variants with a combined minor allele count of at least 10 copies across all participants (Supplementary Figure S3). Genomic inflation was moderate (λ < 1.1 in all models) (Supplementary Figures S4–S7). Single variant association testing identified three variants at an exome-wide significance level (p < 3.1 × 10−7) and 13 variants at the suggestive threshold (p < 6.1 × 10−6) outside of the APOE region (Table 2, Fig. 1, Supplementary Table S3). The significant associations included the rare missense R47H variant in TREM2 (rs75932628, p = 4.8 × 10−12), a common variant in PILRA (rs2405442, p = 1.7 × 10−7), and a novel rare variant in the long non-coding RNA AC099552.4 (7:154,988,675, p = 1.2 × 10−7). These results were attenuated when including age, sex, and APOE ε2/ε3/ε4 allele as covariates.
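
The article does not state how λ was computed. A conventional estimator, assumed here for illustration, is the ratio of the median observed 1-df chi-square statistic to its expected median under the null; the Python sketch below derives it from a list of two-sided p-values using only the standard library. The function name and the example p-values are hypothetical.

```python
from statistics import NormalDist, median

# Median of a chi-square distribution with 1 degree of freedom (about 0.455).
CHI2_1DF_MEDIAN = NormalDist().inv_cdf(0.75) ** 2

def genomic_inflation(p_values):
    """Estimate lambda from two-sided p-values in the interval (0, 1]."""
    nd = NormalDist()
    # Convert each p-value back to the 1-df chi-square statistic it implies.
    chi2 = [nd.inv_cdf(1.0 - p / 2.0) ** 2 for p in p_values]
    return median(chi2) / CHI2_1DF_MEDIAN

# Evenly spaced p-values mimic a well-calibrated null distribution.
null_like = [(i + 0.5) / 1000 for i in range(1000)]
lam = genomic_inflation(null_like)
```

Values of λ close to 1 indicate that the bulk of the test statistics follows the null distribution; values well above 1 suggest residual confounding such as uncorrected population substructure.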

Table 2

Associations with Individual Variants outside the APOE region

		ADSP Discovery Meta			All Replication (ADGC+CHARGE+ADES-FR+FinnAD)			Discovery + All Replication
Name	gene	MAC (EA/CH)	best P	Model (group)	MAC	P Model 1	P Model 2	P Model 0	P Model 1	P Model 2
6:41129252:C:T (R47H)	TREM2	120/0	4.8E-12	0 (EA)	224	1.6E-06	2.7E-06	3.2E-16	2.8E-10	1.6E-10
7:154988675:G:A	AC099552.4	10/0	1.2E-07	2 (EA)	0	NA	NA	1.3E-02	2.0E-07	1.2E-07
7:99971313:T:C (rs2405442)	PILRA	6,219/219	1.7E-07	0 (EA)	22,798	5.3E-05	2.3E-05	9.5E-10	1.1E-06	5.0E-07
20:62729814:C:T (rs148484121)	OPRL1	61/4	5.8E-07	1 (all)	111	3.4E-01	5.6E-01	3.7E-03	1.4E-04	4.5E-04
11:59940599:T:A (rs7232)	MS4A6A	7,540/258	7.7E-07	0 (all)	20,963	1.4E-11	3.1E-09	5.6E-17	3.8E-14	2.6E-11
17:44828931:G:A (rs199533)	NSF	4,238/135	1.3E-06	0 (all)	11,120	2.5E-01	1.4E-02	2.1E-04	1.6E-02	1.9E-04
14:106235767:C:T (rs77307099)	IGHG3	6,200/176	1.9E-06	0 (all)	721	4.0E-01	3.5E-01	1.4E-06	1.3E-04	7.9E-05
14:106235766:G:A (rs78376194)	IGHG3	6,202/176	1.9E-06	0 (all)	719	4.2E-01	3.6E-01	1.5E-06	1.4E-04	8.5E-05
6:15638035:C:T (rs77460377)	DTNBP1	16/3	1.9E-06	2 (all)	35	8.5E-01	8.7E-01	8.7E-02	5.2E-03	3.0E-03
6:33041297:G:A (rs112178281)	HLA-DPA1	10/0	2.9E-06	1 (EA)	6	7.5E-01	9.2E-01	1.4E-01	2.1E-05	2.0E-05
11:59945745:T:C (rs12453)	MS4A6A	8,265/258	3.2E-06	0 (EA)	23,420	4.7E-11	3.0E-08	1.4E-15	6.0E-13	1.2E-09
3:195506101:T:A	MUC4	38/6	3.8E-06	1 (all)	0	NA	NA	3.0E-04	3.8E-06	8.1E-06
10:88446985:T:C (rs76615432)	LDB3	760/62	5.0E-06	1 (CH)	2,303	5.3E-01	5.9E-01	7.0E-01	6.3E-01	6.4E-01
19:1047507:AGGAGCAG:A	ABCA7	67/0	4.3E-06	0 (EA)	11	8.8E-02	9.7E-02	2.4E-04	1.6E-02	1.7E-02
14:106236128:T:A (rs12890612)	IGHG3	6,395/369	4.5E-06	0 (all)	1,473	8.5E-02	7.5E-02	9.8E-07	8.0E-05	6.4E-05
7:99799845:T:A (rs104395)	STAG3	5,248/248	5.5E-06	0 (EA)	15,948	3.0E-03	1.2E-03	8.8E-07	1.2E-04	4.0E-05

Table shows variants with P < 6.1 × 10−6 in EA, CH, or combined strata in the discovery sample. Exome-wide significant results (P < 3.1 × 10−7) and suggestive results which improved in meta analysis of discovery + replication data are highlighted in bold. Results without variation data in the replication datasets are indicated in italics

Fig. 1

Manhattan plot showing genome-wide association results for individual common variants. The plot shows the p-values from the Discovery meta-analysis against their genomic position for association with AD. Only variants with a combined minor allele count of ≥ 10 were included; the minimum p-value from the three adjustment models for either the meta-analysis, European Ancestry (EA), or Caribbean Hispanic (CH) is plotted for each variant. Genes containing the variant are indicated above points that surpassed our significance threshold for follow-up. The dotted line indicates the threshold for follow-up, p < 6.1 × 10−6, corresponding to (1 / #variants) tested. The dashed line indicates the threshold for exome-wide significance, p < 3.1 × 10−7, corresponding to (0.05 / #variants tested)


Gene-based association analysis combining SNVs and indels

We aggregated 918,053 variants with a combined MAF < 0.05 and annotated as high or moderate impact into gene-based tests using SKAT-O. This corresponds to 17,613 genes with more than one variant and a cumulative minor allele count (cMAC) of at least 10 copies. Applying more stringent filtering, we limited the analysis to variants annotated as high impact, aggregating 42,502 rare or uncommon (MAF < 0.05) variants into 4,634 genes (again, limiting to genes with >1 variant and a cMAC ≥ 10). For the purposes of identifying novel associations, we considered all genes or variants within 250 kb of APOE as part of the APOE locus. Three known genes (ABCA7, TREM2 and CBLC in the APOE region) and two novel genes (OPRL1 and GAS2L2) achieved exome-wide statistical significance for their respective tests in the discovery analyses (Table 3, Fig. 2). Four additional genes (ZNF655, RHBDD1, SIRPB1, and RPS16) reached suggestive significance across the nine models (Fig. 2, Supplementary Table S4). Analyses filtered to include only variants with CADD scores ≥ 15 or ≥ 20 produced most of the same top-ranked results as the VEP gene-based results (Supplementary Table S5), although the overall VEP High/Moderate and CADD≥15 results, as well as the VEP High and CADD≥20 results, are only moderately correlated (Spearman rank correlation r = 0.51) (Supplementary Figure S8). Three novel genes (CACNB3, HHIP-AS1, and RP11-68L1.1) were exome-wide statistically significant in the CH group in analyses restricted to variants with CADD scores ≥ 20 (Supplementary Table S5); however, these are likely false positives because in each instance the result is accounted for by a single variant that was observed in one person only.

Table 3

Gene-based Association Results

		ADSP Discovery Meta			All Replication			Discovery + All Replication
Gene	Variants ^b	SNVs	best P	Model	SNVs	P Model 1	P Model 2	SNVs	P Model 0	P Model 1	P Model 2
TREM2	High-Mod	50	1.8E-11	0	33	9.3E-10	5.4E-09	65	2.0E-17	3.8E-11	6.0E-11
CBLC ^a	High-Mod	44	1.1E-07	0	35	2.5E-20	6.7E-03	61	1.0E-27	6.1E-22	4.9E-02
OPRL1	High-Mod	42	2.6E-06	1	37	1.3E-01	3.0E-01	64	8.3E-03	5.4E-04	1.7E-03
CBX3	High-Mod	8	6.0E-05	0	10	1.3E-01	2.8E-01	17	4.9E-04	4.6E-02	6.1E-02
BCAM ^a	High-Mod	90	5.2E-04	1	88	4.7E-19	3.7E-03	144	3.5E-27	2.8E-20	4.8E-02
GAS2L2	High	7	3.9E-06	2	5	5.1E-02	6.7E-02	10	4.5E-01	3.9E-02	2.9E-02
ZNF655	High	9	2.8E-05	0	6	3.2E-02	3.4E-02	13	7.9E-06	8.4E-04	3.4E-04
RHBDD1	High	2	3.2E-05	2	4	8.8E-01	9.8E-01	5	3.5E-01	4.8E-01	2.7E-01
SIRPB1	High	6	8.0E-05	2	3	9.2E-01	7.9E-01	8	6.4E-01	3.0E-01	2.6E-01
RPS16	High	5	1.6E-04	2	2	7.4E-01	4.2E-01	5	4.4E-02	7.7E-03	6.5E-03
ABCA7	LoF	43	2.1E-06	0	16	1.5E-01	1.1E-01	51	1.2E-04	1.2E-03	3.4E-04
GAS2L2	LoF	7	3.9E-06	2	3	3.9E-02	4.8E-02	8	5.2E-01	4.3E-02	2.5E-02
ZNF655	LoF	8	1.9E-05	0	4	3.9E-02	3.0E-02	10	5.0E-06	4.6E-04	2.0E-04
RPS16	LoF	3	1.6E-04	2	2	7.4E-01	4.2E-01	3	4.1E-02	7.9E-03	6.4E-03

Table shows genes with P-value < 5.7 × 10−5 (High-Mod), 2.2 × 10−4 (High), or 2.8 × 10−4 (LoF) in the total discovery sample. Results surpassing discovery stage Bonferroni-corrected significance thresholds (P = 2.8 × 10−6 for High-Mod, 1.1 × 10−5 for High, and 1.4 × 10−5 for LoF) are indicated in bold.

^a Located in the APOE region

^b Type of functional variants included in the gene-based test

Fig. 2

Manhattan plots showing exome-wide association results for gene-based tests of rare functional variants. The plots show the gene-based p-values from the Discovery meta-analysis against their genomic position for association with AD. Each point represents a p-value from SKAT-O test aggregating rare variants (MAF < 5%), by gene, on the basis of predicted functional impact. Only genes with a cumulative minor allele count of ≥ 10 were included; the minimum p-value from the three adjustment models for either the meta-analysis, European Ancestry (EA), or Caribbean Hispanic (CH) is plotted for each variant. Genes are indicated above points that surpassed our significance threshold for follow-up in tests aggregating only (a) moderate or high impact variants, (b) high impact variants; (c) loss-of-function variants. In each plot, the dotted line indicates the threshold for follow-up: (a) p<5.5 × 10−5, (b) p<6.3 × 10−5, (c) p<2.8 × 10−4, each corresponding to 1 / # genes tested. The dashed line indicates the threshold for exome-wide significance: (a) p<2.7 × 10−6, (b) p<3.1 × 10−6, (c) p < 1.4 × 10−5, each corresponding to 0.05 / # genes tested


Loss-of-function (LOF) association analysis

Among 78,529 unfiltered high impact variants, 72,694 were annotated as LoF and 68,121 were further deemed high-confidence, most of which were frameshift and stop-gained (Fig. 3). As expected, over 90% of these high-confidence LoF variants were singletons (53,120, 78%), doubletons (6,579, 10%), or tripletons (2,222, 3%), and most of these were observed in European Americans only. Association analysis of 2,378 high-confidence LoF variants with MAC ≥ 10 with adjustment for sequencing center and PCs revealed one Bonferroni-corrected significant (p < 2.1 × 10−5) variant, a previously reported frameshift deletion in ABCA7 (Table 3) [42]. Gene-based analysis of 32,863 high-confidence LoF variants with MAF ≤ 5% mapping to 3,558 genes with at least two variants and cMAC ≥ 10 (Supplementary Table S4) also showed that ABCA7 with adjustment for sequencing center and PCs (Model 0) and GAS2L2 with adjustment for sequencing center, PCs, sex, age, and APOE genotype (Model 2) reached the experiment-wide significance threshold (p < 1.4 × 10−5).

Fig. 3

Distribution of high impact, LoF and high-confidence LoF variants grouped by predicted consequence

Replication analysis

Of the 16 single variants outside the APOE region tested in the replication samples (Table 2, Supplementary Table S3), the TREM2 R47H mutation and four variants in three other previously known genes (one missense and one synonymous variant in MS4A6A, a synonymous variant in PILRA, and a missense variant in CR1) were significantly associated with AD in the combined discovery and replication analysis (Table 2). Associations with two variants in a novel gene STAG3 (rs1043915, p = 5.5 × 10−6) were also replicated and significantly associated with AD. We were unable to assess replication with the novel AC099552.4 variant because it was not observed or imputed in the replication datasets. One of the IGHG3 variants (rs12890621) showed borderline evidence for association in models 1 and 2 (p = 0.085 and 0.075, respectively), and evidence for association was strengthened to near exome-wide significance (p = 9.8 × 10−7) in the combined discovery and replication sample. In total, 19 genes across the nine models were significantly or suggestively associated with AD and were tested in the replication stage (Supplementary Table S4). Gene-based tests including high or moderate impact variants showed evidence for replication and reached genome-wide significance in the combined discovery and replication analysis for three genes: TREM2 and two genes in the APOE region (CBLC and BCAM) (Table 3, Supplementary Table S6). The association with GAS2L2, the potential novel gene identified in a model 2 SKAT-O test with high impact SNVs in the discovery sample, was slightly above the nominal significance level (p = 0.051 and p = 0.067, respectively, in models 1 and 2) in meta-analysis across four replication cohorts. However, this association was only nominally significant in the meta-analysis combining the discovery and replication cohorts (p = 0.029 for model 2).
In gene-based tests including only high impact SNVs, the known AD risk gene ABCA7 and the potential novel gene ZNF655 reached the nominal significance threshold of p = 0.05 in meta-analysis with replication cohorts as well as in a meta-analysis of discovery and replication samples (Table 3, Supplementary Table S6). These two genes were also nominally significant in SKAT-O tests limited to high impact variants (ZNF655: p = 7.9 × 10⁻⁶; ABCA7: p = 6.2 × 10⁻⁵) and LoF variants (ZNF655: p = 5.0 × 10⁻⁶). Because PILRA, a previously established AD gene [43], is proximate to STAG3 (159 kb) and ZNF655 (797 kb), we performed conditional analysis in the discovery sample to determine whether these novel association signals are independent. These analyses demonstrated that the association with multiple rare variants in ZNF655 in the gene-based test is distinct from those with common variants in PILRA (p = 1.08 × 10⁻⁴) and STAG3 (p = 7.75 × 10⁻⁵). In a model containing both PILRA and STAG3 variants, the association with PILRA remains significant (p = 0.011) but the association with STAG3 does not (p = 0.21) (Supplementary Table S7).
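The combined discovery and replication p-values reported in this section come from meta-analysis. As a minimal sketch of how evidence from two stages can be pooled, the Python example below implements a sample-size-weighted Z-score meta-analysis. The choice of this particular scheme, the function names, and all input values are illustrative assumptions; this section does not state which meta-analysis method or software was used.

```python
from math import sqrt
from statistics import NormalDist

_STD_NORMAL = NormalDist()  # standard normal distribution (mean 0, sd 1)


def p_to_signed_z(p_two_sided: float, effect_sign: int) -> float:
    """Convert a two-sided p-value and an effect direction (+1 or -1) to a signed Z-score."""
    if not 0.0 < p_two_sided < 1.0:
        raise ValueError("p-value must lie strictly between 0 and 1")
    z = -_STD_NORMAL.inv_cdf(p_two_sided / 2.0)  # positive magnitude
    return z if effect_sign >= 0 else -z


def weighted_z_meta(studies):
    """Pool studies given as (two-sided p, effect sign, sample size) tuples.

    Each study's Z-score is weighted by the square root of its sample size.
    Returns the combined Z-score and its two-sided p-value.
    """
    numerator = 0.0
    sum_sq_weights = 0.0
    for p, sign, n in studies:
        weight = sqrt(n)
        numerator += weight * p_to_signed_z(p, sign)
        sum_sq_weights += weight * weight
    z_meta = numerator / sqrt(sum_sq_weights)
    p_meta = 2.0 * _STD_NORMAL.cdf(-abs(z_meta))
    return z_meta, p_meta


# Hypothetical inputs, not values from the study:
# (two-sided p, direction of effect, sample size) for a discovery and a replication stage.
example = [(1e-4, +1, 10000), (0.08, +1, 8000)]
z, p = weighted_z_meta(example)
print(f"combined Z = {z:.3f}, combined p = {p:.2e}")
```

Under this scheme, concordant effect directions strengthen the combined signal and discordant directions weaken it, which is one way a borderline replication result can still yield a smaller combined p-value.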

Follow-up in ADSP family-based data

We also followed up the significant and suggestive single-variant and gene-based results from the discovery stage in the ADSP whole-genome sequenced (WGS) family-based dataset. A rare missense variant (rs61756195, MAF = 0.001) in STAG3 segregated with disease in three CH families and trended toward association in the case-control study (p = 0.052) (Supplementary Table S8). Gene-based testing identified a nominal association for GAS2L2 (p = 0.049) in EA families (Supplementary Table S9).

Rare variants in established genes from GWAS

We interrogated our individual variant and gene-based aggregate association tests for 299 previously associated AD genes. Among the SNVs and indels, a total of 1,172 variants with MAF < 0.05 and annotated as either HIGH or MODERATE impact are located within 253 interrogated AD genes (Supplementary Table S10). Five of these variants were at least suggestively significant (p < 8.9 × 10⁻⁴) in single variant testing. The most significant associations included the TREM2 R47H missense mutation (p = 4.8 × 10⁻¹²) and the ABCA7 frameshift mutation E709fs (p = 4.3 × 10⁻⁶), which was previously associated with AD in Belgian families [40]. Additional notable signals included the SORL1 variant A528T (p = 8.7 × 10⁻⁵), which was previously associated with AD in a CH population [15], and ACP2 D353E (p = 7.8 × 10⁻⁴). Perturbation of murine Acp2 causes lysosomal storage deficits, kyphoscoliosis, cerebellar abnormalities, and ataxia [44, 45].
For gene-based tests, we aggregated variants on the basis of annotated function, and examined only genes with more than one contributing variant and a cMAC ≥ 10. Of the 299 AD genes, tests were performed on 281 genes aggregating high or moderate impact variants and 86 genes limited to high impact variants. Among these, 13 unique genes surpassed suggestive significance thresholds for high (p < 1.16 × 10⁻²) or high-moderate (p = 3.56 × 10⁻³) impact variants. The strongest associations were observed for moderate impact variants in TREM2 (p = 4.81 × 10⁻¹²) and SORL1 (p = 8.68 × 10⁻⁵), and a high impact variant in ABCA7 (p = 4.33 × 10⁻⁶). Other noteworthy signals included moderate impact variants in NUP88 (p = 4.63 × 10⁻⁴) and ACP2 (p = 7.80 × 10⁻⁴).
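The gene-level eligibility filter and the suggestive thresholds described above can be illustrated with a short Python sketch. The `GeneAggregate` record, the function names, and the example counts are hypothetical. Reading the suggestive thresholds as 1 divided by the number of genes tested is an inference from the reported numbers (1/86 ≈ 0.0116 and 1/281 ≈ 0.00356), not a definition stated in this section.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GeneAggregate:
    """Summary of the variants aggregated for one gene (hypothetical record)."""
    gene: str
    n_variants: int  # number of contributing variants
    cmac: int        # cumulative minor allele count across those variants


def is_testable(agg: GeneAggregate, min_cmac: int = 10) -> bool:
    """Keep genes with more than one contributing variant and cMAC >= min_cmac."""
    return agg.n_variants > 1 and agg.cmac >= min_cmac


def suggestive_threshold(n_tests: int) -> float:
    """Assumed rule: one expected false positive across n_tests, i.e. 1 / n_tests."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return 1.0 / n_tests


# Hypothetical example counts, not values from the study.
candidates = [
    GeneAggregate("GENE_A", n_variants=5, cmac=42),
    GeneAggregate("GENE_B", n_variants=1, cmac=30),  # single variant: excluded
    GeneAggregate("GENE_C", n_variants=3, cmac=7),   # cMAC below 10: excluded
]
testable = [g.gene for g in candidates if is_testable(g)]
print(testable)

# Thresholds implied by the numbers of genes tested in the two variant sets.
for n_genes in (86, 281):
    print(n_genes, f"{suggestive_threshold(n_genes):.2e}")
```

If the assumed rule holds, the threshold is more lenient for the high impact set simply because fewer genes (86 versus 281) had enough qualifying variants to be tested.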

Discussion

Our WES study, the largest for AD conducted to date, identified novel associations with variants in three genes not previously implicated in AD including one common nearly exome-wide significant variant each in IGHG3 (p = 9.8 × 10⁻⁷) and STAG3 (p = 8.8 × 10⁻⁷), and one rare exome-wide significant variant in AC099552.4 (p = 1.2 × 10⁻⁷). We also observed a gene-wide significant association with ZNF655 in a gene-based test including nine high-impact rare variants (p = 5.0 × 10⁻⁶). These results remained significant after multiple test correction and were confirmed in or strengthened by a replication sample comprised of four independent datasets, with the exception of the variant in AC099552.4 which was invariant in the replication samples. We also confirmed associations with common and rare variants in several previously established AD genes including ABCA7, APOE, HLA-DPA1, MS4A6A, PILRA, SORL1 and TREM2.
ZNF655 is expressed in brain and encodes the Vav-interacting Krüppel-like factor 1 [46]. Krüppel-like factors (KLFs) are zinc finger-containing transcription factors that regulate diverse biological processes, including proliferation, differentiation, growth, development, survival, and responses to external stress [47]. Several KLFs have been shown to participate in neuronal morphogenesis and to control the regenerative capacity of neurons in the central nervous system. AC099552.4 is a long non-coding RNA, an abundant class of RNA sequences which regulate gene transcription and expression [48] and impact neuronal development, neuroplasticity, and cognition [49]. Non-coding RNA-dependent regulation affecting AD-related processes has been demonstrated for SORL1 [50] and in a triple transgenic model of AD [51].
IGHG3 encodes immunoglobulin heavy constant gamma 3 and is a member of the IgG family, for which antibodies have been shown to cross-react with fibril and oligomer amyloid-β aggregates [52], leading to speculation that immunoglobulin GM (γ marker) genes contain functional risk and protective factors for AD [53]. The anti-amyloidogenic activity of IgG appears to be an inherent property of free human IgG heavy chains [54]. Recent analysis of structural variants in whole genome sequence data for 578 members of 101 families with multiple AD subjects included in the ADSP [26] yielded additional evidence supporting IGHG3 as an AD risk locus. A total of nine distinct deletions in the IGH region were identified as disproportionately represented in AD cases compared to controls. One of these is a 188 bp deletion that was observed in 35 AD cases and 8 controls and is located 592 bp from the AD-associated SNV (rs12890621) in this study. This deletion eliminates a large portion of IGHG3 intron 2 and exon 3 (reference transcript ENST00000390551), and is predicted to have high impact on the encoded product. It is unlikely that the deletion and rs12890621 tag the same effector of AD risk because the deletion is rare, whereas rs12890621 is more common (MAF = 0.0475 in EA subjects according to the ExAC database). Of note, a nearby pseudogene in the IgG family, IGHV1-67, located approximately 350 kb from IGHG3, has been previously reported in a gene-wide association study conducted by the International Genomics of Alzheimer’s Project (IGAP) [1].
The association with the common synonymous variant in STAG3 (rs1043915, MAF = 0.26) is not independent of the finding with a common SNV in PILRA, a previously reported AD-associated gene [43] located in an established AD locus [9]. However, rare variants in STAG3 identified by WGS showed evidence of co-segregation with AD in CH families, suggesting the possibility that STAG3 has a distinct mechanistic role in AD.
STAG3, stromal antigen 3, encodes a subunit of the cohesin complex which regulates the cohesion of sister chromatids during cell division. Whether the association with AD observed here is mediated at least in part through STAG3 function or simply reflects linkage disequilibrium with other causal variants/genes in the region remains to be established. Rare coding STAG3 variants have been identified in primary ovarian insufficiency [55]. Although STAG3 is expressed in the brain, its role there remains unclear. Interestingly, data from GTEx show that the associated variant is an eQTL for multiple genes in various brain tissues, including STAG3, AGFG2, GAL3ST4, GATS, and PVRIG. In a mouse model of diabetes, microvascular damage in the neurovascular unit of the retina was associated with alteration in STAG3 expression [56].
A variant in NSF showed nominally significant evidence of association in the replication sample (p = 0.014) in the model adjusting for age, sex, and APOE ε4 status, whereas the result in the discovery sample was observed in the model without these covariates. NSF encodes N-ethylmaleimide sensitive factor, a vesicle-fusing ATPase that is involved in membrane trafficking of proteins and neurotransmitter release [57] and has been observed in brain homogenates of cases of familial neuronal intranuclear inclusion disease [58]. NSF SNVs have been associated with cocaine dependence [59], and its expression is reduced in prefrontal cortex in schizophrenia patients [60]. Vesicular trafficking has an important role in AD, exemplified by genetic and biological evidence for neuronal sorting proteins including SORL1 [61-63].
We were unable to replicate variants at five loci that showed significant association in the discovery sample (p ≤ 5.0 × 10⁻⁶). Failure to replicate findings for the OPRL1 and DTNBP1 variants may be due to their lower MAF and, hence, uncertainty in the imputation quality and lack of imputed indels in the ADGC GWAS replication sample.
Nonetheless, both of these genes are potentially attractive biological candidates. Opioid related nociceptin receptor 1 modulates a variety of biological functions and neurobehavior, including learning and memory, and inflammatory and immune functions [64, 65]. DTNBP1 encodes the dystrobrevin binding protein 1 which has been genetically linked to multiple psychiatric disorders, as well as cognitive and memory functions in healthy human subjects [66, 67]. Analysis of rare variants in the regions of genes previously identified as related to AD by GWAS revealed genome-wide significant or suggestive evidence of association in established genes including TREM2, SORL1, and ABCA7. In addition, notable associations were observed with other genes in these regions not previously linked to AD including TREML4, SPPL2A, and AP4M1 (Supplementary Table S10). TREML4 is located near TREM2 and encodes a TREM family receptor that, similar to TREM2, is expressed on the surface of myeloid cells and participates in the phagocytic clearance of dead cells [68]. SPPL2A encodes an endosomal-lysosomal protease and presenilin homolog that regulates B-cell homeostasis in vivo [69]. Homozygous mutations in AP4M1, located in the region including PILRA and STAG3, cause spastic tetraplegia, intellectual disability, and white matter loss [70]. Its encoded protein is a component of the AP-4 trafficking complex that regulates APP processing and beta-amyloid secretion in cell models [71]. Further studies are needed to conclude whether the association findings in this latter group of genes are robust and warrant experiments to determine their functional relevance to AD. Notably, there is little overlap of our results with findings of large GWAS focused on common variants [1, 2, 9]. This is due in part to our focus on only infrequent or rare variants (MAF < 0.05) that are functionally-annotated to be of at least moderate impact and may not have been well covered by GWAS arrays or imputation. 
With the notable exception of APOE, common variants associated with AD have very modest effect on risk (OR < 1.3) [9], and all but a few of these associations [4, 5, 8, 10, 12] required a sample between two and nearly seven times larger than the sample in this study to have sufficient power to detect them [1, 2, 9].
Our study has several notable strengths and limitations. The ascertainment scheme for this sample is optimal for detection of association with both risk and protective variants for AD [26]. Specifically, the AD cases were selected to have relatively early onset (with a minimum age of 65) and a lower frequency of the APOE ε4 allele with the expectation that they were likely to be more enriched for rare, highly penetrant AD risk variants compared to most late-onset AD cases. Controls were selected to be as old as possible with preference given to those having at least one APOE ε4 allele to enrich this group for protective variants. However, this scheme introduced confounding between age and AD status which reduced power for detecting associations. To overcome this limitation, we included a model without age adjustment which yielded the largest number of new association findings, including several that were replicated in independent datasets which were analyzed with age adjustment. Thus, it was important to include models which did or did not include a covariate for age in order to account for confounding with AD status as well as age-dependent effects of the genetic factor. Despite simulations showing that this sample had sufficient power to detect associations with variants whose frequencies were as low as 0.005 and an effect size greater than 1.8 [26], the number of novel rare variant findings was small. We also acknowledge that p-value thresholds did not account for the number of models tested; however, the models are highly correlated (Supplementary Table S2).
CH participants were a pivotal portion of a multi-ethnic sample that led to the discovery of common variant associations in other AD loci, most notably SORL1 [62], but for rare variant discovery their inclusion may have reduced power by increasing genetic heterogeneity of the total sample. This conclusion is consistent with observations of few novel findings in this WES study showing discernible contributions by the CH dataset and with the discovery of novel rare variant associations in a whole genome sequence study that were unique to EA and CH families, respectively [34, 35]. Nonetheless, the non-Hispanic portion of our sample was sufficiently large to detect multiple novel associations. Our findings suggest that additional large and ancestrally diverse cohorts with deep sequence data will need to be examined for replication and to provide a larger discovery sample.
Successful replication of only some of the most significant findings in novel genes not in the APOE region (4/8 individual variants in Table 2, 1/18 genes in Table 3) is somewhat concerning but highlights the difficulty of designing well-powered replication studies of sequencing findings. Although it is possible that some of these findings are false positives, we acknowledge that the size of the WES replication samples combined (2,778 AD cases, 7,262 controls) was inadequate. In addition, many rare variants were not well-imputed or, in the case of most indels, not imputed at all in the ADGC GWAS dataset, despite the use of the HRC reference panel which contains haplotypes derived from whole genome sequence data for more than 30,000 individuals who were not ascertained for AD research. Thus, additional large WES samples will need to be studied to obtain definitive evidence about findings that did not replicate.
In summary, our significant association findings with functional rare variants in novel genes provide further support for the roles of neuroinflammation (IGHG3) and transcriptional regulation (AC099552.4 and ZNF655) in AD. In addition, we identified many novel associations with rare functional variants in previously established AD genes. In most cases, these rare variants do not explain association signals that were previously identified by GWAS with common and predominantly non-functional variants. Hence, many of our findings will provide insight into disease mechanisms and targets for biological experiments to gain further understanding about the role of these genes in AD pathogenesis. However, other deep sequencing approaches (e.g., whole genome, target gene resequencing) will be needed to identify variants which account for association signals in non-coding regions and the contribution of structural variants (e.g., larger insertions and deletions, copy number variants, etc.) to AD risk.

64 in total

1. A mutation in APP protects against Alzheimer's disease and age-related cognitive decline.

Authors: Thorlakur Jonsson; Jasvinder K Atwal; Stacy Steinberg; Jon Snaedal; Palmi V Jonsson; Sigurbjorn Bjornsson; Hreinn Stefansson; Patrick Sulem; Daniel Gudbjartsson; Janice Maloney; Kwame Hoyte; Amy Gustafson; Yichin Liu; Yanmei Lu; Tushar Bhangale; Robert R Graham; Johanna Huttenlocher; Gyda Bjornsdottir; Ole A Andreassen; Erik G Jönsson; Aarno Palotie; Timothy W Behrens; Olafur T Magnusson; Augustine Kong; Unnur Thorsteinsdottir; Ryan J Watts; Kari Stefansson
Journal: Nature Date: 2012-08-02 Impact factor: 49.962

2. Group B streptococcal colonization among hospital personnel.

Authors: E Concia; P Marone; L Marino; A Riccardi; E Sciarra
Journal: Boll Ist Sieroter Milan Date: 1985

3. Genome-wide association study identifies variants at CLU and CR1 associated with Alzheimer's disease.

Authors: Jean-Charles Lambert; Simon Heath; Gael Even; Dominique Campion; Kristel Sleegers; Mikko Hiltunen; Onofre Combarros; Diana Zelenika; Maria J Bullido; Béatrice Tavernier; Luc Letenneur; Karolien Bettens; Claudine Berr; Florence Pasquier; Nathalie Fiévet; Pascale Barberger-Gateau; Sebastiaan Engelborghs; Peter De Deyn; Ignacio Mateo; Ana Franck; Seppo Helisalmi; Elisa Porcellini; Olivier Hanon; Marian M de Pancorbo; Corinne Lendon; Carole Dufouil; Céline Jaillard; Thierry Leveillard; Victoria Alvarez; Paolo Bosco; Michelangelo Mancuso; Francesco Panza; Benedetta Nacmias; Paola Bossù; Paola Piccardi; Giorgio Annoni; Davide Seripa; Daniela Galimberti; Didier Hannequin; Federico Licastro; Hilkka Soininen; Karen Ritchie; Hélène Blanché; Jean-François Dartigues; Christophe Tzourio; Ivo Gut; Christine Van Broeckhoven; Annick Alpérovitch; Mark Lathrop; Philippe Amouyel
Journal: Nat Genet Date: 2009-09-06 Impact factor: 38.330

4. Catabolic instability, plasmid gene deletion and recombination in Alcaligenes sp. BR60.

Authors: R C Wyndham; R K Singh; N A Straus
Journal: Arch Microbiol Date: 1988 Impact factor: 2.552

5. Variant of TREM2 associated with the risk of Alzheimer's disease.

Authors: Thorlakur Jonsson; Hreinn Stefansson; Stacy Steinberg; Ingileif Jonsdottir; Palmi V Jonsson; Jon Snaedal; Sigurbjorn Bjornsson; Johanna Huttenlocher; Allan I Levey; James J Lah; Dan Rujescu; Harald Hampel; Ina Giegling; Ole A Andreassen; Knut Engedal; Ingun Ulstein; Srdjan Djurovic; Carla Ibrahim-Verbaas; Albert Hofman; M Arfan Ikram; Cornelia M van Duijn; Unnur Thorsteinsdottir; Augustine Kong; Kari Stefansson
Journal: N Engl J Med Date: 2012-11-14 Impact factor: 91.245

6. Common polygenic variation enhances risk prediction for Alzheimer's disease.

Authors: Valentina Escott-Price; Rebecca Sims; Christian Bannister; Denise Harold; Maria Vronskaya; Elisa Majounie; Nandini Badarinarayan; Kevin Morgan; Peter Passmore; Clive Holmes; John Powell; Carol Brayne; Michael Gill; Simon Mead; Alison Goate; Carlos Cruchaga; Jean-Charles Lambert; Cornelia van Duijn; Wolfgang Maier; Alfredo Ramirez; Peter Holmans; Lesley Jones; John Hardy; Sudha Seshadri; Gerard D Schellenberg; Philippe Amouyel; Julie Williams
Journal: Brain Date: 2015-10-21 Impact factor: 13.501

7. Common variants at MS4A4/MS4A6E, CD2AP, CD33 and EPHA1 are associated with late-onset Alzheimer's disease.

Authors: Adam C Naj; Gyungah Jun; Gary W Beecham; Li-San Wang; Badri Narayan Vardarajan; Jacqueline Buros; Paul J Gallins; Joseph D Buxbaum; Gail P Jarvik; Paul K Crane; Eric B Larson; Thomas D Bird; Bradley F Boeve; Neill R Graff-Radford; Philip L De Jager; Denis Evans; Julie A Schneider; Minerva M Carrasquillo; Nilufer Ertekin-Taner; Steven G Younkin; Carlos Cruchaga; John S K Kauwe; Petra Nowotny; Patricia Kramer; John Hardy; Matthew J Huentelman; Amanda J Myers; Michael M Barmada; F Yesim Demirci; Clinton T Baldwin; Robert C Green; Ekaterina Rogaeva; Peter St George-Hyslop; Steven E Arnold; Robert Barber; Thomas Beach; Eileen H Bigio; James D Bowen; Adam Boxer; James R Burke; Nigel J Cairns; Chris S Carlson; Regina M Carney; Steven L Carroll; Helena C Chui; David G Clark; Jason Corneveaux; Carl W Cotman; Jeffrey L Cummings; Charles DeCarli; Steven T DeKosky; Ramon Diaz-Arrastia; Malcolm Dick; Dennis W Dickson; William G Ellis; Kelley M Faber; Kenneth B Fallon; Martin R Farlow; Steven Ferris; Matthew P Frosch; Douglas R Galasko; Mary Ganguli; Marla Gearing; Daniel H Geschwind; Bernardino Ghetti; John R Gilbert; Sid Gilman; Bruno Giordani; Jonathan D Glass; John H Growdon; Ronald L Hamilton; Lindy E Harrell; Elizabeth Head; Lawrence S Honig; Christine M Hulette; Bradley T Hyman; Gregory A Jicha; Lee-Way Jin; Nancy Johnson; Jason Karlawish; Anna Karydas; Jeffrey A Kaye; Ronald Kim; Edward H Koo; Neil W Kowall; James J Lah; Allan I Levey; Andrew P Lieberman; Oscar L Lopez; Wendy J Mack; Daniel C Marson; Frank Martiniuk; Deborah C Mash; Eliezer Masliah; Wayne C McCormick; Susan M McCurry; Andrew N McDavid; Ann C McKee; Marsel Mesulam; Bruce L Miller; Carol A Miller; Joshua W Miller; Joseph E Parisi; Daniel P Perl; Elaine Peskind; Ronald C Petersen; Wayne W Poon; Joseph F Quinn; Ruchita A Rajbhandary; Murray Raskind; Barry Reisberg; John M Ringman; Erik D Roberson; Roger N Rosenberg; Mary Sano; Lon S Schneider; William Seeley; Michael L Shelanski; Michael A Slifer; 
Charles D Smith; Joshua A Sonnen; Salvatore Spina; Robert A Stern; Rudolph E Tanzi; John Q Trojanowski; Juan C Troncoso; Vivianna M Van Deerlin; Harry V Vinters; Jean Paul Vonsattel; Sandra Weintraub; Kathleen A Welsh-Bohmer; Jennifer Williamson; Randall L Woltjer; Laura B Cantwell; Beth A Dombroski; Duane Beekly; Kathryn L Lunetta; Eden R Martin; M Ilyas Kamboh; Andrew J Saykin; Eric M Reiman; David A Bennett; John C Morris; Thomas J Montine; Alison M Goate; Deborah Blacker; Debby W Tsuang; Hakon Hakonarson; Walter A Kukull; Tatiana M Foroud; Jonathan L Haines; Richard Mayeux; Margaret A Pericak-Vance; Lindsay A Farrer; Gerard D Schellenberg
Journal: Nat Genet Date: 2011-04-03 Impact factor: 38.330

8. Common variants at ABCA7, MS4A6A/MS4A4E, EPHA1, CD33 and CD2AP are associated with Alzheimer's disease.

Authors: Paul Hollingworth; Denise Harold; Rebecca Sims; Amy Gerrish; Jean-Charles Lambert; Minerva M Carrasquillo; Richard Abraham; Marian L Hamshere; Jaspreet Singh Pahwa; Valentina Moskvina; Kimberley Dowzell; Nicola Jones; Alexandra Stretton; Charlene Thomas; Alex Richards; Dobril Ivanov; Caroline Widdowson; Jade Chapman; Simon Lovestone; John Powell; Petroula Proitsi; Michelle K Lupton; Carol Brayne; David C Rubinsztein; Michael Gill; Brian Lawlor; Aoibhinn Lynch; Kristelle S Brown; Peter A Passmore; David Craig; Bernadette McGuinness; Stephen Todd; Clive Holmes; David Mann; A David Smith; Helen Beaumont; Donald Warden; Gordon Wilcock; Seth Love; Patrick G Kehoe; Nigel M Hooper; Emma R L C Vardy; John Hardy; Simon Mead; Nick C Fox; Martin Rossor; John Collinge; Wolfgang Maier; Frank Jessen; Eckart Rüther; Britta Schürmann; Reiner Heun; Heike Kölsch; Hendrik van den Bussche; Isabella Heuser; Johannes Kornhuber; Jens Wiltfang; Martin Dichgans; Lutz Frölich; Harald Hampel; John Gallacher; Michael Hüll; Dan Rujescu; Ina Giegling; Alison M Goate; John S K Kauwe; Carlos Cruchaga; Petra Nowotny; John C Morris; Kevin Mayo; Kristel Sleegers; Karolien Bettens; Sebastiaan Engelborghs; Peter P De Deyn; Christine Van Broeckhoven; Gill Livingston; Nicholas J Bass; Hugh Gurling; Andrew McQuillin; Rhian Gwilliam; Panagiotis Deloukas; Ammar Al-Chalabi; Christopher E Shaw; Magda Tsolaki; Andrew B Singleton; Rita Guerreiro; Thomas W Mühleisen; Markus M Nöthen; Susanne Moebus; Karl-Heinz Jöckel; Norman Klopp; H-Erich Wichmann; V Shane Pankratz; Sigrid B Sando; Jan O Aasly; Maria Barcikowska; Zbigniew K Wszolek; Dennis W Dickson; Neill R Graff-Radford; Ronald C Petersen; Cornelia M van Duijn; Monique M B Breteler; M Arfan Ikram; Anita L DeStefano; Annette L Fitzpatrick; Oscar Lopez; Lenore J Launer; Sudha Seshadri; Claudine Berr; Dominique Campion; Jacques Epelbaum; Jean-François Dartigues; Christophe Tzourio; Annick Alpérovitch; Mark Lathrop; Thomas M Feulner; Patricia Friedrich; 
Caterina Riehle; Michael Krawczak; Stefan Schreiber; Manuel Mayhaus; S Nicolhaus; Stefan Wagenpfeil; Stacy Steinberg; Hreinn Stefansson; Kari Stefansson; Jon Snaedal; Sigurbjörn Björnsson; Palmi V Jonsson; Vincent Chouraki; Benjamin Genier-Boley; Mikko Hiltunen; Hilkka Soininen; Onofre Combarros; Diana Zelenika; Marc Delepine; Maria J Bullido; Florence Pasquier; Ignacio Mateo; Ana Frank-Garcia; Elisa Porcellini; Olivier Hanon; Eliecer Coto; Victoria Alvarez; Paolo Bosco; Gabriele Siciliano; Michelangelo Mancuso; Francesco Panza; Vincenzo Solfrizzi; Benedetta Nacmias; Sandro Sorbi; Paola Bossù; Paola Piccardi; Beatrice Arosio; Giorgio Annoni; Davide Seripa; Alberto Pilotto; Elio Scarpini; Daniela Galimberti; Alexis Brice; Didier Hannequin; Federico Licastro; Lesley Jones; Peter A Holmans; Thorlakur Jonsson; Matthias Riemenschneider; Kevin Morgan; Steven G Younkin; Michael J Owen; Michael O'Donovan; Philippe Amouyel; Julie Williams
Journal: Nat Genet Date: 2011-04-03 Impact factor: 38.330

9. Gene-wide analysis detects two new susceptibility genes for Alzheimer's disease.

Authors: Valentina Escott-Price; Céline Bellenguez; Li-San Wang; Seung-Hoan Choi; Denise Harold; Lesley Jones; Peter Holmans; Amy Gerrish; Alexey Vedernikov; Alexander Richards; Anita L DeStefano; Jean-Charles Lambert; Carla A Ibrahim-Verbaas; Adam C Naj; Rebecca Sims; Gyungah Jun; Joshua C Bis; Gary W Beecham; Benjamin Grenier-Boley; Giancarlo Russo; Tricia A Thornton-Wells; Nicola Denning; Albert V Smith; Vincent Chouraki; Charlene Thomas; M Arfan Ikram; Diana Zelenika; Badri N Vardarajan; Yoichiro Kamatani; Chiao-Feng Lin; Helena Schmidt; Brian Kunkle; Melanie L Dunstan; Maria Vronskaya; Andrew D Johnson; Agustin Ruiz; Marie-Thérèse Bihoreau; Christiane Reitz; Florence Pasquier; Paul Hollingworth; Olivier Hanon; Annette L Fitzpatrick; Joseph D Buxbaum; Dominique Campion; Paul K Crane; Clinton Baldwin; Tim Becker; Vilmundur Gudnason; Carlos Cruchaga; David Craig; Najaf Amin; Claudine Berr; Oscar L Lopez; Philip L De Jager; Vincent Deramecourt; Janet A Johnston; Denis Evans; Simon Lovestone; Luc Letenneur; Isabel Hernández; David C Rubinsztein; Gudny Eiriksdottir; Kristel Sleegers; Alison M Goate; Nathalie Fiévet; Matthew J Huentelman; Michael Gill; Kristelle Brown; M Ilyas Kamboh; Lina Keller; Pascale Barberger-Gateau; Bernadette McGuinness; Eric B Larson; Amanda J Myers; Carole Dufouil; Stephen Todd; David Wallon; Seth Love; Ekaterina Rogaeva; John Gallacher; Peter St George-Hyslop; Jordi Clarimon; Alberto Lleo; Anthony Bayer; Debby W Tsuang; Lei Yu; Magda Tsolaki; Paola Bossù; Gianfranco Spalletta; Petra Proitsi; John Collinge; Sandro Sorbi; Florentino Sanchez Garcia; Nick C Fox; John Hardy; Maria Candida Deniz Naranjo; Paolo Bosco; Robert Clarke; Carol Brayne; Daniela Galimberti; Elio Scarpini; Ubaldo Bonuccelli; Michelangelo Mancuso; Gabriele Siciliano; Susanne Moebus; Patrizia Mecocci; Maria Del Zompo; Wolfgang Maier; Harald Hampel; Alberto Pilotto; Ana Frank-García; Francesco Panza; Vincenzo Solfrizzi; Paolo Caffarra; Benedetta Nacmias; William Perry; 
Manuel Mayhaus; Lars Lannfelt; Hakon Hakonarson; Sabrina Pichler; Minerva M Carrasquillo; Martin Ingelsson; Duane Beekly; Victoria Alvarez; Fanggeng Zou; Otto Valladares; Steven G Younkin; Eliecer Coto; Kara L Hamilton-Nelson; Wei Gu; Cristina Razquin; Pau Pastor; Ignacio Mateo; Michael J Owen; Kelley M Faber; Palmi V Jonsson; Onofre Combarros; Michael C O'Donovan; Laura B Cantwell; Hilkka Soininen; Deborah Blacker; Simon Mead; Thomas H Mosley; David A Bennett; Tamara B Harris; Laura Fratiglioni; Clive Holmes; Renee F A G de Bruijn; Peter Passmore; Thomas J Montine; Karolien Bettens; Jerome I Rotter; Alexis Brice; Kevin Morgan; Tatiana M Foroud; Walter A Kukull; Didier Hannequin; John F Powell; Michael A Nalls; Karen Ritchie; Kathryn L Lunetta; John S K Kauwe; Eric Boerwinkle; Matthias Riemenschneider; Mercè Boada; Mikko Hiltunen; Eden R Martin; Reinhold Schmidt; Dan Rujescu; Jean-François Dartigues; Richard Mayeux; Christophe Tzourio; Albert Hofman; Markus M Nöthen; Caroline Graff; Bruce M Psaty; Jonathan L Haines; Mark Lathrop; Margaret A Pericak-Vance; Lenore J Launer; Christine Van Broeckhoven; Lindsay A Farrer; Cornelia M van Duijn; Alfredo Ramirez; Sudha Seshadri; Gerard D Schellenberg; Philippe Amouyel; Julie Williams
Journal: PLoS One Date: 2014-06-12 Impact factor: 3.240

10. TREM2 variants in Alzheimer's disease.

Authors: Rita Guerreiro; Aleksandra Wojtas; Jose Bras; Minerva Carrasquillo; Ekaterina Rogaeva; Elisa Majounie; Carlos Cruchaga; Celeste Sassi; John S K Kauwe; Steven Younkin; Lilinaz Hazrati; John Collinge; Jennifer Pocock; Tammaryn Lashley; Julie Williams; Jean-Charles Lambert; Philippe Amouyel; Alison Goate; Rosa Rademakers; Kevin Morgan; John Powell; Peter St George-Hyslop; Andrew Singleton; John Hardy
Journal: N Engl J Med Date: 2012-11-14 Impact factor: 91.245
