Literature DB >> 30636644

Genome-wide analysis of genetic predisposition to Alzheimer's disease and related sex disparities.

Alireza Nazarian1, Anatoliy I Yashin2, Alexander M Kulminski3.   

Abstract

BACKGROUND: Alzheimer's disease (AD) is the most common cause of dementia in the elderly and the sixth leading cause of death in the United States. AD is mainly considered a complex disorder with polygenic inheritance. Despite discovering many susceptibility loci, a major proportion of AD genetic variance remains to be explained.
METHODS: We investigated the genetic architecture of AD in four publicly available independent datasets through genome-wide association, transcriptome-wide association, and gene-based and pathway-based analyses. To explore differences in the genetic basis of AD between males and females, analyses were performed on three samples in each dataset: males and females combined, only males, or only females.
RESULTS: Our genome-wide association analyses corroborated the associations of several previously detected AD loci and revealed novel significant associations of 35 single-nucleotide polymorphisms (SNPs) outside the chromosome 19q13 region at the suggestive significance level of p < 5E-06. These SNPs were mapped to 21 genes in 19 chromosomal regions. Of these, 17 genes were not associated with AD at genome-wide or suggestive levels of associations by previous genome-wide association studies. Also, the chromosomal regions corresponding to 8 genes did not contain any previously detected AD-associated SNPs with p < 5E-06. Our transcriptome-wide association and gene-based analyses revealed that 26 genes located in 20 chromosomal regions outside chromosome 19q13 had evidence of potential associations with AD at a false discovery rate of 0.05. Of these, 13 genes/regions did not contain any previously AD-associated SNPs at genome-wide or suggestive levels of associations. Most of the newly detected AD-associated SNPs and genes were sex specific, indicating sex disparities in the genetic basis of AD. Also, 7 of 26 pathways that showed evidence of associations with AD in our pathway-bases analyses were significant only in females.
CONCLUSIONS: Our findings, particularly the newly discovered sex-specific genetic contributors, provide novel insight into the genetic architecture of AD and can advance our understanding of its pathogenesis.

Entities:  

Keywords:  Alzheimer’s disease; Gene-based analysis; Genome-wide association study; Meta-analysis; Sex disparities

Mesh:

Year:  2019        PMID: 30636644      PMCID: PMC6330399          DOI: 10.1186/s13195-018-0458-8

Source DB:  PubMed          Journal:  Alzheimers Res Ther            Impact factor:   6.982


Background

Alzheimer’s disease (AD) is a slowly progressive neurodegenerative disorder that usually manifests with insidious deterioration of cognitive functions such as memory, language, judgment, and reasoning. Visuospatial deficits and neuropsychiatric symptoms like anxiety, irritability, depression, delusion, and personality changes may occur in the course of the disease, and these are eventually followed by impairment of most daily activities [1, 2]. The median survival is 3.3–11.7 years after disease manifestation [3]. Except for some uncommon autosomal dominant forms, AD is mainly a complex disorder with a polygenic nature [2, 4] that predominantly affects elderly individuals, also known as late-onset AD. It is the most common cause of dementia in the elderly worldwide [5] and is the sixth leading cause of death in the United States [6]. Age is the main risk factor for AD. The annual incidence increases from 1% at age 65 years to 6–8% after 85 years [7], and its prevalence increases from 11% to 32% [5]. In addition, AD is more prevalent in females than males [7-10], with their lifetime risk of developing the disease being almost twice that of males [7]. This might be to some extent justified by different life expectancies of males and females. However, Genin et al. [11] suggested that the age-adjusted penetrance of Apolipoprotein E (APOE) was sex dependent as well. For instance, they found that the lifetime risks for homozygote APOE-ε4 carriers were 51% and 60% in males and females older than 85 years, respectively. The corresponding risks for heterozygote APOE-ε3ε4 carriers were 23% and 30%, respectively [11]. AD is also more severe in females than males [9]. Henderson and Buckwalter [12] reported that female AD patients had greater impairment of naming task, verbal fluency, and delayed recall compared to male patients. In another study, Barnes et al. [13] suggested that females were more likely to develop clinical AD compared to males in response to pathology changes (e.g., amyloid beta (Aβ) and neurofibrillary tangles) in the brain. They found that each additional unit of pathology in the brain would increase the odds of overt AD by 20-fold and 3-fold in females and males, respectively [13]. The underlying mechanisms of sex disparity in AD are not fully clear [9, 14]. This may raise the possibility that such sex disparities might be in part due to potential differences in the genetic bases of AD between males and females. Investigating such differences is important, particularly for tailoring more effective medical interventions [14, 15]. Give the considerable physical, emotional, and economic burdens imposed by AD on patients, their families, and societies, exploring the genetic and nongenetic mechanisms underlying its pathogenesis has become a public health priority. With increased life expectancy, the prevalence and global economic costs of AD are forecast to increase considerably by 2050 [5]. Many studies have investigated the genetic basis of AD. APOE was the first gene linked to late-onset AD [16], and, in particular, the dosage of its ε4 allele was implicated in increasing the risks of disease and earlier onset [17]. More susceptibility loci were detected with the advent of genome-wide association (GWA) methodology, although not all of them were consistently replicated in independent datasets. In addition to APOE, which was almost universally replicated, BIN1, CLU, CR1, CD2AP, CD33, MS4A4E, MS4A6A, EPHA1, and PICALM genes have been associated with the polygenic form of AD in different studies [18, 19]. The narrow-sense heritability (h2) of AD (i.e., the proportion of its phenotypic variance explained by additive genetic variance) has been estimated to be 58–79% by twin studies [20]. Furthermore, Ridge et al. [19], using a linear mixed models (LMMs) framework, found that 53% of phenotypic variance of AD can be explained by ~ 8 million single-nucleotide polymorphisms (SNPs). They also noticed that SNPs inside known AD-associated genes or within their 50 kb upstream/downstream regions can only explain ~ 31% of AD phenotypic variance (~ 59% of genetic variance) [19], leaving a sizable portion of its h2 to be explained. In this study, we investigated the genetic architecture of polygenic AD through genome-wide association (GWA), transcriptome-wide association (TWA), gene-based, and pathway-based analyses in four independent datasets (two with family designs and two with population designs) using genetic information for approximately 2 million genotyped and imputed SNPs. Since exploring the genetic sex disparity of AD was of particular interest, in addition to analyzing the entire sample of males and females in each dataset, two alternative plans were also considered in which either only males or only females were included in analyses.

Methods

Study participants

Four independent datasets were used to fulfill the aims of this study: Late-Onset Alzheimer’s Disease Family Study from the National Institute on Aging (NIA-LOADFS) [21]; Framingham SNP Health Association Resource (SHARe) project from Framingham Heart Study (FHS) [22-24]; SNP Typing for Association with Multiple Phenotypes from Existing Epidemiologic Data (STAMPEED) project from Cardiovascular Health Study (CHS) [25]; and University of Michigan Health and Retirement Study (HRS) [26]. All four datasets were approved by the institutional review boards (IRBs) and had gathered data after obtaining written informed consent from participants or their legal guardians/proxies. Details about the designs of the NIA-LOADFS, FHS, CHS, and HRS studies can be found in the original publications. Briefly, the NIA-LOADFS is a family-based study primarily initiated to investigate late-onset AD risk factors. It recruited families with multiple affected members if the age at AD onset or diagnosis of proband was above 60 years. Controls were selected from unaffected individuals with a minimum age of 50 years who had no history of major neurological/psychiatric disorders or life-threatening conditions. Of 9468 participants with phenotype data, 5220 subjects (2319 affected with AD), predominantly Caucasians, were genotyped using Illumina’s Human 610-Quad array. The FHS is an ongoing longitudinal study with a family-based design that provides phenotype and genotype information on individuals from three-generational families with Caucasian ancestry. The main objective of the study was to investigate cardiovascular disorder risk factors. It was first initiated by recruiting 5209 participants (i.e., original cohort) between ages 30 and 62 years with no history of cardiac disease or stroke. Later, the cohort was expanded by adding the offspring of the original cohort and their spouses (5124 subjects as the offspring cohort) and their grandchildren (4095 subjects as the third generation). Of these, 9274 individuals (1529, 3852, and 3893 individuals from the three aforementioned generations, respectively) were genotyped using the Affymetrix Human Mapping 500 K array in the SHARe project. The CHS is a population-based longitudinal study with the main objective of investigating risk factors contributing to heart diseases. It was initiated by recruiting an original cohort of 5221 mainly Caucasian participants who were older than 65 years and had not been institutionalized. Later, a new cohort of 687 participants, predominantly African-Americans, was added to the study. Of these, 3989 and 803 individuals were genotyped by Illumina’s Human CNV370-Duo and Human Omni1-Quad arrays, respectively, in the STAMPEED project. The HRS is a population-based longitudinal study launched to provide age-related health and economic information on more than 20,000 individuals older than 50 years. The HRS makes use of administrative records such as Social Security and Medicare claims to gather information of interest about participants. The study was expanded in 2006 to include a biomarker and genetic component in which 12,595 individuals, predominantly Caucasian, were genotyped by Illumina’s Human Omni2.5-Quad array. Our study focused on people of Caucasian ancestry from the four aforementioned studies to increase the sample size and power of the analyses. The LOADFS and FHS datasets directly identify cases with Alzheimer’s disease and unaffected controls. For the CHS and HRS datasets, the International Classification of Disease codes, ninth revision (ICD-9) were used to define cases and controls. Finally, to make the four datasets comparable in terms of participants age, we only included the original and offspring cohorts from the FHS dataset. Demographic information about the cohorts included in our study is presented in Table 1. Also, Additional file 1: Table S1 lists the numbers of cases and controls in these cohorts.
Table 1

Demographic information about the four cohorts under consideration

CohortTotalFemale%Case%AgeCase (SD)AgeControl (SD)
LOADFS371662.4349.7885.93 (8.39)71.19 (11.53)
FHS440954.779.3779.85 (8.49)62.77 (11.65)
CHS319760.286.1674.76 (5.36)73.01 (5.57)
HRS615857.314.5680.44 (6.71)73.69 (7.85)

Female% percentage of females in cohort, Case% percentage of patients with Alzheimer’s disease in cohort Age average age, SD standard deviation, LOADFS Late-Onset Alzheimer’s Disease Family Study from the National Institute on Aging, FHS Framingham Heart Study, CHS Cardiovascular Health Study, HRS University of Michigan Health and Retirement Study

Demographic information about the four cohorts under consideration Female% percentage of females in cohort, Case% percentage of patients with Alzheimer’s disease in cohort Age average age, SD standard deviation, LOADFS Late-Onset Alzheimer’s Disease Family Study from the National Institute on Aging, FHS Framingham Heart Study, CHS Cardiovascular Health Study, HRS University of Michigan Health and Retirement Study

Imputation of genotype data

Since the four datasets of interest were genotyped using different platforms, imputation was conducted to generate a common set of 2,928,658 SNPs. Only autosomal SNPs were subject to imputation. Genome coordinates of SNPs in our data (NCBI build 38/UCSC hg38) were lifted over to NCBI build 37/UCSC hg19 using LiftOver software [27]. After removing duplicate SNPs, preimputation quality control (QC) was performed using PLINK software [28] to remove low-quality SNPs/subjects by setting the following QC criteria: minor allele frequency < 0.01, SNPs and subject call rates < 95%, and Hardy–Weinberg p < 1E–06. For the LOADFS and FHS cohorts that have family-based designs, a Mendel error rate of 2% was set to remove SNPs and subjects/families with high Mendelian errors. The SHAPEIT2 (i.e., Segmented Haplotype Estimation and Imputation Tool) package [29] was used to ensure that alleles were aligned to the same DNA strand in our and the reference data. Haplotype phasing was then conducted using SHAPEIT2 to estimate the haplotypes for subjects in each dataset. Finally, genotypes were imputed by Minimac3 software [30] over prephased haplotypes. SHAPEIT2 and Minimac3 were run using default values for input arguments and European population (EUR) haplotypes from 1000 Genomes Phase 3 data (release October 2014) as the reference panel.

Postimputation QC

Directly genotyped SNPs along with the imputed SNPs, for which the squared correlation (r2) between imputed and expected true genotypes was > 0.7, were selected for preanalysis QC. This step was performed based on the same criteria explained earlier for preimputation QC. Additional file 1: Table S2 contains information on the numbers of genotyped and imputed SNPs that remained in each of the four datasets of interest after QC.

Population structure

The top 20 principal components (PCs) of genotype data were obtained through principal component analysis (PCA) to be included in downstream genetic analyses to address potential population stratification. In each dataset, PCA was performed over a subset of unrelated individuals and a subset of SNPs that were not in high linkage disequilibrium (LD) measured by r2 [31]. KING (i.e., Kinship-based Inference for Genome-wide association studies) software [32] was used to obtain the subset of unrelated subjects by keeping one subject per family or relative cluster whose identity-by-descent (IBD) was > 0.0884 (i.e., closer than third-degree relatives). The genotyped autosomal SNPs on each chromosome were then pruned by PLINK software [28] in an unrelated set of subjects such that no SNP pairs with r2 > 0.2 were kept within any 100-SNP windows. PCA was then conducted over the selected low-LD SNPs with the GENESIS R package [33, 34]. Additional file 1: Table S3 contains genomic inflation factors (λ values) resulting from logistic regression models for the four datasets under consideration. The λ values were less than 1.1 in all cases, indicating a subtle impact of population structure on our analyses [35, 36].

Genetic analysis

GWA analysis

The associations between SNPs and AD were investigated by fitting logistic regression models. The genetic analyses of each dataset were performed under three alternative plans analyzing the entire sample, only males, and only females. The top five PCs and subject’s birth cohort (i.e., birth year) were included in the models as fixed-effects covariates. In addition, sex was considered a fixed-effect covariate under plan 1. Only additive genetic effects were modeled; dominance effects were ignored. The birth cohort is a proxy for the age and environmental exposures which are characteristic for a cohort. Thus, this adjustment controls for age and overtime trends in the incidence of AD. The logistic models were fitted using PLINK software (v1.07) [28]. It was previously suggested that for samples with a family-based design, ignoring family relationships would not generate considerable bias in effect sizes of SNPs but may increase type I error rates whose magnitude depends on pedigree complexity (e.g., nuclear family vs extended family) and trait heritability. For instance, the inflation of type I error rates has been suggested to be trivial in datasets with simple pedigrees. On the other hand, type I error rates may increase by a factor of 2–3 when family structure is ignored in a dataset with an extended family pedigree and trait heritability values of 0.6–0.9. Therefore, a two-step screening–validating approach could be used with such datasets to prevent inflation of type I error rates and decrease the computational burden of analysis [37]. For the LOADFS and FHS cohorts, we adopted a two-step approach in which the SNPs with p < 0.05 in the logistic models explained earlier were subjected to fitting generalized linear mixed models (GLMMs) by including all aforementioned fixed-effects covariates along with family IDs as a random-effects covariate. GLMMs were fitted using the lme4 R package [33, 38]. All GWA analyses were conducted in a discovery–replication manner. Each of the LOADFS, FHS, CHS, and HRS datasets was considered a discovery set to detect SNPs in significant associations with AD. Results from the discovery stage in a particular dataset were then subject to further replication in the remaining three datasets. At the discovery stage, a genome-wide significance level of p < 5E–08 was set to select statistically significant associations, and SNPs with p values between 5E–08 and 5E–06 were considered suggestive AD-associated markers. These significance levels are widely accepted by genome-wide association studies in order to decrease the type I error rate (i.e., false-positive findings) due to multiple testing issues arising from investigating associations of millions of SNPs [39, 40]. A Bonferroni-corrected significance threshold of 0.0167 (i.e., 0.05/3, where 3 is the number of replication datasets for validating any significant association signals from a discovery dataset) was considered at the replication stage. Finally, a conventional fixed-effects meta-analysis, using the inverse variance method, was conducted over the results under each plan from the four investigated datasets to obtain combined statistics for the tested SNPs. To avoid missing heterogeneous associations of opposite directions of effects, we also performed a meta-analysis on absolute values of coefficients in addition to the conventional meta-test. The results from the meta-analysis on absolute values of coefficients were used just as an additional piece of information to determine how heterogeneous effects in different cohorts can affect the results of a conventional inverse-variance meta-analysis. The meta-analysis results were interpreted according to the significance level at the discovery phase. The meta-analysis was performed using GWAMA (i.e., Genome-Wide Association Meta-Analysis) software [41]. Also, for SNPs that had significant p values only in males or females (i.e., plans 2 or 3), a Wald χ2 statistic with 1 degree of freedom was calculated according to the following formula [42] to investigate whether their odds ratios were significantly different between the two sexes: where bm and bf are the coefficients (i.e., the natural logarithm of odds ratios) for any SNP in males and females, respectively, and sem and sef are their corresponding standard errors. The significant findings from GWA analyses were compared to previous studies using the GRASP (i.e., Genome-Wide Repository of Associations Between SNPs and Phenotypes) search tool (v2.0.0.0) [43]. Also, LD between significant SNPs and previously detected AD-associated loci in their 1-Mb flanking regions (r2 ≥ 0.4 or significant p value from χ2 test for LD) was investigated in the CEU population (i.e., Utah Residents with Northern and Western European Ancestry) through the HaploR R package [33, 44] and the LDlink web-tool [45]. The genes coordinate’s list provided by PLINK [28] was used to find the closest genes of the significant SNPs. The chromosomal regions (i.e., cytogenetic bands) were determined using the annotation database from UCSC Genome Browser [46].

Gene-based analysis

Under each of three aforementioned plans, gene-based analysis was performed over the meta-analysis results using the fastBAT (i.e., Fast set-Based Association Test) method [47] implemented in the GCTA (i.e., Genome-wide Complex Trait Analysis) package (v1.26.0) [48]. This method combines z-statistics for a set of SNPs corresponding to each gene into a quadratic form of a multivariate normal variable. SNPs located within a gene or its 50 kb upstream/downstream regions were considered as an SNP set for that gene. The HRS dataset was used as the reference panel for LD calculation (i.e., r2 metric) in order to remove one of each pair of SNPs with r2 > 0.9 from any given set. To deal with multiple-testing issues, the false discovery rate (FDR) method suggested by Benjamini and Hochberg [49] was used to rank and select significant findings. Genes with significant p values at the FDR level of 0.05 were considered novel AD-associated ones if there were no SNPs with p < 5E–08 in their 1-Mb upstream/downstream regions in the current or previous studies.

Pathway-based analysis

A pathway-based analysis was also performed using the fastBAT method using the pathways predefined by the REACTOME pathway knowledgebase [50] and PID (i.e., the Pathway Interaction Database) [51]. These were provided by the molecular signatures database (MSigDB) at the Broad Institute gene set enrichment analysis (GSEA) website [52, 53]. Here, a SNP set corresponding to a particular pathway was defined as the SNPs within 50 kb of the genes in that pathway. As with the gene-based analysis, the HRS cohort was used to prune the SNP sets based on the pairwise LD measures of SNPs. The significant results were interpreted at the FDR levels of 0.05 (plans 1 and 2) and 0.025 (plan 3) to ensure that the number of possible false-positives was < 1 under each analysis plan.

TWA analysis

Results from conducted meta-analyses along with summary data from a publicly available expression quantitative trait loci (eQTLs) study on peripheral blood [54] were used to perform a transcriptome-wide association analysis using SMR (i.e., Summary-data-based Mendelian Randomization) software (v0.68) [55]. The eQTLs summary data were downloaded from the SMR software website. Both cis-eQTLs and trans-eQTLs were of interest. Trans-eQTLs were defined as eQTLs located at least 5 Mb away from a probe on the same chromosome or located on other chromosomes. Probes for which at least one eQTL with p < 5E–08 had been detected by Lloyd-Jones et al. [54] were included in our analyses provided that the corresponding eQTLs were among the genotyped or imputed SNPs in our study. This resulted in the inclusion of sets of up to 8257 probes with cis-eQTLs and 2763 probes with trans-eQTLs. The significance of p values resulting from SMR testing (i.e., PSMR) was interpreted at an FDR level of 0.025–0.05. The appropriate FDR level for each of three analysis plans was chosen so we can ensure that the number of possible false-positive findings among significant probes was < 1. To identify the pleiotropic effects of SNPs on gene expression levels and AD development, probes with significant PSMR values were then subject to heterogeneity testing (i.e., the HEIDI test) which can differentiate pleiotropy from linkage [55, 56]. Genes corresponding to probes that passed both the SMR and HEIDI tests (i.e., significant PSMR and PHEIDI ≥ 0.05) were deemed significant as their expression profiles might be associated with AD because of the pleiotropic effect of a single variant that affects both probe expression and AD susceptibility. Selected genes were considered potentially novel AD genes if there were no SNPs with p < 5E–08 within their 1-Mb upstream/downstream regions in the current or previous studies. Finally, we also performed TWA analyses using summary results from a publically available tissue-specific eQTLs study [57] which contains eQTLs data for several regions of the brain, including the amygdala, anterior cingulate cortex (BA24), basal ganglia (e.g., caudate, nucleus accumbens, and putamen), cerebellar hemisphere, cerebellum, cortex, frontal cortex (BA9), hippocampus, hypothalamus, and substantia nigra. Once again, probes that had significant eQTLs with p < 5E–08 were included in our analyses. This resulted in the inclusion of sets of 597–3566 probes with cis-eQTLs (based on the brain region). The results of brain-specific TWA analyses were interpreted at a FDR level of 0.05.

URLs

dbGaP: https://www.ncbi.nlm.nih.gov/gap; GCTA: http://cnsgenomics.com/software/gcta/#Overview; GENESIS R Package: https://bioconductor.org/packages/release/bioc/html/GENESIS.html; GRASP: https://grasp.nhlbi.nih.gov/Search.aspx; GSEA: http://software.broadinstitute.org/gsea/index.jsp; GWAMA: https://www.geenivaramu.ee/en/tools/gwama; HaploR R package: https://cran.r-project.org/web/packages/haploR/index.html; KING: http://people.virginia.edu/~wc9c/KING/; LDlink: https://ldlink.nci.nih.gov/?tab=home; LiftOver: https://genome.ucsc.edu/cgi-bin/hgLiftOver; Lme4 R Package: https://cran.r-project.org/web/packages/lme4/index.html; Minimac3: https://genome.sph.umich.edu/wiki/Minimac3; PLINK: http://zzz.bwh.harvard.edu/plink/index.shtml; SHAPEIT: https://mathgen.stats.ox.ac.uk/genetics_software/shapeit/shapeit.html; SMR: http://cnsgenomics.com/software/smr/#Overview; 1000 Genomes: http://www.internationalgenome.org/data/; https://mathgen.stats.ox.ac.uk/impute/1000GP_Phase3.html

Results

GWA analysis

GWA analyses were performed in four independent datasets (i.e., LOADFS, FHS, CHS, and HRS). Each of these datasets served as a discovery set to detect SNPs with significant association signals (at either a genome-wide significance level of p < 5E–08 or a suggestive level between 5E–08 and 5E–06), which were then subject to further replication (at the significance level of 0.0167) in the other three datasets. These analyses provided replicated and nonreplicated sets of SNPs. Finally, results from the individual datasets were combined through meta-analysis and interpreted according to the significance level at the discovery phase. Additional file 1: Tables S4–S12 provide an overview of replicated, nonreplicated, and meta-analysis sets of SNPs that were significantly associated with AD in males and females combined (plan 1) or males and females separately (plans 2 and 3). As seen in these tables, most of the newly detected AD-associated SNPs, particularly those in nonreplicated and meta-analysis sets, had significant p values only in one of the three study plans. For instance, among 44 and 72 newly detected SNPs in males and females, 36 and 51 SNPs had sex-specific significant p values, respectively. Additional file 1: Figures S1–S6 show the Manhattan and QQ plots of the GWA results in the four investigated datasets, as well as in the conducted meta-analyses under these three plans. In general, SNPs with p values smaller than the genome-wide significance threshold were mostly located on chromosome 19.

Replicated sets of SNPs

The replicated sets of SNPs under plans 1–3 contained 31, 20, and 23 SNPs, respectively (Additional file 1: Tables S4–S6). These SNPs had significant p values at the genome-wide level or a suggestive level of associations at the discovery stage and were then replicated in another dataset. Additional files 2, 3, and 4 contain detailed information (e.g., allele frequencies, odds ratios (ORs), p values, etc.) about the replicated SNPs in the four tested datasets under the three analysis plans. Notably, 12, 8, and 8 replicated SNPs, respectively, had not been previously associated with AD. The other SNPs had some evidence of direct association signals [43]. Among previously detected SNPs, rs9882471 (plan 2) was nominally associated with AD in previous studies (5E–06 ≤ p < 5E–02) [58]. Most of the newly detected SNPs were located inside a previously well-known susceptibility region for AD on chromosome 19q13 (i.e., APOE cluster gene region) and were mostly significant under different analysis plans. This subset of newly detected SNPs mostly had p < 5E–08, the same directions of effects in discovery and replication datasets, and significant p values (at genome-wide or suggestive levels of significance) in the meta-analysis. Table 2 summarizes information about the four newly detected SNPs located outside the chromosome 19q13 region. Among these SNPs, rs62402815 was significant under plan 1 (i.e., males and females) and plan 3 (i.e., only females); and rs9918162 and rs726411 were significant only in males (i.e., plan 2). Their association signals were significant only at the suggestive level of associations (except rs62402815, which had a genome-wide level significant p = 1.2E–08 in females) in the discovery stage. The two SNPs that were significant in males did not have p < 5E–06 in conventional fixed-effects meta-analyses, which might be partially due to the heterogeneity of their effects across different datasets. These heterogeneous effects were reflected by high i2 inconsistency metrics and significant Q-statistics in Cochran’s heterogeneity test (Pq < 0.05). A meta-analysis based on the absolute values of the coefficients confirmed a substantial role of heterogeneity by providing smaller p values for most of these SNPs.
Table 2

Newly detected replicated and meta-analysis sets of significant SNPs located outside chromosome 19q13 region

ChromosomeClosest geneSNPPositionA1Sig? P min EffectsFreqOR (se) P meta P q i 2 P abs Proxy?Gene?Region?
Plan 1—males and females
 6p22.3 LOC101928519 rs62402815a19,350,484GNYYN1.94E–07– – – –0.9410.690 (0.050)1.80E–061.41E–040.8531.80E–06NNN
 7q22.1 TRIM56 rs10953322101,103,377GYYNN2.89E–03– – – –0.8710.758 (0.043)4.65E–067.84E–0104.66E–06NNY
 9p22.3C9orf92rs496166416,161,235TYNNY3.45E–03– – – –0.7110.811 (0.035)4.44E–068.14E–0104.44E–06NNN
 9p13.2 PAX5 rs228207937,036,250GY?NN6.49E–05– ? – –0.9590.595 (0.06)3.81E–064.48E–0103.81E–06NNN
 11p15.5 AP2A2 rs10794342924,904CYNYY2.46E–04– – – –0.4560.824 (0.033)4.45E–061.19E–010.4884.42E–06YYY
 13q33.3 MYO16 rs9555561109,152,426CYNNN6.99E–05+ + + +0.7271.249 (0.056)2.53E–064.64E–0102.54E–06NYY
 13q33.3 MYO16 rs912322109,155,938AYNNY5.44E–05+ + + +0.7331.260 (0.057)1.29E–064.56E–0101.29E–06NYY
 17q12 LHX1 rs807011436,817,647ANNNY6.07E–04– – – –0.9490.675 (0.053)3.58E–066.06E–0103.58E–06NNN
 17q12 LHX1 rs149719736,819,274ANNNY7.69E–04– – – –0.9490.668 (0.053)2.96E–064.79E–0102.98E–06NNN
 18q12.1 MIR302F rs3524277230,214,382CYYNY3.31E–03+ + + +0.3561.231 (0.051)1.31E–068.42E–0101.31E–06NNY
 21q21.2 LINC00158 rs7625296925,277,446GYYNN1.12E–04– – – –0.9710.594 (0.061)4.86E–063.47E–010.0924.86E–06NNN
 21q21.3 LINC00515,MRPL39 rs229836925,583,969CYYYN6.02E–06– – – +0.6050.802 (0.033)3.67E–071.81E–040.8491.43E–10NNY
Plan 2—only males
 5q15 LNPEP rs9918162a96,987,845TYNYN4.66E–06+ – – –0.9660.958 (0.138)8.00E–012.95E–080.9212.27E–06NNN
 8q24.22 ADCY8 rs726411a130,734,543GNYNY2.16E–06– – + –0.9390.621 (0.071)2.45E–045.65E–040.8281.65E–05NYY
 3p14.1 KBTBD8 rs986284966,855,351CYNNY1.12E–03– – – –0.9000.603 (0.059)2.66E–067.53E–0102.65E–06NNY
 23q21.31 KLHL4 rs596911787,181,248CYN?N1.67E–05+ + ? +0.2951.490 (0.114)1.42E–068.61E–0101.42E–06NNY
Plan 3—only females
 6p22.3 LOC101928519 rs62402815a,b19,350,484GNYYN1.20E–08– – – –0.9410.610 (0.054)4.29E–075.89E–040.8274.31E–07NNN
 2p13.3 ANTXR1 rs756120769,138,666AY??N7.35E–05– ? ? –0.0550.470 (0.066)4.08E–069.22E–0104.08E–06NNN
 4p16.2 STK32B rs176756405,095,813GYYYN7.90E–04– – – –0.6670.769 (0.040)1.89E–064.52E–0101.89E–06NYY
 4p16.2 STK32B rs68387925,096,839CYYYN7.04E–05– – – –0.6170.772 (0.040)1.61E–061.17E–010.4911.61E–06NYY
 4p16.2 STK32B rs8956815,099,404TYYYN1.23E–04– – – –0.6170.776 (0.040)2.72E–061.41E–010.4502.72E–06NYY
 6p21.33 TNXB rs1196975932,053,353CNNYY3.65E–04– – – –0.9420.616 (0.057)2.10E–063.86E–010.0122.10E–06NNY
 6p21.33 TNXB rs1094723032,056,618CNNYY3.19E–04– – – –0.9410.619 (0.057)2.33E–064.36E–0102.34E–06NNY
 6p21.33 TNXB rs777419732,078,498ANNYY3.68E–04– – – –0.9410.627 (0.058)4.17E–063.47E–010.0924.20E–06NNY
 9q22.2 SYK rs117292290,726,252ANYNY5.27E–04+ + + +0.1181.408 (0.098)4.56E–061.14E–010.4954.46E–06NNN
 12q24.33 SFSWAP rs73156187131,542,412GYNYN1.35E–04– – – –0.8940.695 (0.051)4.47E–062.08E–010.3414.45E–06NNY
 12q24.33 SFSWAP rs7963314131,573,284GYNYN6.65E–04– – – –0.8810.700 (0.049)2.72E–064.06E–0102.73E–06NNY
 21q21.3 MIR155HG rs12386284b25,517,756TYYN?3.61E–03+ + + ?0.2491.367 (0.087)4.55E–069.24E–0104.54E–06NNY
 21q21.3 MIR155HG rs178301225,547,104TYYN?2.11E–03– – – ?0.7480.732 (0.047)4.59E–069.40E–0104.59E–06NNY
 21q21.3 MIR155HG rs178301325,547,257TYYN?2.11E–03– – – ?0.7480.733 (0.047)4.65E–069.49E–0104.65E–06NNY
 21q21.3 MIR155HG rs92696325,547,744TYYN?2.11E–03– – – ?0.7480.732 (0.047)4.75E–069.49E–0104.75E–06NNY
 21q21.3 MIR155HG rs189365025,568,503TYYNN2.32E–03+ + + +0.2471.320 (0.075)3.94E–065.41E–0103.99E–06NNY
 21q21.3 MIR155HG rs222632625,569,648AYYNN2.57E–03+ + + +0.2451.319 (0.075)4.38E–066.34E–0104.38E–06NNY
 21q21.3 MIR155HG rs2829803b25,575,998GYYNN2.39E–03+ + + +0.2471.319 (0.075)4.09E–065.40E–0104.10E–06NNY
 21q21.3 LINC00515,MRPL39 rs2298369b25,583,969CYNYN6.79E–05– – – +0.6060.754 (0.039)3.11E–071.77E–020.7032.21E–07NNY
 21q21.3 MRPL39 rs282982325,599,076AYYNN2.37E–03+ + + +0.2451.320 (0.075)4.06E–065.96E–0104.07E–06NNY
 21q21.3 MRPL39 rs282983225,601,939TYYNN2.47E–03+ + + +0.2451.318 (0.075)4.68E–066.04E–0104.65E–06NNY

SNP single-nucleotide polymorphism, Chromosome chromosomal region based on cytogenetic bands, Position position of SNP based on Human Genome version 38 (hg38), A1 effect allele, Sig? if SNP had p < 0.0167 in LOADFS, FHS, CHS, and HRS datasets, respectively (Yes, No, Missing), P minimum p value detected for SNP in aforementioned datasets, Effects direction of SNP’s effects in aforementioned datasets (Positive, Negative, Missing), Freq frequency of effect allele based on meta-analysis, OR (se) odds ratio and its standard error based on meta-analysis, P p value of SNP in meta-analysis, P p value of Q-statistics (Cochran’s heterogeneity test), i, inconsistency metric, P p value of SNP in meta-analysis on absolute values of effect sizes (i.e., β coefficients), Proxy? if SNP is in linkage disequilibrium with any previously detected AD-associated loci whose p value is less than that detected in this study (Yes, No), Gene? if previous studies have detected AD-associated SNPs with p < 5E–06 in closest gene to detected SNP (Yes, No), Region? if previous studies have detected AD-associated SNPs with p < 5E–06 in chromosomal region in which detected SNP is located (Yes, No), Y yes, N no, AD Alzheimer’s disease

aReplicated SNPs

bSNP did not have significant sex-specific effects. All SNPs that were significant only in males or females also had significant sex-specific effects except rs62402815, rs12386284, rs2829803, and rs2298369

Newly detected replicated and meta-analysis sets of significant SNPs located outside chromosome 19q13 region SNP single-nucleotide polymorphism, Chromosome chromosomal region based on cytogenetic bands, Position position of SNP based on Human Genome version 38 (hg38), A1 effect allele, Sig? if SNP had p < 0.0167 in LOADFS, FHS, CHS, and HRS datasets, respectively (Yes, No, Missing), P minimum p value detected for SNP in aforementioned datasets, Effects direction of SNP’s effects in aforementioned datasets (Positive, Negative, Missing), Freq frequency of effect allele based on meta-analysis, OR (se) odds ratio and its standard error based on meta-analysis, P p value of SNP in meta-analysis, P p value of Q-statistics (Cochran’s heterogeneity test), i, inconsistency metric, P p value of SNP in meta-analysis on absolute values of effect sizes (i.e., β coefficients), Proxy? if SNP is in linkage disequilibrium with any previously detected AD-associated loci whose p value is less than that detected in this study (Yes, No), Gene? if previous studies have detected AD-associated SNPs with p < 5E–06 in closest gene to detected SNP (Yes, No), Region? if previous studies have detected AD-associated SNPs with p < 5E–06 in chromosomal region in which detected SNP is located (Yes, No), Y yes, N no, AD Alzheimer’s disease aReplicated SNPs bSNP did not have significant sex-specific effects. All SNPs that were significant only in males or females also had significant sex-specific effects except rs62402815, rs12386284, rs2829803, and rs2298369 Also, rs62402815 and rs726411 had the same direction of effects in the discovery and replication datasets. The directions of effects of rs9918162 were opposite in the discovery and replication sets. While genetic variants that have the same direction of effects in multiple independent cohorts are generally of more interest, those with opposite effects can be important as well because they may be indicative of the genetic heterogeneity of the studied trait in different cohorts arising, for example, from the epistasis or differences in LD patterns [59-61]. Although no evidence of direct association with AD was found in previous studies for the newly detected subsets of replicated SNPs, their 1-Mb upstream/downstream regions harbor AD-associated SNPs. We therefore investigated their LD with AD-associated loci in their 1-Mb flanking regions in the CEU population [45]. Newly detected SNPs were considered informative AD markers if their p values were smaller than those of the top AD-associated SNPs in their neighborhood or they were not in LD with previously AD-associated loci whose p values were smaller than those detected in this study. Additional file 1: Table S13 contains LD information about those newly detected SNPs for which proxy AD-associated loci have been reported. As seen in Additional file 1: Table S13, all newly detected SNPs on chromosome 19q13 had larger p values than the top AD-associated loci in their neighborhood and were in LD with them. Therefore, they were likely to relay the same information as their neighboring AD-associated SNPs. On the other hand, the p values of SNPs located outside the chromosome 19q13 region were mostly smaller than the previously detected association signals in their flanking regions and were not in LD with such loci. As seen in Table 2, among the closest genes to these SNPs, only ADCY8 (corresponding to rs726411 located in the 8q24.22 region) was associated with AD in previous GWAS at a suggestive level of associations (rs263238 with p = 2.40E–06 [62]). In addition, none of the chromosomal regions (i.e., cytogenetic bands) in which other SNPs are located contained any previously AD-associated SNPs with p < 5E–06 [43]. Detailed information about the genes and chromosomal regions corresponding to the newly detected SNPs that contain previously AD-associated SNPs can be found in Additional files 2, 3, and 4.

Nonreplicated sets of SNPs

Additional file 1: Tables S7–S9 (corresponding to plans 1–3) show that 54, 40, and 46 SNPs had significant p values at genome-wide or suggestive levels of associations in only one of the four datasets of interest. Most of them were newly detected (41, 33, and 40 SNPs, respectively), as there was no evidence of their direct association with AD in previous studies [43]. Also, they were mostly plan specific and demonstrated evidence of sex disparity. Most were located in chromosomal regions other than 19q13 and were significant at a suggestive level of associations. Detailed information about nonreplicated sets of SNPs (e.g., allele frequencies, ORs, p values, etc.) can be found in Additional files 2, 3, and 4. Of those SNPs previously associated with AD, rs11038106, rs9597722, rs723804, rs17697225 [63], rs2065706 [64] (plan 1), rs4679840 [58] (plan 2), and rs1359176 [65] (plan 3) were only nominally significant (5E–06 ≤ p < 5E–02) in previous studies. Once again, SNPs located outside the chromosome 19q13 region either had smaller p values than previously detected AD-associated loci in their proximity or were not in LD with them, except for rs34779859 on chromosome 2 (plan 3) which was significant in females. LD information about those newly detected SNPs for which proxy AD-associated loci have been previously identified can be found in Additional file 1: Table S13. Among the closest genes to newly detected SNPs outside the chromosome 19q13 region, BIN1, FRMD4A, and CDH4 that were significant under plan 3 were previously associated with AD with p < 5E–06 (rs744373 with p = 2.60E–14 [66], rs7921545 with p = 5.40E–07 [67], and rs4925189 with p = 6.30E–07 [68], respectively). Also, several other genes were located in AD-associated chromosomal regions. Information about these genes/regions is summarized in Additional files 2, 3, and 4.

Meta-analysis sets of SNPs

Additional file 1: Tables S10–S12 show that 17, 4, and 24 SNPs that were not among replicated or nonreplicated sets of significant SNPs under analysis plans 1–3 passed the significance threshold in the meta-analysis. Additional files 2, 3, and 4 summarize the GWA results for these SNPs. The meta-analysis p values of these SNPs were mostly significant at the level of suggestive associations, except for rs76366838, rs115881343 (plan 1), rs73048293, rs57537848, and rs76366838 (plan 3) on chromosome 19q13 which had p < 5E–08. Also, they were mostly located outside chromosome 19q13 and were plan specific (i.e., they were not among replicated, nonreplicated, or meta-analysis sets of significant SNPs under other plans). For example, significant SNPs in males were not significant in females and vice versa. In addition, most SNPs (14, 3, and 24 SNPS under plans 1–3, respectively) were not associated with AD in previous studies [43]. Summary information about the newly detected subset of meta-analysis sets of SNPs that were outside chromosome 19q13 is presented in Table 2. As with the replicated and nonreplicated sets of SNPs, most of the newly detected SNPs not on chromosome 19q13 had smaller p values than the ones reported for their nearby AD-associated loci or were not in LD with them. These SNPs, therefore, were considered novel and informative AD markers. On the other hand, proxy AD-associated SNPs were found for all newly detected SNPs that were located on chromosome 19q13 (Additional file 1: Table S13). As seen in Table 2, among the closest genes to newly detected SNPs outside the chromosome 19q13 region, AP2A2 (corresponding to rs10794342), MYO16 (corresponding to rs9555561 and rs912322 in the 13q33.3 region) and STK32B (corresponding to rs17675640, rs6838792, and rs895681 in the 4p16.2 region) were previously associated with AD with p < 5E–06 (rs17393344 with p = 1.70E–08; and rs78647349 with p = 5.20E–07, respectively [69]). In addition, several chromosomal regions including 3p14.1 (KBTBD8), 6p21.33 (TNXB), 7q22.1 (TRIM56), 12q24.33 (SFSWAP), 18q12.1 (MIR302F), 21q21.3 (MIR155HG, LINC00515, and MRPL39), and 23q21.31 (KLHL4) were associated with AD at a suggestive level of associations by previous GWAS. However, no AD-associated SNPs with p < 5E–06 have been previously detected in chromosomal regions corresponding to C9orf92, PAX5, LHX1, and LINC00158 genes (i.e., 9p22.3, 9p13.2, 17q12, and 21q21.2, respectively) that were significant under plan 1; and ANTXR1 and SYK genes (i.e., 2p13.3 and 9q22.2, respectively) [43]. Detailed information about these AD-associated genes and chromosomal regions is provided in Additional files 2, 3, and 4.

Nominally significant sets of SNPs

Under each of the three analysis plans, there were several SNPs associated with AD at a nominal level of significance (5E–06 ≤ p < 5E–02) in all datasets they were present in. They were mostly present in three datasets as they were excluded from one dataset by the QC procedure. These SNPs (30, 28, and 28 SNPs under plans 1–3, respectively) are listed in Additional files 2, 3, and 4. Although they did not have highly significant p values, they are reported here due to the consistency in their association signals that was observed in multiple tested datasets. With the exception of rs575088, which had nominally significant p values in all datasets under plans 1 and 3, the significance pattern of the other SNPs was observed under only one plan. Also, rs2282079 (detected in females) was among the meta-analysis set of SNPs under plan 1 as well. None of these SNPs had p < 5E–06 in the conducted meta-analyses. The lack of meta-analysis power could be due to the small sample size, weak association signals, absence of some SNPs in one dataset, or heterogeneous effects of some SNPs across the different datasets as evidenced by their high i2 values, significant Q tests, and smaller p values in meta-analysis on absolute values of coefficients. The SNPs whose associated signals were reported here for the first time were not in LD with previously detected AD-associated loci (p < 5E–06) in their 1-Mb flanking regions (Additional file 1: Table S13). Interestingly, 22 out of 28 SNPs detected in males had the opposite pattern of significance in females (i.e., p > 0.05 in all datasets). Also, 26 out of 28 SNPs detected in females had the opposite pattern of significance in males (Additional file 5). Not all SNPs with opposite patterns of significance in females-only vs males-only analyses had the same pattern in the meta-analysis. Closest genes to some of these SNPs were located in chromosomal regions that were previously associated with AD with p < 5E–06. Information about these genes/regions can be found in Additional files 2, 3, and 4.

Adjustment by APOE SNPs

For the AD-associated SNPs that were located on chromosome 19, we further investigated whether their association signals may change after adjustment for APOE genotypes in the models. For each subject, the APOE genotype was determined based on its genotypes at rs429358 and rs7412 loci using the coding schema provided in Additional file 1: Table S14. We found that none of the tested SNPs had p < 5E–06 once APOE was added as a covariate to the models. Additional file 1: Table S15 summarizes the information regarding the LD between SNPs detected in our study and APOE SNPs. Among newly AD-associated SNPs, only six SNPs were in LD with one or both of the APOE SNPs. Others were not in LD with the two APOE SNPs (i.e., r2 = 0.001–0.072) [45]. Therefore, it should be noted that despite a major impact of the APOE genotypes on the associations of other SNPs inside the chromosome 19q13 region with AD, this result would not automatically imply that the APOE SNPs (i.e., rs429358 and rs7412) are the only contributors to AD pathogenesis because APOE-adjusted models highlighted the statistical correlations rather than biological (i.e., genetic) linkage. Further analyses such as those examining the role of haplotypes and epistatic interactions would be helpful to more comprehensively dissect the genetic heterogeneity of this region, and to elucidate the biological relevance of the APOE-adjusted models [70].

Sex-specific effects

We also investigated the sex-specific effects of SNPs that were significantly associated with AD only in males or females by performing a Wald χ2 test to determine whether their odds ratios were significantly different between males and females. Additional file 1: Tables S16 and S17 summarize the results from this test for replicated, nonreplicated, and meta-analysis sets of AD-associated SNPs. We found that the differences between odds ratios of the SNPs in males and females were significant (p < 0.05) in most cases, except rs62405605, rs1062851, rs62510850, rs7000333, rs6572843 (among nonreplicated set of SNPs in females), and rs12386284 (among meta-analysis set of SNPs in females). Detailed information about the results from the Wald χ2 test can be found in Additional file 6. In addition, the SNPs that had significant p values only in males or females were searched against the GRASP catalog [43] to find out whether they were among the known sex-linked autosomal SNPs or were associated with any other diseases/traits at suggestive level of associations. We noticed that there was no evidence of such associations in previous studies.

Gene-based analysis

The significant findings from gene-based analyses corresponding to plans 1–3 are summarized in Table 3. Under all plans, most genes with significant p values at the FDR of 0.05 were located in the chromosome 19q13 region. Since the chromosome 19q13 region harbors several SNPs with p < 5E–08 in both current and previous studies, significant genes in this region are not discussed here as they do not meet the criteria set for detecting novel AD genes. The only significant genes outside the APOE cluster region were LINC00158 under plan 1 and LINC00158, MIR155HG, MIR155, LINC00515, MRPL39, and JAM2 under plan 3 that were located in the chromosome 21q21.3 region. None of the SNPs inside or within 1-Mb flanking regions of these genes had significant p values at the genome-wide level in our study, although several had suggestive-level p values in conducted meta-analyses under plans 1 and 3. Also, SNPs in 1-Mb nearby regions of these genes were only nominally associated with AD (8.0E–04 < p < 5E–02) in previous studies [58, 65, 71–73]. However, the chromosome 21q21.3 region was associated with AD by previous GWAS at a suggestive level of associations (rs239713 with p = 5.00E–07 [68]). This SNP is located ~ 1.6 Mb away from significant genes reported in our study.
Table 3

Significantly AD-associated genes from gene-based analyses

GeneChromosomal regionStartEndNumber of SNPsPlan 1 (males and females)Plan 2 (only males)Plan 3 (only females)
χ 2 p value χ 2 p value χ 2 p value
CEACAM16 19q13.3244,699,15044,710,71483330.477.52E–06NSNSNSNS
BCL3 19q13.3244,748,72044,760,04463274.813.95E–06NSNS254.315.44E–06
MIR8085 19q13.3244,758,65644,758,72152235.688.72E–06NSNSNSNS
CBLC 19q13.3244,777,86844,800,64662419.912.02E–10NSNSNSNS
BCAM 19q13.3244,809,05844,821,42151455.391.07E–10NSNS298.415.00E–07
PVRL2 19q13.3244,846,13544,889,228903861.004.74E–721357.124.52E–252304.337.11E–43
TOMM40 19q13.3244,891,21944,903,689733664.749.12E–861288.148.98E–302171.921.26E–50
APOE 19q13.3244,905,78144,909,393703590.184.32E–851262.221.09E–292121.683.58E–50
APOC1 19q13.3244,914,66344,919,349673482.704.76E–871230.831.63E–302049.544.25E–51
APOC1P1 19q13.3244,926,80244,931,386573427.136.60E–931209.891.01E–322008.342.00E–54
APOC4 19q13.3244,942,23744,945,496463063.645.63E–891058.607.90E–311792.113.94E–52
APOC4–APOC2 19q13.3244,942,23744,949,565463063.645.63E–891058.607.90E–311792.113.94E–52
APOC2 19q13.3244,945,98144,949,566422242.858.65E–73760.491.16E–241347.919.14E–44
CLPTM1 19q13.3244,954,58444,993,346472037.763.42E–60700.139.56E–211217.185.03E–36
LINC00158 21q21.325,385,81925,431,70139223.581.20E–06NSNS181.032.27E–05
MIR155HG 21q21.325,562,14425,575,16827NSNSNSNS170.351.52E–05
MIR155 21q21.325,573,97925,574,04422NSNSNSNS152.697.00E–06
LINC00515 21q21.325,582,77425,583,22424NSNSNSNS156.499.36E–06
MRPL39 21q21.325,585,65525,607,48936NSNSNSNS269.624.20E–06
JAM2 21q21.325,639,28125,717,56253NSNSNSNS276.763.02E–05

Genomic coordinates based on Human Genome version 38 (hg38)

AD Alzheimer’s disease, SNP single-nucleotide polymorphism, NS nonsignificant

Significantly AD-associated genes from gene-based analyses Genomic coordinates based on Human Genome version 38 (hg38) AD Alzheimer’s disease, SNP single-nucleotide polymorphism, NS nonsignificant

Pathway-based analysis

We found that 19, 10, and 19 pathways were significantly associated with AD under plans 1–3, respectively (Table 4). The proper FDR levels at which the numbers of possible false-positives were less than 1 were 0.05 under plans 1 and 2, and 0.025 under plan 3. We found that 12 pathways were significant under two or three analysis plans (i.e., they were not plan specific). There were also seven pathways that were significant only under plan 1 (males and females), and seven others were significant only in females (i.e., plan 3). No pathways were specifically significant in males (i.e., plan 2).
Table 4

Significantly AD-associated pathways from pathway-based analyses

PathwayPathway GSEA IDNumber of genesPlan 1 (males and females)Plan 2 (only males)Plan 3 (only females)
χ 2 p value χ 2 p value χ 2 p value
Mitochondrial protein importM590584939.523.83E–442741.074.65E–123622.617.07E–25
Chylomicron-mediated lipid transportM14162164180.441.11E–411968.737.13E–142854.574.86E–25
Nectin adhesion pathwayaM72305872.186.52E–393393.556.09E–094310.191.39E–19
HDL-mediated lipid transportM5056154184.215.08E–382195.671.98E–122779.676.00E–20
Lipoprotein metabolismM3462285221.102.11E–373011.781.86E–103825.743.47E–20
Lipid digestion, mobilization and transportM1023465889.335.64E–363631.174.67E–094448.012.37E–18
E-cadherin stabilization pathwayaM232425976.761.72E–273670.509.69E–054517.863.67E–12
Immunoregulatory interactions between a lymphoid and a nonlymphoid cellM8240706335.344.04E–194056.722.39E–074720.408.48E–11
Adherens junctions interactionsM11980277432.117.54E–185204.261.54E–056110.311.67E–10
Cell–cell junction organizationM820568940.071.34E–176464.132.61E–047758.514.48E–11
Cell junction organizationM192487810,542.401.71E–17NSNS9337.946.78E–11
NF-κB atypical pathwayaM26171630.193.26E–07NSNSNSNS
Platelet sensitization by LDLM91916994.695.38E–04NSNS1002.454.21E–04
Signaling by EGFR in cancerM5631098116.416.44E–04NSNSNSNS
Trans-Golgi network vesicle buddingM539602967.317.13E–04NSNSNSNS
FAS (CD95) signaling pathwayaM94382322.577.15E–04NSNSNSNS
Golgi-associated vesicle biogenesisM1877532758.567.26E–04NSNSNSNS
Signaling by PDGFM20491229443.317.57E–04NSNSNSNS
mRNA processingM25311616183.567.93E–04NSNSNSNS
Prostacyclin signaling through prostacyclin receptorM92619NSNSNSNS1579.244.50E–05
G-protein activationM1311527NSNSNSNS2138.826.18E–05
ADP signaling through P2Y purinoceptor 12M84121NSNSNSNS1680.251.54E–04
ADP signaling through P2Y purinoceptor 1M81125NSNSNSNS2101.012.71E–04
Glucagon-type ligand receptorsM1032233NSNSNSNS2483.814.51E–04
Signal amplificationM937931NSNSNSNS2412.574.99E–04
Gβγ signaling through PI3KγM1430125NSNSNSNS1808.245.30E–04

AD Alzheimer’s disease, GSEA Gene Set Enrichment Analysis Platform, HDL high-density lipoprotein, LDL low-density lipoprotein, EGFR epidermal growth factor receptor, PDGF platelet-derived growth factor, NS nonsignificant

aDefinition of pathway is based on the Pathway Interaction Database (PID). Other pathways are from the REACTOME pathway knowledgebase

Significantly AD-associated pathways from pathway-based analyses AD Alzheimer’s disease, GSEA Gene Set Enrichment Analysis Platform, HDL high-density lipoprotein, LDL low-density lipoprotein, EGFR epidermal growth factor receptor, PDGF platelet-derived growth factor, NS nonsignificant aDefinition of pathway is based on the Pathway Interaction Database (PID). Other pathways are from the REACTOME pathway knowledgebase

TWA analysis

Analyzing probes with cis-eQTLs

Using eQTLs data from peripheral blood, we found that four, eight, and four probes/genes passed both the SMR (PSMR < 6.03E–05) and HEIDI (PHEIDI ≥ 0.05) tests under plans 1–3, respectively. The significant FDR level for interpreting the results from the SMR test was set to 0.05 under plan 1 and 0.025 under plans 2 and 3 to ensure that the expected number of false-positive findings was < 1. Table 5 presents information about these 16 probes/genes, their top eQTLs, and the respective p values. The top eQTLs corresponding to these probes/genes were all nominally significant in our GWA analyses (2.01E–04 ≤ PGWAS ≤ 2.47E–02). Moreover, we did not identify any SNPs with significant p values at the genome-wide significance level within 1 Mb of these genes. However, several SNPs within 1 Mb of MS4A6A [64, 66, 74, 75] and UQCC [76] were associated with AD with p < 5E–08 in previous studies. Among 14 other genes, SNPs in regions around TRA2A [64], IRAK3 [77], and ESPN [78] were previously associated with AD at the suggestive level of associations. In addition, ATG10 [77] and LPXN [74] were located in chromosomal regions (i.e., 5q14.1 and 11q12.1) that contained AD-associated SNPs with p < 5E–06.
Table 5

Significantly AD-associated probes/genes from transcriptome-wide analyses

Probe IDGeneTop eQTLChromosomal regionPositionA1A1 Freq P GWAS P eQTL b SMR SESMR P SMR P HEIDI
Plan 1—males and females
 ILMN_1754501 C2orf74 rs7202012p1561,149,328C0.4136.94E–041.34E–1640.1890.0475.54E–056.64E–01
 ILMN_2206098 ATG10 rs117415695q14.182,030,091G0.0909.96E–049.69E–27−0.3910.0975.66E–058.32E–02
 ILMN_2359800 MS4A6A rs710866311q12.260,260,669C0.4122.17E–041.41E–480.3800.0881.59E–058.57E–01
 ILMN_2343047 ABCB9 rs64176012q24.31123,034,319T0.2142.75E–049.79E–17−0.6500.1626.03E–051.14E–01
Plan 2—only males
 ILMN_1700307 ZNF815 rs1178565607p22.15,852,503C0.0481.50E–028.23E–490.3890.0922.22E–058.80E–02
 ILMN_1731043 TRA2A rs10461357p15.323,530,965T0.1351.89E–034.32E–115−0.2940.0671.07E–052.51E–01
 ILMN_1671603 MED30 rs101113288q24.11117,525,450T0.4263.79E–041.67E–201.0110.2321.30E–058.22E–02
 ILMN_1742789 LPXN rs792856511q12.158,587,146C0.0122.47E–027.17E–86−0.2870.0562.68E–076.20E–01
 ILMN_1913678 IRAK3 rs143684912q14.366,181,338T0.4161.25E–036.14E–35−0.6560.1604.11E–056.60E–02
 ILMN_1737561 N4BP2L2 rs71844413q13.132,572,915T0.3751.16E–031.80E–53−0.4890.1193.96E–052.90E–01
 ILMN_1724734 UQCC rs242506220q11.2235,335,763G0.3832.01E–044.58E–13−1.3560.3212.42E–057.78E–01
 ILMN_2296950 APOBEC3F rs1108992822q13.139,133,424A0.0682.42E–037.21E–20−0.7240.1681.60E–058.31E–02
 ILMN_1806607 SFN a rs38176041p36.11/4p16.31,297,549T0.1204.10E–045.77E–101.1790.2925.49E–053.92E–01
Plan 3—only females
 ILMN_1806710 ESPN rs120743791p36.316,434,683T0.0407.26E–038.51E–400.4210.1013.38E–052.84E–01
 ILMN_1696003 GNAI3 rs23012291p13.3109,594,056T0.1754.92E–041.45E–211−0.2090.0454.00E–061.21E–01
 ILMN_1741881 C9orf72 rs22822409p21.227,572,636T0.2642.49E–041.18E–90−0.3270.0703.27E–065.39E–02
 ILMN_1803925 MTMR3 rs4117422q12.230,030,080T0.3258.34E–043.03E–760.3210.0784.21E–054.12E–01

Genomic coordinates are based on Human Genome version 38 (hg38)

AD Alzheimer’s disease, eQTL expression quantitative trait loci, A1 effect allele, A1/A1 freq effect allele and its frequency, GWAS genome-wide association study, SMR summary-data-based Mendelian randomization, SE standard error, HEIDI Heterogeneity in Dependent Instruments

aTrans-eQTL

Significantly AD-associated probes/genes from transcriptome-wide analyses Genomic coordinates are based on Human Genome version 38 (hg38) AD Alzheimer’s disease, eQTL expression quantitative trait loci, A1 effect allele, A1/A1 freq effect allele and its frequency, GWAS genome-wide association study, SMR summary-data-based Mendelian randomization, SE standard error, HEIDI Heterogeneity in Dependent Instruments aTrans-eQTL Our TWA analyses on brain-specific eQTLs data revealed associations of two probes/genes with AD in males (i.e., CRIPAK and PRDM10), and two others in females (i.e., AHSA2 and ATG10) at the FDR level of 0.05 (Table 6). No probe/gene passed the SMR and HEIDI tests under analysis plan 1. The probe corresponding to the AHSA2 gene was significantly associated with AD in several brain regions (i.e., caudate basal ganglia, cerebellum, cortex, hypothalamus, nucleus accumbens, putamen basal ganglia, and substantia nigra). Also, ATG10 was significantly associated with AD in the nucleus accumbens and putamen basal ganglia. The corresponding top eQTLs were nominally significant in our GWA analyses in males and females (4.30E–05 ≤ PGWAS ≤ 9.33E–02). There were no SNPs with significant p values at the genome-wide significance level within 1 Mb of these genes in our study; however, SNPs with significant p values at the suggestive level of significance were found in flanking regions of ATG10 in the nonreplicated set of SNPs in females (see Additional file 4). In addition, the SNPs within 1 Mb of these four genes were only nominally associated with AD in previous studies [43]. In terms of chromosomal regions, in addition to ATG10 as explained earlier, SNPs in the chromosome 11q24.3 region (PRDM10 gene) were also previously associated with AD at a genome-wide significance level [69].
Table 6

Significantly AD-associated probe/genes from transcriptome-wide analyses on brain tissue data

Probe IDGeneTop eQTLChromosomal regionPositionA1A1 Freq P GWAS P eQTL b SMR SESMR P SMR P HEIDI Brain region
Plan 2—only males
 ENSG00000179979.7 CRIPAK rs38176044p16.31,297,549T0.1204.10E–041.14E–090.3580.0896.24E–054.13E–01Hippocampus
 ENSG00000170325.10 PRDM10 rs5574621811q24.3129,964,624C0.0769.33E–032.10E–090.8670.2052.33E–052.81E–01Cerebellar hemisphere
Plan 3—only females
 ENSG00000173209.18 AHSA2 rs26006672p1561,177,796T0.2971.75E–041.53E–22−0.3300.0751.19E–059.70E–02Caudate basal ganglia
 ENSG00000173209.18 AHSA2 rs26006672p1561,177,796T0.2971.75E–049.79E–40−0.2160.0474.36E–061.29E–01Cerebellum
 ENSG00000173209.18 AHSA2 rs26006672p1561,177,796T0.2971.75E–046.30E–28−0.2920.0657.72E–061.33E–01Cortex
 ENSG00000173209.18 AHSA2 rs26006672p1561,177,796T0.2971.75E–042.30E–15−0.3350.0803.08E–058.00E–02Hypothalamus
 ENSG00000173209.18 AHSA2 rs26006672p1561,177,796T0.2971.75E–042.65E–24−0.2620.0591.01E–057.39E–02Nucleus accumbens
 ENSG00000173209.18 AHSA2 rs26006672p1561,177,796T0.2971.75E–047.23E–22−0.2170.0501.27E–052.11E–01Putamen basal ganglia
 ENSG00000173209.18 AHSA2 rs26006672p1561,177,796T0.2971.75E–045.43E–13−0.3310.0825.05E–053.20E–01Substantia nigra
 ENSG00000152348.11 ATG10 rs117488685q14.182,066,661C0.0914.30E–052.70E–09−0.2810.0661.85E–058.05E–02Nucleus accumbens
 ENSG00000152348.11 ATG10 rs117488685q14.182,066,661C0.0914.30E–051.39E–09−0.2630.0611.55E–051.02E–01Putamen basal ganglia

Genomic coordinates are based on Human Genome version 38 (hg38)

AD Alzheimer’s disease, eQTL expression quantitative trait loci, A1 effect allele, A1/A1 freq effect allele and its frequency, GWAS genome-wide association study, SMR summary-data-based Mendelian randomization, SE standard error, HEIDI Heterogeneity in Dependent Instruments

Significantly AD-associated probe/genes from transcriptome-wide analyses on brain tissue data Genomic coordinates are based on Human Genome version 38 (hg38) AD Alzheimer’s disease, eQTL expression quantitative trait loci, A1 effect allele, A1/A1 freq effect allele and its frequency, GWAS genome-wide association study, SMR summary-data-based Mendelian randomization, SE standard error, HEIDI Heterogeneity in Dependent Instruments

Analyzing probes with trans-eQTLs

Using eQTLs data from peripheral blood, one probe mapping to the SFN gene on chromosome 1p36 had significant PSMR at the FDR level of 0.05, and passed the HEIDI test under plan 2 (Table 5). The corresponding top eQTL was located on chromosome 4p16 in the intronic region of the MAEA gene and was nominally associated with AD in our study (PGWAS = 4.10E–04). There were no significant association signals at the genome-wide significance level in the SFN gene or its 1-Mb flanking regions in current or previous studies [43].

Discussion

The genetic architecture of AD has been widely studied in recent years, and so far more than 60,000 SNPs have been associated with AD with p < 0.05. Of these, 281 SNPs (mapped to 49 genes) and 593 SNPs (mapped to 165 genes) had significant p values at the genome-wide and suggestive levels of associations, respectively [43]. Despite these efforts, a major proportion of h2 of AD has remained unexplained. Exploring the genetic risk factors contributing to AD is highly important from a precision medicine perspective where the goal is to personalize diagnostic and therapeutic interventions. The current study provides further insight into the genetic architecture of AD through GWA, TWA, gene-based, and pathway-based analyses of four independent datasets. These datasets, particularly the LOADFS cohort, were partially used in previous genetic studies of AD [21, 72, 75, 79–87]. Our GWA analyses corroborated the associations of a number of previously detected AD loci and revealed some significant novel association signals. Among previously detected AD-associated SNPs, we found several SNPs with p values that were smaller than those reported before. Also, the significant association signals for three SNPs inside the chromosome 19q13 region (i.e., nonreplicated rs2965169 SNP under plan 1, rs10426423 from the meta-analysis set of SNPs under plan 1, and rs769450 from the replicated set of SNPs under plan 1 and the nonreplicated sets of SNPs under plans 2 and 3) were previously reported only in African-Americans (p = 2.6E–8, p = 9.9E–7, and p = 5.3E–27, respectively [88]). Most newly detected AD-associated SNPs, particularly those outside the chromosome 19q13 region, can be considered informative AD markers because their p values in our study were smaller than those for other AD-associated loci in their 1-Mb upstream/downstream regions and they were not in LD with such loci. For instance, as seen in Table 2 that summarizes the replicated and meta-analysis sets of SNPs, 11, 4, and 21 novel AD-associated SNPs were detected under plans 1–3, respectively. These SNPs were mapped to 21 genes in 19 chromosomal regions (i.e., cytogenetic regions). Of these, four genes had been associated with AD in previous GWAS with p < 5E–06. Also, nine genes were located in eight chromosomal regions that contained previously AD-associated SNPs that were > 1 Mb away from the SNPs detected in our study. The other eight genes/regions had not been associated with AD in previous studies at genome-wide or suggestive levels of associations [43]. Our GWA analyses also revealed associations of a number of SNPs (41, 33, and 40 SNPs under plans 1–3, respectively) with AD that were present only in one of the four investigated cohorts. While successful replication of a discovered association in an independent cohort has become the gold standard in genome-wide association studies for substantiating the real genetic effects, failure to replicate SNP–disease associations does not necessarily indicate that they are false-positive findings. Instead, they might be real genetic contributors that confer population-specific risks due to the genetic heterogeneity of the disease [2, 60, 89, 90]. Other reasons for nonreproducibility can be the lack of statistical power due to insufficient sample sizes, the presence of environmental or gene–gene interactions, and a lack of genotyping information for particular loci in different studies. For instance, small between-population allele frequency differences at an interacting locus may result in a lack of power to detect the main effect of a genuine association signal in independent cohorts [60]. These reasons can also justify why not all previously discovered AD-associated SNPs were replicated in our study. Of particular interest was to investigate the sex disparity in the genetic basis of AD. Addressing sex differences in biomedical research has been emphasized by the National Institutes of Health as an approach that can eventually bolster the personalized medicine paradigm [14, 15]. Our results revealed a number of new sex-specific genetic contributors to AD at the SNP, gene, and transcriptome levels. For instance, most of the newly detected SNPs, particularly SNPs outside chromosome 19q13, were sex specific as they had significant p values either in males or females and, in addition, their odds ratios were significantly different between the two genders. Interestingly, there were two additional subsets of SNPs that were nominally associated with AD in all datasets in one sex while they were nonsignificant in all datasets in the other. Such consistent sex-specific association signals, although weak, might be important in exploring the differences in genetic risk factors of AD between males and females and may demonstrate genome-wide significance in larger samples. Another level of sex disparity was observed in the gene-based and TWA analyses where several genes were significantly associated with AD in either males or females. Also, there were several pathways that were specifically significant in females. These will be further discussed in the following paragraphs. In the gene-based analysis, LINC00158, MIR155HG, MIR155, LINC00515, MRPL39, and JAM2 were significantly associated with AD when the entire sample of individuals and/or only females were analyzed. These genes are located near each other on chromosome 21q21.3 in a ~ 332-kb region. The APP gene implicated in early onset familial AD or Down syndrome-related AD [4] is also located 163–449 kb from these genes. There were no AD-associated SNPs with p < 5E–08 within their 1 Mb in current or previous studies [43]. However, there were several SNPs with significant p values at the suggestive level of associations in that chromosomal region among meta-analysis sets of SNPs under plan 1 (i.e., rs76252969 and rs2298369) and plan 3 (i.e., rs12386284, rs1783012, rs1783013, rs926963, rs1893650, rs2226326, rs2829803, rs2298369, rs2829823, and rs2829832). The SNPs in the 1-Mb upstream/downstream regions of these genes were previously associated with some potential AD risk factors such as type 2 diabetes, hypertension, coronary artery disease, and lipid profile changes at the genome-wide significance level. They have also been associated with traits such as alcohol and nicotine codependence, age at onset of Parkinson’s disease, and pattern recognition memory at the suggestive significance level of association [43]. Furthermore, functional studies have provided insight into the potential roles of some of these genes in AD pathogenesis. For instance, MIR155HG and MIR155 encode two microRNAs. MIR155 overexpression was previously implicated in downregulation of complement factor H (CFH) expression in AD and other neurodegenerative diseases which in turn may prevent spontaneous immune system activation [91]. MRPL39 encodes a mitochondrial ribosomal protein involved in the oxidative–phosphorylation pathway. Impaired mitochondrial function has been reported in neurons of patients with AD [92, 93]. Lunnon et al. [92] reported that the expression levels of MRPL39 and another nearby gene (i.e., ATP5J involved in the oxidative–phosphorylation pathway) were slightly reduced in AD patients compared to controls. JAM2 encodes a membrane protein found at the tight junctions of epithelial and endothelial cells that acts as an adhesive ligand for immune cells. It belongs to the immunoglobulin superfamily of adhesive molecules that has been implicated in AD pathogenesis [94]. Also, duplication of an ~ 600-kb region on chromosome 21 containing the JAM2, ATP5J, and APP genes has been reported in autosomal dominant AD [95]. In TWA analyses using brain-specific eQTLs data, four probes/genes were associated with AD (two in males and two in females). Also, using eQTLs data from peripheral blood, the expression level of 17 probes/genes passed both the SMR and the HEIDI tests, indicating that variants influencing the expression of these genes may also have pleiotropic effects on developing AD [55, 56]. It should be noted that due to the tissue-specific expression of genes, using data from eQTLs studies on blood is not ideal for capturing associations between the transcriptome levels and AD. However, it increases the power of SMR analysis since such studies take advantage of more samples compared to brain-specific eQTLs studies [55]. Significant SNPs with p < 5E–08 were detected within 1 Mb of MS4A6A and UQCC genes (significant in TWA analyses of blood eQTLs data) in our GWAS or previous reports [43]. SNPs with p < 5E–06 were present only in 1-Mb upstream/downstream regions of ATG10 (significant in brain-specific TWA analyses) in our GWA analyses of females, although several AD-associated SNPs with p < 5E–06 were reported in regions around TRA2A [64], IRAK3 [77], and ESPN [78]. This is likely indicative of the lack of power of conducted GWAS due to insufficient sample sizes [55]. Taken together, all AD-associated genes in our TWA analyses except MS4A6A and UQCC can be considered novel potential AD-associated genes. Further functional analyses are needed to explore their potential roles in AD pathogenesis as detected associations do not imply causation. Instead, they provide a list of prioritized candidates for follow-up studies. SNPs in 1-Mb upstream/downstream regions around these genes have been previously associated with some other traits (e.g., autoimmune diseases or serum cholesterol levels) with p < 5E–06. Examples include associations of SNPs corresponding to ABCB9 with college completion and years of education, ATG10 with vascular dementia, C9orf72 with amyotrophic lateral sclerosis, frontotemporal lobar degeneration, and response of rheumatoid arthritis patients to anti-TNF treatment, GNAI3 with total and low-density lipoprotein cholesterol (LDL-C) and major depression, LPXN with inflammatory bowel disease, MED30 with rheumatoid arthritis and fasting blood glucose, PRDM10 with type 2 diabetes, and SFN with high-density lipoprotein cholesterol (HDL-C) [43]. Notably, none of the novel AD-associated genes detected in males were among the significant genes in females and vice versa. Among the significant genes detected in females, a pathologic hexa-nucleotide repeat expansion in the C9orf72 gene has been linked to frontotemporal dementia and may contribute to AD pathogenesis [96-99]. Also, the GNAI3 gene was reported to be overexpressed in AD intact mice compared to AD impaired ones [100]. CRIPAK, which was among significant genes detected in brain-specific TWA analyses in males, is an inhibitor of the PAK1 gene [101]. The PAK gene family was found to play roles in learning and memory, and the dysregulations were implicated in AD, Huntington disease, and mental retardation [102]. Also, rs1923775 located ~ 700 kb away from CRIPAK has shown relatively strong association (p = 5.60E–6) with AD in African Americans [88]. Of 26 pathways that were significantly associated with AD in our pathway-based analyses, 12 were not plan specific, seven were specifically significant only under plan 1 (males and females), and seven were specifically significant only in females (i.e., plan 3). Pathways that were significant in more than one plan were mostly involved in processes such as mitochondrial function, lipid metabolism, cell junctions, and immune and inflammatory responses that were implicated in AD [93, 103–106]. There are several lines of evidence in previous empirical studies substantiating the potential roles of some of the detected plan-specific pathways in AD pathogenesis. For instance, it was suggested that deactivation of the epidermal growth factor receptor (EGFR) signaling pathway may attenuate the Aβ-induced memory loss in Drosophila and mice models [107]. Also, the fragmentation and dysfunction of Golgi apparatus, an organelle involved in the posttranslational modifications and trafficking of proteins, has been implicated in AD pathogenesis [108, 109]. The upregulation of the Fas signaling pathway, involved in the apoptosis and modulating immune responses, was reported to contribute to the Aβ-induced cell death and neurodegeneration in AD [110, 111]. Also, dysregulation of the platelet-derived growth factor (PDGF) signaling pathway was suggested to increase Aβ production and contribute to the neurodegeneration in AD [112, 113]. Among the female-specific pathways, G-protein activation is a signal transduction pathway that can modulate the production and action of different intracellular effector proteins. The G protein-coupled receptors play important roles in the initiation and regulation of inflammatory responses such as phagocyte chemotaxis and cytokine production [50, 114]. The pathologically increased inflammatory responses were reported in the brain of patients with AD [93]. Gβγ signaling through the PI3Kγ pathway is involved in the regulation of immune system responses and platelet activation [115]. Also, the ADP signaling, signal amplification, and prostacyclin signaling pathways are involved in the regulation of platelets activation in response to injury or in healthy blood vessels [50]. Platelets, as the major sources of amyloid precursor protein (APP) and Aβ in blood, were reported to be overactivated in AD patients possibly due to their stimulation by injured cerebral endothelial cells or by their cell membrane abnormalities [116, 117]. The glucagon-type ligand receptors are found in the gastrointestinal epithelium and brain neurons. Glucagon-like peptide-1 (GLP-1) has been suggested as a potential treatment to reverse the neurodegeneration in AD and Parkinson’s disease [118, 119].

Conclusions

In summary, our study revealed significant associations of several SNPs at genome-wide or suggestive levels of significance which were not reported before. Most of the SNPs that were located outside the APOE cluster gene region were not in LD with previously discovered AD-associated polymorphisms that had p < 5E–06 (Table 2). These SNPs were mapped to 21 genes in 19 chromosomal regions. Of these, 8 genes/regions had not been associated with AD in previous GWAS with p < 5E–06. Also, 26 genes located outside the chromosome 19q13 region, and 26 pathways, showed evidence of associations with AD at the FDR level of 0.05 in our TWA, gene-based, and pathway-based analyses. Thirteen of these 26 genes were located in chromosomal regions with no AD-associated SNPs at the genome-wide or suggestive level of significance. Most of the significantly detected SNPs and genes as well as several AD-associated pathways were sex specific, indicating sex disparities in the genetic basis of AD. By detecting a number of novel potential AD-associated SNPs and discovering suggestive associations of several genes and transcripts, our study provides new insight into the genetic architecture of AD. Particularly, identifying sex-specific genetic contributors can advance our understanding of AD pathogenesis. Despite the rigor of this study, there are some limitations. The case/control status in the four cohorts used in this study was mainly determined clinically. The routine clinical diagnosis of AD based on the symptoms and neurologic examinations may not provide the optimal case/control classification. Instead, the National Institute on Aging and the Alzheimer’s Association suggested that integrating additional paraclinical tests (e.g., histopathologic findings in brain biopsy, measuring AD-related cerebrospinal fluid (CSF) biomarkers, or detecting neurodegeneration by the imaging study) into the diagnostic protocols can aid researchers to more accurately identify AD patients and healthy controls [120, 121]. Beach et al. [122] investigated the accuracy of clinical diagnosis of AD by comparing such diagnoses to the histopathology findings from brain autopsies in a sample of 1198 subjects. They found that the sensitivity and specificity of clinical diagnostic classification were 70.9–87.3% and 44.3–70.8%, respectively, indicating a relatively high possibility of clinically false-negative and false-positive classification of subjects as controls and cases, respectively [122]. Finally, since the power of GWA analyses is affected by the sample sizes, and in particular the number of cases, the current study with 2741 cases and 14,739 controls may not have the optimal power. Further studies, possibly with larger sample sizes, are needed to clarify the genotype–phenotype relationships in AD. Table S1. Cases and controls included. Table S2. QC-passed SNPs analyzed in datasets. Table S3. Genomic inflation factors (λ values) from logistic regression models. Table S4–S6. Replicated set of SNPs detected under analysis plan 1 (males and females), plan 2 (only males), and plan 3 (only females). Table S7–S9. Nonreplicated set of SNPs detected under analysis plan 1 (males and females), plan 2 (only males), and plan 3 (only females). Table S10–S12. Meta-analysis set of SNPs detected under analysis plan 1 (males and females), plan 2 (only males), and plan 3 (only females). Table 13. LD information about newly detected SNPs under plans 1–3 for which proxy AD-associated loci exist in 1-Mb flanking regions [8, 9]. Table S14. Coding schema used to determine APOE genotypes. Table 15. Information about LD between APOE SNPs and AD-associated SNPs located on chromosome 19 [8]. Table S16–S17. Wald χ2 test to compare ORs of SNPs between males and females for SNPs that were specifically significant in males and in females. Figure S1–S6. Manhattan plot and QQ plot of genome-wide association results under analysis plan 1 (males and females), plan 2 (only males), and plan 3 (only females). Supporting Acknowledgment. Furthre information about the four cohorts under consideration. (DOCX 284 kb) Detailed information about the AD-associated SNPs under analysis plan 1 (males and females). (XLSX 72 kb) Detailed information about the AD-associated SNPs under analysis plan 2 (only males). (XLSX 57 kb) Detailed information about the AD-associated SNPs under analysis plan 3 (only females). (XLSX 72 kb) Detailed information about the nominally AD-associated SNPs with the opposite pattern of significance in males and females. (XLSX 40 kb) Detailed information about the results from the Wald χ2 test to compare ORs of SNPs between males and females. (XLSX 107 kb)
  116 in total

1.  The Framingham Offspring Study. Design and preliminary data.

Authors:  M Feinleib; W B Kannel; R J Garrison; P M McNamara; W P Castelli
Journal:  Prev Med       Date:  1975-12       Impact factor: 4.018

2.  Robust relationship inference in genome-wide association studies.

Authors:  Ani Manichaikul; Josyf C Mychaleckyj; Stephen S Rich; Kathy Daly; Michèle Sale; Wei-Min Chen
Journal:  Bioinformatics       Date:  2010-10-05       Impact factor: 6.937

Review 3.  Regulation of complement factor H (CFH) by multiple miRNAs in Alzheimer's disease (AD) brain.

Authors:  Walter J Lukiw; Peter N Alexandrov
Journal:  Mol Neurobiol       Date:  2012-08       Impact factor: 5.590

Review 4.  Cadherins as targets for genetic diseases.

Authors:  Aziz El-Amraoui; Christine Petit
Journal:  Cold Spring Harb Perspect Biol       Date:  2010-01       Impact factor: 10.005

5.  Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets.

Authors:  Zhihong Zhu; Futao Zhang; Han Hu; Andrew Bakshi; Matthew R Robinson; Joseph E Powell; Grant W Montgomery; Michael E Goddard; Naomi R Wray; Peter M Visscher; Jian Yang
Journal:  Nat Genet       Date:  2016-03-28       Impact factor: 38.330

6.  Atypical, slowly progressive behavioural variant frontotemporal dementia associated with C9ORF72 hexanucleotide expansion.

Authors:  Baber K Khan; Jennifer S Yokoyama; Leonel T Takada; Sharon J Sha; Nicola J Rutherford; Jamie C Fong; Anna M Karydas; Teresa Wu; Robin S Ketelle; Matthew C Baker; Mariely-Dejesus Hernandez; Giovanni Coppola; Daniel H Geschwind; Rosa Rademakers; Suzee E Lee; Howard J Rosen; Gil D Rabinovici; William W Seeley; Katherine P Rankin; Adam L Boxer; Bruce L Miller
Journal:  J Neurol Neurosurg Psychiatry       Date:  2012-04       Impact factor: 10.154

Review 7.  Survival in dementia and predictors of mortality: a review.

Authors:  Stephen Todd; Stephen Barr; Mark Roberts; A Peter Passmore
Journal:  Int J Geriatr Psychiatry       Date:  2013-03-22       Impact factor: 3.485

8.  Quality control and conduct of genome-wide association meta-analyses.

Authors:  Thomas W Winkler; Felix R Day; Damien C Croteau-Chonka; Andrew R Wood; Adam E Locke; Reedik Mägi; Teresa Ferreira; Tove Fall; Mariaelisa Graff; Anne E Justice; Jian'an Luan; Stefan Gustafsson; Joshua C Randall; Sailaja Vedantam; Tsegaselassie Workalemahu; Tuomas O Kilpeläinen; André Scherag; Tonu Esko; Zoltán Kutalik; Iris M Heid; Ruth J F Loos
Journal:  Nat Protoc       Date:  2014-04-24       Impact factor: 13.491

9.  Common variants at MS4A4/MS4A6E, CD2AP, CD33 and EPHA1 are associated with late-onset Alzheimer's disease.

Authors:  Adam C Naj; Gyungah Jun; Gary W Beecham; Li-San Wang; Badri Narayan Vardarajan; Jacqueline Buros; Paul J Gallins; Joseph D Buxbaum; Gail P Jarvik; Paul K Crane; Eric B Larson; Thomas D Bird; Bradley F Boeve; Neill R Graff-Radford; Philip L De Jager; Denis Evans; Julie A Schneider; Minerva M Carrasquillo; Nilufer Ertekin-Taner; Steven G Younkin; Carlos Cruchaga; John S K Kauwe; Petra Nowotny; Patricia Kramer; John Hardy; Matthew J Huentelman; Amanda J Myers; Michael M Barmada; F Yesim Demirci; Clinton T Baldwin; Robert C Green; Ekaterina Rogaeva; Peter St George-Hyslop; Steven E Arnold; Robert Barber; Thomas Beach; Eileen H Bigio; James D Bowen; Adam Boxer; James R Burke; Nigel J Cairns; Chris S Carlson; Regina M Carney; Steven L Carroll; Helena C Chui; David G Clark; Jason Corneveaux; Carl W Cotman; Jeffrey L Cummings; Charles DeCarli; Steven T DeKosky; Ramon Diaz-Arrastia; Malcolm Dick; Dennis W Dickson; William G Ellis; Kelley M Faber; Kenneth B Fallon; Martin R Farlow; Steven Ferris; Matthew P Frosch; Douglas R Galasko; Mary Ganguli; Marla Gearing; Daniel H Geschwind; Bernardino Ghetti; John R Gilbert; Sid Gilman; Bruno Giordani; Jonathan D Glass; John H Growdon; Ronald L Hamilton; Lindy E Harrell; Elizabeth Head; Lawrence S Honig; Christine M Hulette; Bradley T Hyman; Gregory A Jicha; Lee-Way Jin; Nancy Johnson; Jason Karlawish; Anna Karydas; Jeffrey A Kaye; Ronald Kim; Edward H Koo; Neil W Kowall; James J Lah; Allan I Levey; Andrew P Lieberman; Oscar L Lopez; Wendy J Mack; Daniel C Marson; Frank Martiniuk; Deborah C Mash; Eliezer Masliah; Wayne C McCormick; Susan M McCurry; Andrew N McDavid; Ann C McKee; Marsel Mesulam; Bruce L Miller; Carol A Miller; Joshua W Miller; Joseph E Parisi; Daniel P Perl; Elaine Peskind; Ronald C Petersen; Wayne W Poon; Joseph F Quinn; Ruchita A Rajbhandary; Murray Raskind; Barry Reisberg; John M Ringman; Erik D Roberson; Roger N Rosenberg; Mary Sano; Lon S Schneider; William Seeley; Michael L Shelanski; Michael A Slifer; Charles D Smith; Joshua A Sonnen; Salvatore Spina; Robert A Stern; Rudolph E Tanzi; John Q Trojanowski; Juan C Troncoso; Vivianna M Van Deerlin; Harry V Vinters; Jean Paul Vonsattel; Sandra Weintraub; Kathleen A Welsh-Bohmer; Jennifer Williamson; Randall L Woltjer; Laura B Cantwell; Beth A Dombroski; Duane Beekly; Kathryn L Lunetta; Eden R Martin; M Ilyas Kamboh; Andrew J Saykin; Eric M Reiman; David A Bennett; John C Morris; Thomas J Montine; Alison M Goate; Deborah Blacker; Debby W Tsuang; Hakon Hakonarson; Walter A Kukull; Tatiana M Foroud; Jonathan L Haines; Richard Mayeux; Margaret A Pericak-Vance; Lindsay A Farrer; Gerard D Schellenberg
Journal:  Nat Genet       Date:  2011-04-03       Impact factor: 38.330

Review 10.  NIA-AA Research Framework: Toward a biological definition of Alzheimer's disease.

Authors:  Clifford R Jack; David A Bennett; Kaj Blennow; Maria C Carrillo; Billy Dunn; Samantha Budd Haeberlein; David M Holtzman; William Jagust; Frank Jessen; Jason Karlawish; Enchi Liu; Jose Luis Molinuevo; Thomas Montine; Creighton Phelps; Katherine P Rankin; Christopher C Rowe; Philip Scheltens; Eric Siemers; Heather M Snyder; Reisa Sperling
Journal:  Alzheimers Dement       Date:  2018-04       Impact factor: 21.566

View more
  20 in total

1.  The Big Picture of Neurodegeneration: A Meta Study to Extract the Essential Evidence on Neurodegenerative Diseases in a Network-Based Approach.

Authors:  Nicolas Ruffini; Susanne Klingenberg; Raoul Heese; Susann Schweiger; Susanne Gerber
Journal:  Front Aging Neurosci       Date:  2022-06-27       Impact factor: 5.702

2.  Genome-wide analysis identified abundant genetic modulators of contributions of the apolipoprotein E alleles to Alzheimer's disease risk.

Authors:  Alireza Nazarian; Yury Loika; Liang He; Irina Culminskaya; Alexander M Kulminski
Journal:  Alzheimers Dement       Date:  2022-01-03       Impact factor: 16.655

3.  Genetic heterogeneity of Alzheimer's disease in subjects with and without hypertension.

Authors:  Alireza Nazarian; Konstantin G Arbeev; Arseniy P Yashkin; Alexander M Kulminski
Journal:  Geroscience       Date:  2019-05-05       Impact factor: 7.713

Review 4.  Reassessment of Pioglitazone for Alzheimer's Disease.

Authors:  Ann M Saunders; Daniel K Burns; William Kirby Gottschalk
Journal:  Front Neurosci       Date:  2021-06-16       Impact factor: 4.677

5.  The impact of disregarding family structure on genome-wide association analysis of complex diseases in cohorts with simple pedigrees.

Authors:  Alireza Nazarian; Konstantin G Arbeev; Alexander M Kulminski
Journal:  J Appl Genet       Date:  2019-11-21       Impact factor: 3.240

6.  Alzheimer Disease Pathology-Associated Polymorphism in a Complex Variable Number of Tandem Repeat Region Within the MUC6 Gene, Near the AP2A2 Gene.

Authors:  Yuriko Katsumata; David W Fardo; Adam D Bachstetter; Sergey C Artiushin; Wang-Xia Wang; Angela Wei; Lena J Brzezinski; Bela G Nelson; Qingwei Huang; Erin L Abner; Sonya Anderson; Indumati Patel; Benjamin C Shaw; Douglas A Price; Dana M Niedowicz; Donna W Wilcock; Gregory A Jicha; Janna H Neltner; Linda J Van Eldik; Steven Estus; Peter T Nelson
Journal:  J Neuropathol Exp Neurol       Date:  2020-01-01       Impact factor: 3.685

7.  Identifying genetic risk variants associated with brain volumetric phenotypes via K-sample Ball Divergence method.

Authors:  Yue Hu; Haizhu Tan; Cai Li; Heping Zhang
Journal:  Genet Epidemiol       Date:  2021-06-29       Impact factor: 2.344

Review 8.  Sex differences in the incidence and prevalence of young-onset Alzheimer's disease: A meta-analysis.

Authors:  Geeske Peeters; Katerine Katelekha; Brian Lawlor; Naiara Demnitz
Journal:  Int J Geriatr Psychiatry       Date:  2021-08-12       Impact factor: 3.850

9.  Neuronal ROS-induced glial lipid droplet formation is altered by loss of Alzheimer's disease-associated genes.

Authors:  Matthew J Moulton; Scott Barish; Isha Ralhan; Jinlan Chang; Lindsey D Goodman; Jake G Harland; Paul C Marcogliese; Jan O Johansson; Maria S Ioannou; Hugo J Bellen
Journal:  Proc Natl Acad Sci U S A       Date:  2021-12-28       Impact factor: 12.779

10.  Genome-Wide Analysis of Sex Disparities in the Genetic Architecture of Lung and Colorectal Cancers.

Authors:  Alireza Nazarian; Alexander M Kulminski
Journal:  Genes (Basel)       Date:  2021-05-01       Impact factor: 4.096

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.