Literature DB >> 29688227

Whole genome sequencing of Caribbean Hispanic families with late-onset Alzheimer's disease.

Badri N Vardarajan1,2,3, Sandra Barral1,2,4, James Jaworski5, Gary W Beecham5, Elizabeth Blue6, Giuseppe Tosto1,2, Dolly Reyes-Dumeyer1,2, Martin Medrano7, Rafael Lantigua1,8, Adam Naj9, Timothy Thornton10, Anita DeStefano11, Eden Martin5, Li-San Wang1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17, Lisa Brown10, William Bush12, Cornelia van Duijn13, Allison Goate14, Lindsay Farrer11, Jonathan L Haines12, Eric Boerwinkle15, Gerard Schellenberg9, Ellen Wijsman6,10, Margaret A Pericak-Vance5, Richard Mayeux1,2,3,16,17, Li-San Wang1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17.   

Abstract

OBJECTIVE: To identify rare causal variants underlying known loci that segregate with late-onset Alzheimer's disease (LOAD) in multiplex families.
METHODS: We analyzed whole genome sequences (WGS) from 351 members of 67 Caribbean Hispanic (CH) families from Dominican Republic and New York multiply affected by LOAD. Members of 67 CH and additional 47 Caucasian families underwent WGS as a part of the Alzheimer's Disease Sequencing Project (ADSP). All members of 67 CH families, an additional 48 CH families and an independent CH case-control cohort were subsequently genotyped for validation. Patients met criteria for LOAD, and controls were determined to be dementia free. We investigated rare variants segregating within families and gene-based associations with disease within LOAD GWAS loci.
RESULTS: A variant in AKAP9, p.R434W, segregated significantly with LOAD in two large families (OR = 5.77, 95% CI: 1.07-30.9, P = 0.041). In addition, missense mutations in MYRF and ASRGL1 under previously reported linkage peaks at 7q14.3 and 11q12.3 segregated completely in one family and in follow-up genotyping both were nominally significant (P < 0.05). We also identified rare variants in a number of genes associated with LOAD in prior genome wide association studies, including CR1 (P = 0.049), BIN1 (P = 0.0098) and SLC24A4 (P = 0.040). CONCLUSIONS AND RELEVANCE: Rare variants in multiple genes influence the risk of LOAD disease in multiplex families. These results suggest that rare variants may underlie loci identified in genome wide association studies.

Entities:  

Year:  2018        PMID: 29688227      PMCID: PMC5899906          DOI: 10.1002/acn3.537

Source DB:  PubMed          Journal:  Ann Clin Transl Neurol        ISSN: 2328-9503            Impact factor:   5.430


Introduction

Late‐onset Alzheimer's disease (LOAD) is the most common form of dementia in older adults, it lacks an effective treatment and represents an enormous societal burden. The disease is characterized by progressive deterioration of memory and cognitive functions, leading to loss of autonomy and ultimately requiring full‐time medical care. Pathologically, LOAD is defined by severe neuronal loss, aggregation of amyloid β (Aβ) in extracellular senile plaques, and formation of intra‐neuronal neurofibrillary tangles consisting of hyper‐phosphorylated tau protein. Over the past decade, genetic research in LOAD has been dominated by genome‐wide association studies (GWAS), a hypothesis‐free scan of the genome using dense genotyping arrays based on common variants (single nucleotide polymorphisms, SNPs). Several genes within LOAD susceptibility loci cluster in specific pathways,1, 2, 3, 4, 5, 6 including amyloid processing, oxidative stress and immune or inflammatory pathways. Collectively, GWAS demonstrates that apart from the strongest risk factor, APOE‐ε4 a large number of loci with modest effect size also contribute to LOAD risk. Common variants identified through GWAS may not have functional consequences, simply reflecting linkage disequilibrium with the unobserved causal variants. It is also possible that these causal variants are rare and have large effects, such as TREM2, 7, 8, 9, 10, 11, 12, 13 and are not covered by commercially available GWAS platforms. In fact, putatively damaging variants have already been identified (for example TREM2, SORL1, and ABCA7) in some of these LOAD susceptibility loci, advancing our understanding of disease risk.14, 15, 16 Whole genome sequencing (WGS) provides a comprehensive and detailed investigation of human genetic variation allowing interrogation of coding and noncoding regions of the genome. Increasingly, WGS studies have provided the strongest evidence that rare genetic variants can have large cumulative effects on human diseases.17, 18, 19, 20 Family‐based studies represent an implicit enrichment strategy for identifying rare variants.21 Transmission of variants from parents to offspring maximizes the chance that multiple copies of rare variants exist in the pedigree. Moreover, compared with analyses using unrelated samples, sequencing in families provides data concerning identical‐by‐descent or sharing among relatives, greatly reducing false positives and permitting detection of sequencing errors, while facilitating the identification of alleles that cause genetic disorders.22, 23 In the current report, we focused on Caribbean Hispanic families multiply affected by LOAD. The frequency of LOAD among multiplex families from the Dominican Republic is known to be significantly higher (5‐fold) than expected for similarly aged individuals in a non‐Hispanic white population from the United States.24 Furthermore, we found that a moderate degree of inbreeding was present and a predictor of LOAD risk in this population.25 As part of the national Alzheimer's Disease Sequencing Project (ADSP), we identified several chromosomal regions with strong evidence for linkage in Caribbean Hispanic families with multiple LOAD cases.26 In the present study, we used WGS data (Data S1) from 67 families as discovery to detect rare variants in previously identified linkage regions and in previously reported LOAD candidate genes. Genotyping in additional WGS on additional 47 Caucasian families were used to replicate the findings from the CH families.

Material and Methods

Family characteristics

All participants (Table 1) were recruited after providing informed consent and with approval by the relevant institutional review boards both in the United States and in the Dominican Republic. Patients, unaffected family members and healthy unrelated controls were required to have had standard neuropsychological tests and neurological examinations to verify their clinical status and diagnosis. Most individuals have been evaluated on multiple occasions over the past 10 years. Families in which patients had known early‐onset disease mutations in APP, PSEN1, PSEN2, GRN, or MAPT were excluded from this analysis to increase power of discovery variants predisposing to LOAD. All selected probands came from families with three or more affected individuals recruited in the Dominican Republic and New York. Recruitment for this family study began in 1998, and was restricted to Caribbean Hispanics,27, 28 predominately from the Dominican Republic. As a part of the ADSP, a set of 67 CH families and 47 Caucasian families were selected for whole genome sequencing from approximately 1400 families reviewed. Selection was based on the number of affected individuals, the number of generations affected, age at onset of clinical symptoms, and presence of APOEε4 alleles.29
Table 1

Demographics of the Caribbean‐Hispanic families and case–control cohorts used in WGS and validation genotyping

Pedigrees Sequenced in ADSP discovery
Number of pedigrees67
Total number of subjects sampled860
Average subjects sampled per pedigree12.84 ± 7.28
Total number of subjects sequenced351
Average samples sequenced in each pedigree5.24 ± 1.67
Affected302
Unaffected49
Average affected per pedigree7.42 ± 3.61
Age (sequenced individuals)73.02 ± 10.0
Women (%)58.72
APOE17.03%
Additional pedigrees used in validation genotyping
Number of pedigrees48
Total number of subjects sampled404
Average subjects sampled per pedigree8.41 ± 4.8
Average affected per pedigree5.08 ± 2.3
Age70.76 ± 10.11
Women (%)65
APOE29.90%
Unrelated case–control set
Total number of subjects sampled450
No of affected individuals152
Age (affecteds)85.5 ± 6.5
Age79.3 ± 6.7
Women (%)68.7
APOE12.80%
Demographics of the Caribbean‐Hispanic families and case–control cohorts used in WGS and validation genotyping

Whole Genome Sequence analysis of GWAS linkage peaks

Details about WGS methodology and variant calling pipelines can be found in Data S1. Three regions demonstrating linkage previously identified26 using GWAS data were prioritized for WGS data analysis. Results from genome‐wide linkage analysis revealed a strong linkage signal (HLOD = 3.6) on chromosomal region 3q29.26 We previously reported microsatellite linkage analyses in this region as strongly associated with LOAD in Caribbean Hispanics,30 including a single family that achieved a LOD score of 1.28. WGS data was generated for four of the family members with LOAD. Analysis of the WGS data was limited to an approximate 2 Mb region encompassing the 1‐LOD interval around linkage peak (3q29: 197,052,973 bp–199,212,658 bp). We also selected two chromosomal regions where we observed genome‐wide significant linkage and association signals across the families: 7p14.3 and 11q12.3.26 Annotation of the identified variants in these linkage regions was performed using the ANNOVAR software.31

Whole genome sequence variants prioritization

Consensus linkage regions

We followed a principled filtering strategy (Fig. 1, Table S4) to test the segregation of rare variants with LOAD status in the three linkage regions: 7p14.3 and 11q12.3 and 3q29 (Table S6). The first criterion used to prioritize the identified variants was based on familial segregation of cases. We required that at least 75% of the patients with LOAD and WGS data in the families were carriers. Additionally we also prioritized variants that were observed in at least five affected individuals in multiple Caribbean Hispanic families.
Figure 1

Filtering strategy to prioritize variants for follow‐up analyses in the validation sample.

Filtering strategy to prioritize variants for follow‐up analyses in the validation sample. We filtered variants with minor allele frequency (MAF) of less than 0.10 in the ADSP families and with an MAF of 0.02 in Caucasian and African American populations in ExAC database (for exonic variants)32 and 1000 genomes project (for intronic and intergenic variants)33 to restrict our analyses to rare variants in the population. Functional annotations were also used as filtering strategy, that is, variants were analyzed in order of increasing priority: exonic, inter‐genic, intronic and others (non‐coding RNA, 3′/5′ UTR, upstream/downstream gene). We also included any intronic or intergenic variants with a Combined Annotation Dependent Depletion (CADD)34 score of greater than 30 to include highly conserved and putative non‐exonic regulatory variants in the analyses. Further criteria for variant prioritization included, additional LOAD patients from different families carrying the variant and that the variant was either not present or present in very low frequency in the non‐demented family members.

Follow‐up genotyping validation

To confirm putative variants that segregated in 67 discovery families and candidate GWAS loci, we genotyped all of the family members in the discovery families, an additional 48 families and unrelated, unaffected controls of the same ancestry and age (Table 1). The validation cohort consisted of Caribbean Hispanic individuals from 115 multiplex families and 450 unrelated cases and controls from the same ethnic background (Table 1). Population allele frequencies for novel variants were estimated from unaffected persons in the Caribbean Hispanic population from samples/families that we have amassed over 15 years. Genotypes were generated using the KASPTM genotyping technology which uses allele‐specific polymerase chain reaction for accurate calling of single nucleotide variants (SNVs) and indels.35

Single variant association analyses

SNV genetic associations were restricted to individuals 60 years of age or older and were tested using General Linear Models in generalized estimating equations (GEE) to adjust for familial correlation in the data. The family ID was used as a repeat measure to account for correlation in the genotype data within family members. All analyses were adjusted for sex, age at last evaluation, education, APOE‐ε4 genotype and the first ten principal components.

LOAD GWAS and candidate loci

We tested genes associated with LOAD in large GWAS studies with multiple replication reports. We also included known early‐onset AD genes and genes implicated in earlier sequencing efforts in LOAD.1, 2, 3, 4, 5, 6, 8, 9, 10, 11, 12, 13, 14, 15, 16 Candidate genes evaluated included: APP, PSEN1, PSEN2, GRN, MAPT, TREM2, PLD3, APOE, ABCA7, SORL1, CR1, BIN1, CD2AP, EPHA1, CLU, MS4A6A, PICALM, CD33, HLA‐DRB5, HLA‐DRB1, PTK2B, SLC24A4, RIN3, INPP5D, MEF2C, NME8, ZCWPW1, CELF1, FERMT2, CASS4, TREML2, and AKAP9.

Gene‐based association analyses

Gene‐based F‐SKAT36 analyses were conducted for genes in the three consensus linkage regions. The F‐SKAT test was restricted to variants defined as exonic and rare (<2% in Europeans, African Americans and overall in the ExAC database). To define a testable‐set, we used two sets of variants in the gene‐based test based on functional annotation: (1) Stop‐loss, stop gain and missense damaging variants (as defined by SIFT37 or PolyPhen38) and (2) including all non‐synonymous variants. We also tested gene‐sets that included both loss of function variants and variants that had a high CADD score (>30). The CADD score filter captures putatively non‐functional variants in the non‐coding regions. For most genes (Table S5), we did not observe variants in non‐coding regions with a high CADD score. In addition functional characterization of GWAS loci have indicated that cis‐regulation is a common mechanism underlying these associations.39 The most frequent elements affected are transcriptional enhancers and silencers that regulate transcription through long‐range interactions, typically located more than 1Kb from their target genes. Therefore, to examine the top GWAS signals,4 we have considered a chromosomal region of 1 Mb upstream and 1 Mb downstream the reported GWAS loci. We conducted F‐SKAT analyses of genes within ±1 Mb encompassing the reported LOAD‐GWAS SNPs. Each gene was independently assessed for association with LOAD using F‐SKAT with the modified R code to adjust for family structure. We tested models of association also adjusting for (1) age, sex, population substructure, ancestry proportions for African‐American and native American ancestry (Data S1) and (2) including APOE‐ε4 dosage with other covariates. Analyses were subsequently repeated adjusting for the possible effect of the reported GWAS SNP to assess the additional contribution of rare variants in conferring LOAD risk. We also conducted region‐based FSKAT test combining variants from all the genes in the 7p14.3 and 11q12.3 regions to assess the joint burden of all variants together in the region.

Results

Families

As detailed in Table 1, 67 Caribbean‐Hispanic families underwent WGS. The families consisted of 351 individuals (302 individuals with LOAD and 49 unaffected family members) with average age of 73.02 ± 10 years, 59% were women. Seventeen percent were carriers of APOE‐ε4 allele. On average, approximately five individuals were sequenced within each of the families. For validation genotyping, we genotyped all the members of the 67 families, an additional 48 Hispanic families and an independent elderly case control cohort of the same ethnic background. The additional 48 Hispanic families had similar characteristics to the discovery families (Table 1). These families also had, on average, five affected individuals, but had a higher frequency for the APOE ε4 allele (29.9%).

Linkage region in a single Caribbean‐Hispanic family at 3q29

After applying quality control metrics, we identified a total of 6551 sequence variants under the 3q29 linkage peak. Applying the filters in the prioritization pipeline, we identified 11 rare SNVs: six intronic and five inter‐genic variants. The 11 identified SNVs were genotyped in 1,720 Caribbean Hispanic unrelated individuals. The strongest association with LOAD (β = 0.83, SE = 0.45, P = 0.064) was found for an intronic variant rs186972238 in the LRCH3 gene. A complex pattern of inheritance with incomplete penetrance emerged when the variant segregation pattern was evaluated within each of these families.

Locus 7q14.3

A total of 35,376 high quality SNVs were observed in 67 families from the WGS experiment and subsequent QC, of which 26,654 SNVs were observed a frequency of <0.10 in the ADSP families. Applying the filtering criteria described above (Fig. 1), we observed 12 non‐synonymous and one synonymous mutation that were observed in at least two affected individuals and absent in unaffected members (Table S1).

Locus 11q12.3

A total of 19,106 SNVs were observed in the 11q12.3 linkage regions in the 67 ADSP families of which 149 SNVs were putatively damaging (stop‐loss, stop‐gain or damaging classified by SIFT or PolyPhen) and novel or rare (Fig. 1). Forty variants were observed to be perfectly or imperfectly segregating in at least one family. Remarkably, p. V707I in TMEM132A was observed in two families with complete segregation in one family. (Table S2)

Analyses of loci under linkage peaks

We selected 14 variants under the linkage peaks (10 from 11q12.3 and 4 from 7q14.3) for follow‐up genotyping in additional family members and unrelated controls. We chose variants that fulfilled at least one of the following criteria: (1) variants observed in two or more Caribbean Hispanic families in at least five affected LOAD individuals, and further prioritized if observed in affected individuals in Caucasian families or showing segregation with LOAD and (2) variants observed in at least four affected LOAD individuals and no unaffected individuals from one Caribbean Hispanic family with CADD score >15. We tested SNV association with LOAD using a GEE model to adjust for familial correlation. We found nominally significant association (P < 0.05) for two SNVs each in MYRF and ASRGL1, respectively, (Table 2).
Table 2

Results from follow‐up genotyping

NO APOE‐ε4a APOE‐ε4b
SNVBETASEPBETASEPGENE
1‐207680070‐C‐T0.130.215.2E‐010.150.214.6E‐01CR1
1‐207741237‐C‐T1.610.991.0E‐011.600.969.6E‐02CR1
7‐29035428‐C‐T0.820.446.3E‐020.800.436.2E‐02CPVL
7‐30876316‐C‐T−0.580.421.7E‐01−0.500.412.3E‐01FAM188B
7‐31682771‐C‐T0.871.254.9E‐010.861.264.9E‐01CCDC129
7‐91630531‐C‐T 1.75 0.86 4.1E‐02 1.84 0.88 3.8E‐02 AKAP9
7‐91667736‐A‐G0.350.353.2E‐010.350.383.5E‐01AKAP9
7‐143091417‐C‐T0.180.688.0E‐010.190.677.7E‐01EPHA1
11‐60703423‐G‐A0.060.458.9E‐010.000.431.0E+00TMEM132A
11‐61015721‐G‐A0.930.548.5E‐020.810.541.3E‐01PGA5
11‐61120560‐G‐A‐0.080.588.9E‐01‐0.070.579.0E‐01CYB561A3
11‐61250246‐G‐A0.100.508.4E‐01‐0.020.479.6E‐01PPP1R32
11‐61546888‐G‐A ‐0.55 0.26 3.8E‐02 ‐0.49 0.26 5.9E‐02 MYRF
11‐62286666‐C‐T0.720.733.2E‐010.750.753.2E‐01AHNAK
11‐62343562‐G‐C 1.09 0.50 2.8E‐02 0.96 0.51 5.9E‐02 ASRGL1
11‐62344743‐C‐T0.720.733.2E‐010.750.753.2E‐01MIR3654
11‐62400108‐G‐A‐0.310.364.0E‐01‐0.260.364.7E‐01GANAB
11‐62886345‐G‐A0.210.546.9E‐010.240.546.6E‐01SLC3A2
14‐93142861‐T‐C0.290.425.0E‐010.310.424.6E‐01RIN3
19‐1057919‐C‐T0.620.703.8E‐010.580.673.9E‐01ABCA7

Model‐ LOAD ~ Gene + Age + Sex + proportion of African ancestry + proportion of Native American Ancestry + First ten principal components.

Model‐ LOAD ~ Gene + Age + Sex + number of APOE ε4 alleles + proportion of African ancestry + proportion of Native American Ancestry + First ten principal components. (nominally significant variants are highlighted in bold).

Results from follow‐up genotyping Model‐ LOAD ~ Gene + Age + Sex + proportion of African ancestry + proportion of Native American Ancestry + First ten principal components. Model‐ LOAD ~ Gene + Age + Sex + number of APOE ε4 alleles + proportion of African ancestry + proportion of Native American Ancestry + First ten principal components. (nominally significant variants are highlighted in bold).

Segregation of variants in established GWAS loci

We evaluated the segregation with LOAD of the 147 observed rare functional variants in 30 candidate genes implicated in LOAD by GWAS,4, 5 next‐generation sequencing or associated with early‐onset Alzheimer's disease. Sixteen of these variants were segregating in at least one family (Table S3). Remarkably, five missense variants in AKAP9 segregated with LOAD status in one or more families, p.R434W (Figs. 2 and 3, Tables S9 and S10) and p.I1448V (Fig. S1) were subsequently genotyped in the all members of these families, and in an additional 48 families and an independent case–control set to determine complete segregation and test association with LOAD. We also genotyped two missense variants in CR1 and one variant each, from EPHA1, RIN3, and ABCA7 (Table 2) and used GEE adjusting for familial correlation. p.R434W was nominally associated with LOAD adjusting for age and sex (OR = 5.77; 95% confidence interval [CI] 1.07–30.9), P = 0.041) and APOE‐ε4 (OR = 6.3; CI: 1.11–35.54, P = 0.038). p.R434W also segregated with LOAD status in two large families (Fig. 2). p.R434W was predicted to be damaging by SIFT and Polyphen and had a highly deleterious CADD score of 32. Previously implicated variants in AKAP9 40 (rs144662445 and rs149979685) were each observed in one LOAD patient.
Figure 2

Segregation pattern of p.R434W in family 285 (Refer to Table S8 for conversion for EFIGA to ADSP ids).

Figure 3

Segregation pattern of p.R434W in family 348 (Refer to Table S9 for conversion for EFIGA to ADSP ids).

Segregation pattern of p.R434W in family 285 (Refer to Table S8 for conversion for EFIGA to ADSP ids). Segregation pattern of p.R434W in family 348 (Refer to Table S9 for conversion for EFIGA to ADSP ids).

F‐SKAT gene based tests

Nominal association was found for CR1 (P = 0.049), SLC24A4 (P = 0.040) and BIN1 (P = 0.0098). CR1 and SLC24A4 were also significant in replication study of the Caucasian families (P = 0.040 and 0.002, respectively). (Table 3). However, only rare variants in BIN1 (P = 0.026) remained significant after adjusting for the significant GWAS‐SNP that was previously associated with LOAD (Table 4).
Table 3

F‐SKAT analyses of AD candidate GWAS loci

GENEN_LOFNo APOEa APOEb
ABCA7120.66550.7486
AKAP990.66590.5895
BIN1 3 0.0098 0.0139
CASS420.38140.3871
CIT60.47670.5128
CR1 9 0.0490 0.0830
DSG250.46230.3587
EPHA130.81590.8659
FRMD4A20.18070.1418
INPP5D40.53280.6067
MAPT50.71630.7605
PSEN120.59140.7302
RIN350.83670.8526
SLC24A4 3 0.0433 0.0505
SORL130.42850.4552
TREM220.98460.9619

Model‐ LOAD ~ Gene + Age + Sex + proportion of African ancestry + proportion of Native American Ancestry + First ten principal components.

Model‐ LOAD ~ Gene + Age + Sex + number of APOE ε4 alleles + proportion of African ancestry + proportion of Native American Ancestry + First ten principal components.

Table 4

Results from FSKAT gene‐based analyses of all the genes within 1 MB of the AD loci implicated in the Lambert et al. report

GENEChrNO ADJUSTMENT FOR IGAP SNPa ADJUSTING FOR IGAP SNPb Top IGAP SNPNearest Gene
N_NSNO APOEAPOEN_NSNO APOEAPOE
MSH5651.77E‐030.0024647451.81E‐032.61E‐03rs9271192HLA‐DRB5
HLA‐DQA2637.37E‐036.50E‐0336.64E‐036.01E‐03rs9271192HLA‐DRB5
CYP3A43765.69E‐034.09E‐0366.92E‐035.09E‐05rs1476679ZCWPW1
TAF6743.97E‐023.11E‐0241.44E‐021.67E‐04rs1476679ZCWPW1
MAP3K2231.56E‐021.48E‐0231.46E‐021.28E‐02rs6733839BIN1
ZSCAN21728.40E‐021.06E‐0121.80E‐024.10E‐03rs1476679ZCWPW1
BIN1242.77E‐023.36E‐0242.59E‐022.88E‐02rs6733839BIN1
CHGA1443.39E‐023.31E‐0243.46E‐023.56E‐02rs10498633SLC24A4
NAT16727.30E‐028.31E‐0224.55E‐021.05E‐01rs1476679ZCWPW1
LGMN1445.24E‐022.74E‐0244.70E‐022.29E‐02rs10498633SLC24A4
TNXB6254.79E‐025.19E‐02254.91E‐025.38E‐02rs9271192HLA‐DRB5
TFR2761.04E‐018.82E‐0261.12E‐011.14E‐02rs1476679ZCWPW1

Model‐ LOAD ~ Gene + Age + Sex + proportion of African ancestry + proportion of Native American Ancestry + First ten principal components (without and without APOE ε4 adjustment).

Model‐ LOAD ~ Gene + Age + Sex + number of APOE ε4 alleles + proportion of African ancestry + proportion of Native American Ancestry + First ten principal components + Corresponding IGAP SNP (without and without APOE ε4 adjustment).

F‐SKAT analyses of AD candidate GWAS loci Model‐ LOAD ~ Gene + Age + Sex + proportion of African ancestry + proportion of Native American Ancestry + First ten principal components. Model‐ LOAD ~ Gene + Age + Sex + number of APOE ε4 alleles + proportion of African ancestry + proportion of Native American Ancestry + First ten principal components. Results from FSKAT gene‐based analyses of all the genes within 1 MB of the AD loci implicated in the Lambert et al. report Model‐ LOAD ~ Gene + Age + Sex + proportion of African ancestry + proportion of Native American Ancestry + First ten principal components (without and without APOE ε4 adjustment). Model‐ LOAD ~ Gene + Age + Sex + number of APOE ε4 alleles + proportion of African ancestry + proportion of Native American Ancestry + First ten principal components + Corresponding IGAP SNP (without and without APOE ε4 adjustment). As described in Table 4, the strongest LOAD associations after adjusting for the effect of the corresponding common variants were found in genes located in three loci: HLA‐DRB5, ZCWPW1 and BIN1. MSH5 (P = 0.0026) and HLA‐DQA2 (P = 0.006) located in HLA‐DRB5 region, and CYP3A43 gene (5.09E‐05) located in ZCWPW1 region were significantly associated with LOAD. LGMN in the SLC24A4 region was modestly associated with LOAD in a model adjusted for APOE dosage (P = 0.033).

Discussion

A variant in AKAP9, (p.R434W), a gene previously associated with LOAD in a case–control study among African Americans,40 segregated in two large families and was nominally associated with LOAD, with fivefold increased risk adjusted for age, sex, and APOE‐ε4. The two different variants in AKAP9 were previously identified, were considered rare in populations African‐descent, and were not present in European‐descent or East Asian‐descent individuals in the 1000 Genomes database. AKAP9 is located on chromosome 7q21–22, and it encodes a member of the A kinase anchoring protein (AKAP) family. The A‐kinase anchor proteins (AKAPs) are structurally diverse proteins that bind to the regulatory subunits of protein kinase A (PKA), confining the holoenzyme to discrete locations within the cell. Alternate splicing of this gene results in at least two isoforms that localize to the centrosome and the Golgi apparatus, and interact with numerous signaling proteins from multiple signal transduction pathways.41 AKAP9 is also expressed in the cerebral cortex, hippocampus, and cerebellum, and is involved in the cytoskeletal attachment of NMDA receptors, important for controlling synaptic plasticity and memory function.42 In addition, missense mutations in Myelin Gene Regulatory Factor (MYRF) and Asparaginase‐Like Protein 1 (ASRGL1) genes under linkage peak on chromosome 11q12.3 segregated completely in one LOAD family and tested nominally significant (P < 0.05) in association with LOAD in follow‐up genotyping. The MYRF gene encodes a transcription factor that is required for central nervous system myelination and may regulate oligodendrocyte differentiation. It increases expression of genes that affect myelin production. Mutant huntingtin mice show progressive neurological symptoms and early death, as well as age‐dependent demyelination and reduced expression of myelin genes that are downstream of MYRF.43A missense mutation in ASRGL1 has been associated with autosomal recessive retinal degeneration,44 but it has not been previously related to Alzheimer's disease or neurodegenerative disorders. In patients with retinal degeneration, variants in ASRGL1 are thought to be the cause of protein misfolding, intracellular mis‐routing and aggregation of misfolded proteins. It is expressed in brain, specifically in astrocytes and oligodendrocytes.45 Examination of known LOAD genes (e.g., from large GWAS‐based meta‐analyses) confirmed the role of rare functional variation in CR1 (P = 0.049), BIN1 (P = 0.0098), and SLC24A4 (P = 0.040). CR1 encodes a transmembrane glycoprotein in the innate immune system. It is a receptor for the complement components C3b and C4b. It has been consistently identified as a risk factor in Alzheimer's disease.3, 4, 46 BIN1 encodes the bridging integrator 1 gene, which has also been consistently associated with Alzheimer's disease in GWAS studies.4, 47 BIN1 is present in neurons, oligodendrocytes and microglia, and its primary role is thought to be in endocytosis and trafficking in neurons and immune response in glia cells.48 SLC24A4 is a solute carrier that has been associated with pigmentation traits in European populations. However, more relevant to LOAD is its association with blood pressure in African Americans, as LOAD may be influenced by vascular disease.49 Of the genes identified within the 2 Mb region surrounding the previously identified GWAS loci from the IGAP study,1, 2, 3, 4, 5, 6 several were nominally significant. CYP3A43 is a member of the cytochrome P450 superfamily of enzymes. The cytochrome P450 proteins catalyze many reactions involved in drug metabolism and synthesis of cholesterol, steroids and other lipids, and are part of a cluster of cytochrome P450 genes on chromosome 7q21.1. MSH5 encodes a member of the mutS family of proteins that are involved in DNA mismatch repair or meiotic recombination processes. Women with premature ovarian failure were found to carry a mutation in MSH5 suggesting a role in meiotic recombination. Genetic variants in a gene within the major histocompatibility complex, HLA‐DQA2, were associated with LOAD in this study. However, this gene has low expression in brain and has been associated with multiple inflammatory disorders. Family‐based WGS on previously localized linkage regions remains a particularly powerful strategy for causal variant identification. Several new disease susceptibility genes have been successfully identified using linkage analysis coupled with WGS, in complex phenotype disorders such as hearing impairment,50 familial goiters,51 and familial hypertension.52 Taken together the results here imply that rare variants in multiple genes are likely to increase the risk of LOAD. Large families multiply affected by LOAD are extremely helpful in identifying novel rare variants even in genes previously investigated by other means. For example, loci identified by genome wide array studies have pointed to a large number of genes many of which have rare variants suggesting that these loci were in linkage disequilibrium with one or more of the mutations. How variants in these multiple genes lead to a common phenotype of LOAD needs to be investigated. However, investigations of gene functions may point to potential targets for new treatments or preventive measures.

Conflict of Interest

The authors have no conflicts of interest to disclose. Data S1. Methods. Figure S1. Details of carriers of pV707I variant in TMEM132A in Caribbean Hispanic family 860. Click here for additional data file. Table S1. WGS rare exonic variants segrgating with LOAD in chromosome 7q14.3 linkage region. Table S2. Interesting variants segregatating with affection status in Chr 11 Linkage region. Table S3. Rare missense variants in candidate LOAD genes segregating with AD in Caribbean Hispanic families. Table S4. Variant count across the chromosomes and linkage regions at each filtering step (corresponding to Fig. 1). Table S5. Gene‐based test for variants sets that including variants with CADD>30. Table S6. Caribbean Hispanic families contributing to the linkage signals on chromosomes 7 and 11. Table S7. Details of carriers of pv707I variant in TMEM132A. Table S8. For family 284, individual and family IDs are displayed as EFIGSA IDs. Table S9. For familiy 348, individual and family IDs are displayed as EFIGA IDs. Click here for additional data file.
  51 in total

Review 1.  Beyond GWASs: illuminating the dark road from association to function.

Authors:  Stacey L Edwards; Jonathan Beesley; Juliet D French; Alison M Dunning
Journal:  Am J Hum Genet       Date:  2013-11-07       Impact factor: 11.025

2.  Expanded genomewide scan implicates a novel locus at 3q28 among Caribbean hispanics with familial Alzheimer disease.

Authors:  Joseph H Lee; Rong Cheng; Vincent Santana; Jennifer Williamson; Rafael Lantigua; Martin Medrano; Alex Arriaga; Yaakov Stern; Benjamin Tycko; Ekaterina Rogaeva; Yosuke Wakutani; Toshitaka Kawarai; Peter St George-Hyslop; Richard Mayeux
Journal:  Arch Neurol       Date:  2006-11

3.  Variant of TREM2 associated with the risk of Alzheimer's disease.

Authors:  Thorlakur Jonsson; Hreinn Stefansson; Stacy Steinberg; Ingileif Jonsdottir; Palmi V Jonsson; Jon Snaedal; Sigurbjorn Bjornsson; Johanna Huttenlocher; Allan I Levey; James J Lah; Dan Rujescu; Harald Hampel; Ina Giegling; Ole A Andreassen; Knut Engedal; Ingun Ulstein; Srdjan Djurovic; Carla Ibrahim-Verbaas; Albert Hofman; M Arfan Ikram; Cornelia M van Duijn; Unnur Thorsteinsdottir; Augustine Kong; Kari Stefansson
Journal:  N Engl J Med       Date:  2012-11-14       Impact factor: 91.245

4.  Common variants at MS4A4/MS4A6E, CD2AP, CD33 and EPHA1 are associated with late-onset Alzheimer's disease.

Authors:  Adam C Naj; Gyungah Jun; Gary W Beecham; Li-San Wang; Badri Narayan Vardarajan; Jacqueline Buros; Paul J Gallins; Joseph D Buxbaum; Gail P Jarvik; Paul K Crane; Eric B Larson; Thomas D Bird; Bradley F Boeve; Neill R Graff-Radford; Philip L De Jager; Denis Evans; Julie A Schneider; Minerva M Carrasquillo; Nilufer Ertekin-Taner; Steven G Younkin; Carlos Cruchaga; John S K Kauwe; Petra Nowotny; Patricia Kramer; John Hardy; Matthew J Huentelman; Amanda J Myers; Michael M Barmada; F Yesim Demirci; Clinton T Baldwin; Robert C Green; Ekaterina Rogaeva; Peter St George-Hyslop; Steven E Arnold; Robert Barber; Thomas Beach; Eileen H Bigio; James D Bowen; Adam Boxer; James R Burke; Nigel J Cairns; Chris S Carlson; Regina M Carney; Steven L Carroll; Helena C Chui; David G Clark; Jason Corneveaux; Carl W Cotman; Jeffrey L Cummings; Charles DeCarli; Steven T DeKosky; Ramon Diaz-Arrastia; Malcolm Dick; Dennis W Dickson; William G Ellis; Kelley M Faber; Kenneth B Fallon; Martin R Farlow; Steven Ferris; Matthew P Frosch; Douglas R Galasko; Mary Ganguli; Marla Gearing; Daniel H Geschwind; Bernardino Ghetti; John R Gilbert; Sid Gilman; Bruno Giordani; Jonathan D Glass; John H Growdon; Ronald L Hamilton; Lindy E Harrell; Elizabeth Head; Lawrence S Honig; Christine M Hulette; Bradley T Hyman; Gregory A Jicha; Lee-Way Jin; Nancy Johnson; Jason Karlawish; Anna Karydas; Jeffrey A Kaye; Ronald Kim; Edward H Koo; Neil W Kowall; James J Lah; Allan I Levey; Andrew P Lieberman; Oscar L Lopez; Wendy J Mack; Daniel C Marson; Frank Martiniuk; Deborah C Mash; Eliezer Masliah; Wayne C McCormick; Susan M McCurry; Andrew N McDavid; Ann C McKee; Marsel Mesulam; Bruce L Miller; Carol A Miller; Joshua W Miller; Joseph E Parisi; Daniel P Perl; Elaine Peskind; Ronald C Petersen; Wayne W Poon; Joseph F Quinn; Ruchita A Rajbhandary; Murray Raskind; Barry Reisberg; John M Ringman; Erik D Roberson; Roger N Rosenberg; Mary Sano; Lon S Schneider; William Seeley; Michael L Shelanski; Michael A Slifer; Charles D Smith; Joshua A Sonnen; Salvatore Spina; Robert A Stern; Rudolph E Tanzi; John Q Trojanowski; Juan C Troncoso; Vivianna M Van Deerlin; Harry V Vinters; Jean Paul Vonsattel; Sandra Weintraub; Kathleen A Welsh-Bohmer; Jennifer Williamson; Randall L Woltjer; Laura B Cantwell; Beth A Dombroski; Duane Beekly; Kathryn L Lunetta; Eden R Martin; M Ilyas Kamboh; Andrew J Saykin; Eric M Reiman; David A Bennett; John C Morris; Thomas J Montine; Alison M Goate; Deborah Blacker; Debby W Tsuang; Hakon Hakonarson; Walter A Kukull; Tatiana M Foroud; Jonathan L Haines; Richard Mayeux; Margaret A Pericak-Vance; Lindsay A Farrer; Gerard D Schellenberg
Journal:  Nat Genet       Date:  2011-04-03       Impact factor: 38.330

5.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

Review 6.  A-kinase anchoring proteins: from protein complexes to physiology and disease.

Authors:  Graeme K Carnegie; Christopher K Means; John D Scott
Journal:  IUBMB Life       Date:  2009-04       Impact factor: 3.885

7.  TREM2 variants in Alzheimer's disease.

Authors:  Rita Guerreiro; Aleksandra Wojtas; Jose Bras; Minerva Carrasquillo; Ekaterina Rogaeva; Elisa Majounie; Carlos Cruchaga; Celeste Sassi; John S K Kauwe; Steven Younkin; Lilinaz Hazrati; John Collinge; Jennifer Pocock; Tammaryn Lashley; Julie Williams; Jean-Charles Lambert; Philippe Amouyel; Alison Goate; Rosa Rademakers; Kevin Morgan; John Powell; Peter St George-Hyslop; Andrew Singleton; John Hardy
Journal:  N Engl J Med       Date:  2012-11-14       Impact factor: 91.245

8.  Variants with large effects on blood lipids and the role of cholesterol and triglycerides in coronary disease.

Authors:  Anna Helgadottir; Solveig Gretarsdottir; Gudmar Thorleifsson; Eirikur Hjartarson; Asgeir Sigurdsson; Audur Magnusdottir; Aslaug Jonasdottir; Helgi Kristjansson; Patrick Sulem; Asmundur Oddsson; Gardar Sveinbjornsson; Valgerdur Steinthorsdottir; Thorunn Rafnar; Gisli Masson; Ingileif Jonsdottir; Isleifur Olafsson; Gudmundur I Eyjolfsson; Olof Sigurdardottir; Maryam S Daneshpour; Davood Khalili; Fereidoun Azizi; Dorine W Swinkels; Lambertus Kiemeney; Arshed A Quyyumi; Allan I Levey; Riyaz S Patel; Salim S Hayek; Ingibjorg J Gudmundsdottir; Gudmundur Thorgeirsson; Unnur Thorsteinsdottir; Daniel F Gudbjartsson; Hilma Holm; Kari Stefansson
Journal:  Nat Genet       Date:  2016-05-02       Impact factor: 41.307

9.  Genome-wide association study of Alzheimer's disease.

Authors:  M I Kamboh; F Y Demirci; X Wang; R L Minster; M M Carrasquillo; V S Pankratz; S G Younkin; A J Saykin; G Jun; C Baldwin; M W Logue; J Buros; L Farrer; M A Pericak-Vance; J L Haines; R A Sweet; M Ganguli; E Feingold; S T Dekosky; O L Lopez; M M Barmada
Journal:  Transl Psychiatry       Date:  2012-05-15       Impact factor: 6.222

Review 10.  Late-Onset Alzheimer's Disease Genes and the Potentially Implicated Pathways.

Authors:  Samantha L Rosenthal; M Ilyas Kamboh
Journal:  Curr Genet Med Rep       Date:  2014-03-22
View more
  15 in total

1.  Genetic architecture of RNA editing regulation in Alzheimer's disease across diverse ancestral populations.

Authors:  Olivia K Gardner; Derek Van Booven; Lily Wang; Tianjie Gu; Natalia K Hofmann; Patrice L Whitehead; Karen Nuytemans; Kara L Hamilton-Nelson; Larry D Adams; Takiyah D Starks; Michael L Cuccaro; Eden R Martin; Jeffery M Vance; William S Bush; Goldie S Byrd; Jonathan L Haines; Gary W Beecham; Margaret A Pericak-Vance; Anthony J Griswold
Journal:  Hum Mol Genet       Date:  2022-08-25       Impact factor: 5.121

2.  Progranulin mutations in clinical and neuropathological Alzheimer's disease.

Authors:  Badri N Vardarajan; Dolly Reyes-Dumeyer; Angel L Piriz; Rafael A Lantigua; Martin Medrano; Diones Rivera; Ivonne Z Jiménez-Velázquez; Eden Martin; Margaret A Pericak-Vance; William Bush; Lindsay Farrer; Jonathan L Haines; Li-San Wang; Yuk Yee Leung; Gerard Schellenberg; Walter Kukull; Philip De Jager; David A Bennett; Julie A Schneider; Richard Mayeux
Journal:  Alzheimers Dement       Date:  2022-02-09       Impact factor: 16.655

Review 3.  Genomics of Alzheimer's disease implicates the innate and adaptive immune systems.

Authors:  Yihan Li; Simon M Laws; Luke A Miles; James S Wiley; Xin Huang; Colin L Masters; Ben J Gu
Journal:  Cell Mol Life Sci       Date:  2021-10-27       Impact factor: 9.207

4.  Rare genetic variation implicated in non-Hispanic white families with Alzheimer disease.

Authors:  Gary W Beecham; Badri Vardarajan; Elizabeth Blue; William Bush; James Jaworski; Sandra Barral; Anita DeStefano; Kara Hamilton-Nelson; Brian Kunkle; Eden R Martin; Adam Naj; Farid Rajabli; Christiane Reitz; Timothy Thornton; Cornelia van Duijn; Allison Goate; Sudha Seshadri; Lindsay A Farrer; Eric Boerwinkle; Gerard Schellenberg; Jonathan L Haines; Ellen Wijsman; Richard Mayeux; Margaret A Pericak-Vance
Journal:  Neurol Genet       Date:  2018-11-21

Review 5.  How understudied populations have contributed to our understanding of Alzheimer's disease genetics.

Authors:  Nadia Dehghani; Jose Bras; Rita Guerreiro
Journal:  Brain       Date:  2021-05-07       Impact factor: 13.501

6.  Recent advances in genetic studies of alcohol use disorders.

Authors:  Ishaan Gupta; Rohan Dandavate; Pallavi Gupta; Viplav Agrawal; Manav Kapoor
Journal:  Curr Genet Med Rep       Date:  2020-03-17

Review 7.  The MUC6/AP2A2 Locus and Its Relevance to Alzheimer's Disease: A Review.

Authors:  Peter T Nelson; David W Fardo; Yuriko Katsumata
Journal:  J Neuropathol Exp Neurol       Date:  2020-06-01       Impact factor: 3.685

8.  Ensembl variation resources.

Authors:  Sarah E Hunt; William McLaren; Laurent Gil; Anja Thormann; Helen Schuilenburg; Dan Sheppard; Andrew Parton; Irina M Armean; Stephen J Trevanion; Paul Flicek; Fiona Cunningham
Journal:  Database (Oxford)       Date:  2018-01-01       Impact factor: 3.451

9.  Novel truncation mutations in MYRF cause autosomal dominant high hyperopia mapped to 11p12-q13.3.

Authors:  Xueshan Xiao; Wenmin Sun; Jiamin Ouyang; Shiqiang Li; Xiaoyun Jia; Zhiqun Tan; J Fielding Hejtmancik; Qingjiong Zhang
Journal:  Hum Genet       Date:  2019-06-06       Impact factor: 4.132

Review 10.  Insight into the genetic etiology of Alzheimer's disease: A comprehensive review of the role of rare variants.

Authors:  Julie Hoogmartens; Rita Cacace; Christine Van Broeckhoven
Journal:  Alzheimers Dement (Amst)       Date:  2021-02-20
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.