| Literature DB >> 33797837 |
Dmitry Prokopenko1,2, Sarah L Morgan3,4, Kristina Mullin1, Oliver Hofmann5, Brad Chapman6, Rory Kirchner6, Sandeep Amberkar3, Inken Wohlers7, Christoph Lange8, Winston Hide2,3,4, Lars Bertram7,9, Rudolph E Tanzi1,2.
Abstract
INTRODUCTION: Genome-wide association studies have led to numerous genetic loci associated with Alzheimer's disease (AD). Whole-genome sequencing (WGS) now permits genome-wide analyses to identify rare variants contributing to AD risk.Entities:
Keywords: Alzheimer's disease; LOAD; RVAS; family-based association study; neuronal development; rare variants; synaptic function; whole-genome sequencing
Mesh:
Substances:
Year: 2021 PMID: 33797837 PMCID: PMC8519060 DOI: 10.1002/alz.12319
Source DB: PubMed Journal: Alzheimers Dement ISSN: 1552-5260 Impact factor: 21.566
FIGURE 1Data analysis workflow
FIGURE 2QQ plot of rare (MAF < = 1%) single‐variant association results in the family‐based discovery data set (NIMH and NIA cohorts). The red line corresponds to all statistics, where at least one informative family is observed. The green line corresponds to statistics with at least ten informative families
FIGURE 3Manhattan plot of rare (MAF < = 1%) single‐variant association results in the family‐based discovery data set (NIMH and NIA cohorts). Genes that correspond to replicated variants as described in the workflow (Figure 1) are highlighted
Top single‐variant AD association results
|
|
| |||||
|---|---|---|---|---|---|---|
| Chromosome | 14 | 1 | 7 | 2 | 15 | |
| rsID (additional variants in LD showing the same results) | rs74065194 | rs192471919 | rs112941445 (rs112910644, rs111839960, rs113210961) | rs147918541 | rs141228575 (rs147002962) | |
| Nearest protein‐coding gene | SEL1L | FNBP1L | STK31 | LINC00298 | C15orf41 | |
| Allele frequency, non‐Finnish Europeans, gnomAD v3 | 0.0065 | 0.0054 | 0.0067 | 0.0072 | 0.0069 | |
| Effect allele | T | C | T | A | T | |
| Discovery dataset (NIMH + NIA families) | Z‐score | 3.551 | ‐3.485 | ‐3.461 | ‐1.98 | 2.048 |
| P‐value | 3.84E‐04 | 4.93E‐04 | 5.38E‐04 | 4.78E‐02 | 4.06E‐02 | |
| NINFF | 9 | 6 | 7 | 4 | 7 | |
| Replication dataset NHW ADSP | Z‐score | 2.275 | ‐2.182 | ‐2.698 | ‐3.576 | 3.488 |
| P‐value | 2.29E‐02 | 2.91E‐02 | 6.97E‐03 | 3.49E‐04 | 4.86E‐04 | |
| Sample size | 1669 | 1669 | 1668 | 1669 | 1669 | |
| Meta‐analysis | Effect direction | ++ | ‐‐ | ‐‐ | ‐‐ | ++ |
| Z‐score | 2.529 | ‐2.387 | ‐2.916 | ‐3.668 | 3.613 | |
| P‐value | 1.14E‐02 | 1.70E‐02 | 3.54E‐03 | 2.44E‐04 | 3.03E‐04 | |
| Jansen et al. | Z‐score | 1.134 | ‐2.656 | ‐0.319 | NA | ‐0.919 |
| P‐value | 2.57E‐01 | 7.92E‐03 | 7.50E‐01 | NA | 3.58E‐01 | |
| Kunkle et al. | Z‐score | 1.181 | NA | 1.589 | NA | ‐0.571 |
| P‐value | 2.37E‐01 | NA | 1.12E‐01 | NA | 5.68E‐01 | |
| UCSC | All mRNA | 0 | 13 | 3 | 7 | 6 |
| UCSC/ENCODE | TFBS clusters | 1 | 0 | 0 | 0 | 0 |
| GWAVA | DNase cluster | 0 | 0 | 9 | 0 | 1 |
| Ensembl | TFBS | 0 | 0 | 118 | 0 | 0 |
| GWAVA | GC content | 0.45 | 0.4 | 0.57 | 0.33 | 0.43 |
| Location | Upstream | Intronic | Downstream | Intronic | Intronic | |
| Mayo cohort | Expression in AD temporal cortex | No change | No change | Sig. up | Not tested | No change |
| Illumina bodyMap2 transcriptome | Tissue expression | Ubiquitous expression | Ubiquitous expression | Mostly expressed in testes | Mostly expressed in brain | Mostly expressed in heart |
| 3DSNP | Open chromatin | Fetal heart | Brain cingulate gyrus, liver cells and monocytes | 0 | Digestive tissue | 0 |
| Inferno | Closest enhancers cell type | CL:0000097_mast_cell | . | CL:0000047_neuronal_stem_cell | CL:0000576_monocyte,CL:0000775_neutrophil | CL:0000540_neuron,CL:0002620_skin_fibroblast |
Overlapping/GREAT‐assigned gene did not differ from the nearest gene assignment. For comparison, results from Jansen et al. (2019) and Kunkle et al. (2019) are included. NINFF ‐ number of informative families, TFBS ‐ transcription factor binding site.
a: Single‐variant AD association results with P < 0.0005 (P < 0.0006) in the discovery dataset and consistent (ie, P < 0.05 and same direction of effect) association in the ADSP NHW WGS replication dataset.
b: Single‐variant AD association results with consistent (ie, Pdiscovery < 0.05 and same direction of effect in discovery and replication dataset) association at P < 0.0005 after meta‐analysis.
FIGURE 4Manhattan plot of spatial clustering association results based on rare (MAF < = 1%) variants in the family‐based discovery data set (NIMH and NIA cohorts). Highlighted are genes, which correspond to replicated regions, described in the workflow (Figure 1)
FIGURE 5QQ plot of spatial clustering association results based on rare (MAF < = 1%) variants in the family‐based discovery data set (NIMH and NIA cohorts)
Top spatial clustering–based AD association results
| a | b | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Chromosome | 14 | 11 | 5 | 5 | 14 | 1 | 13 | 2 | 6 | 3 | |
| Position of the first SNV | 61176726 | 74025114 | 61972635 | 112819545 | 61176726 | 197917553 | 101143359 | 79853746 | 158677635 | 140398927 | |
| Position of the last SNV | 61188023 | 74031365 | 61975793 | 112827357 | 61188023 | 197929226 | 101164882 | 79856892 | 158683441 | 140449630 | |
| Nearest protein‐coding gene | PRKCH | C2CD3 | KIF2A | APC | PRKCH | LHX9 | NALCN | CTNNA2 | SYTL3 | CLSTN2 | |
| Discovery dataset (NIMH + NIA families) | Number of SNVs in the region | 53 | 47 | 13 | 25 | 53 | 44 | 106 | 15 | 29 | 233 |
| P‐value | 2.51E‐05 | 8.36E‐05 | 1.69E‐04 | 3.37E‐04 | 2.51E‐05 | 7.62E‐03 | 2.58E‐03 | 1.22E‐05 | 1.77E‐02 | 1.19E‐05 | |
| Replication dataset NHW ADSP | Number of SNVs in the region | 53 | 42 | 16 | 29 | 53 | 39 | 134 | 18 | 28 | 273 |
| P‐value | 2.11E‐02 | 4.54E‐02 | 4.65E‐02 | 4.37E‐02 | 2.11E‐02 | 1.31E‐04 | 4.27E‐04 | 1.54E‐01 | 1.16E‐04 | 1.96E‐01 | |
| Meta‐analysis | P‐value | 8.17E‐06 | 5.12E‐05 | 1.00E‐04 | 1.79E‐04 | 8.17E‐06 | 1.48E‐05 | 1.63E‐05 | 2.67E‐05 | 2.90E‐05 | 3.27E‐05 |
| Location | Upstream | Intronic and exonic | Upstream | Intronic and exonic | Upstream | Intronic and exonic | Intronic and exonic | Intronic | Intronic and exonic | Intronic and exonic | |
| Mayo | Expression in AD temporal cortex | Sig. lower | No change | No change | No change | Sig. lower | Not tested | No change | No change | Not tested | No change |
| Illumina bodyMap2 transcriptome | Tissue expression | Mostly expressed in WBC | Ubiquitous expression | Mostly expressed in WBC and brain | Mostly expressed in brain | Mostly expressed in WBC | Mostly expressed in testes | Mostly expressed in brain | Mostly expressed in brain | Ubiquitous expression | Mostly expressed in ovaries |
| FANTOM | TFBS | 86 | 0 | 0 | 0 | 86 | 429* | 0 | 0 | 0 | 0 |
| FANTOM | Transcription start sites | 6 | 3 | 0 | 5 | 6 | 24 | 2 | 0 | 0 | 9 |
| Ensembl | BindingMotifs | 14 | 0 | 0 | 0 | 14 | 1 | 2 | 1 | 0 | 7 |
| Ensembl | Active enhancer | 10 | 0 | 0 | 10 | 10 | 0 | 26 | 6 | 0 | 32* |
| Ensembl | Open chromatin region active | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 2 |
| Ensembl | Active promoter | 61* | 15 | 0 | 0 | 61* | 104 | 0 | 0 | 0 | 57 |
| Ensembl | Active TFBS | 0 | 0 | 0 | 0 | 0 | 15* | 19* | 0 | 0 | 0 |
| Ensembl | CTCF binding site active | 9 | 2 | 0 | 0 | 9 | 31 | 73* | 19 | 0 | 16 |
| UCSC | Cell‐specific TFBS | 81 | 12 | 4 | 15 | 81 | 117 | 31 | 7 | 6 | 40 |
| UCSC | CpG islands | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 |
| UCSC | DNase cluster | 17 | 8 | 1 | 5 | 17 | 15 | 18 | 3 | 3 | 44 |
The PRKCH region was identified in both arms of the regional study, hence appears in both: a and b. Overlapping/GREAT‐assigned gene did not differ from the nearest gene assignment. SNV ‐ single nucleotide variant, TFBS ‐ transcription factor binding site, WBC ‐ white blood cells. *Significantly more annotations than expected by chance after correcting for multiple testing.
a: Replicated spatial clustering–based AD association results for regions showing P < 0.0005 in the discovery data set.
b: Top spatial clustering–based AD association results based on meta‐analysis (P meta < 5×10‐5 and P discovery < 0.05).
FIGURE 6Network of direct interactions between highly ranked SNV and regional genes and known AD‐associated genes. Direct protein‐protein relationships (blue links) between reference AD genes (red), Table 1 and Table 2 (yellow), Supplementary Table 12 and 15 (blue) protein‐coding genes. LINC00298 co‐regulated expression of directly interacting genes is highlighted (turquoise border). Proteins that are in direct interaction with genes from Tables 1 and 2 have been grouped where possible according to shared GO biological processes (green ellipse). Proteins that may not be directly interacting but are found commonly enriched in immune‐related processes are grouped (pink square). Proteins with dark green–colored borders are enriched in GO:BP nervous system development, whereas a navy blue border is enriched for generation of neurons. Gene‐gene relationships are listed in Supplementary Table 24. The network can be interactively explored via the NDEX project website
FIGURE 7Cell‐specific enrichment results from the EWCE tool. We compared genes identified in our rare‐variant analysis to common variants published in AD and which cell type each is significantly enriched in. Zero represents the mean expression in each cell based on 10,000 permutations of gene lists of the same size. The data for this figure can be found in Supplementary Table 19