| Literature DB >> 28537274 |
Henne Holstege1,2,3, Sven J van der Lee1,4, Marc Hulsman2,3, Tsz Hang Wong5, Jeroen Gj van Rooij5,6, Marjan Weiss2, Eva Louwersheimer1, Frank J Wolters4, Najaf Amin4, André G Uitterlinden4,6,7, Albert Hofman4,8, M Arfan Ikram4,5,9, John C van Swieten1,5, Hanne Meijers-Heijboer2, Wiesje M van der Flier1,10, Marcel Jt Reinders3, Cornelia M van Duijn4,11, Philip Scheltens1.
Abstract
Accumulating evidence suggests that genetic variants in the SORL1 gene are associated with Alzheimer disease (AD), but a strategy to identify which variants are pathogenic is lacking. In a discovery sample of 115 SORL1 variants detected in 1908 Dutch AD cases and controls, we identified the variant characteristics associated with SORL1 variant pathogenicity. Findings were replicated in an independent sample of 103 SORL1 variants detected in 3193 AD cases and controls. In a combined sample of the discovery and replication samples, comprising 181 unique SORL1 variants, we developed a strategy to classify SORL1 variants into five subtypes ranging from pathogenic to benign. We tested this pathogenicity screen in SORL1 variants reported in two independent published studies. SORL1 variant pathogenicity is defined by the Combined Annotation Dependent Depletion (CADD) score and the minor allele frequency (MAF) reported by the Exome Aggregation Consortium (ExAC) database. Variants predicted strongly damaging (CADD score >30), which are extremely rare (ExAC-MAF <1 × 10-5) increased AD risk by 12-fold (95% CI 4.2-34.3; P=5 × 10-9). Protein-truncating SORL1 mutations were all unknown to ExAC and occurred exclusively in AD cases. More common SORL1 variants (ExAC-MAF≥1 × 10-5) were not associated with increased AD risk, even when predicted strongly damaging. Findings were independent of gender and the APOE-ɛ4 allele. High-risk SORL1 variants were observed in a substantial proportion of the AD cases analyzed (2%). Based on their effect size, we propose to consider high-risk SORL1 variants next to variants in APOE, PSEN1, PSEN2 and APP for personalized risk assessments in clinical practice.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28537274 PMCID: PMC5567154 DOI: 10.1038/ejhg.2017.87
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Figure 1Flowchart of SORL1 variant pathogenicity analysis. aSORL1 variants in the independent replication data set were reported by Verheijen et al.[6] Pathogenicity screen was applied to SORL1 variants reported by bVardarajan et al.[4] and cNicolas et al.[5]
Discovery analysis: 115 SORL1 variants in 640 cases and 1268 controls
| MAF <0.01 | |||||
| All variants | 100 | 83 | 140 | 1.2 (0.9–1.6) | 0.23 |
| SIFT/Polyphen damaging | 36 | 40 | 44 | 1.9 (1.2–2.9) | 7.2 × 10−3 |
| CADD 0–20 | 55 | 42 | 94 | 0.9 (0.6–1.3) | 0.59 |
| CADD 20–30 | 26 | 28 | 43 | 1.3 (0.78–2.1) | 0.34 |
| CADD >30 | 19 | 15 | 8 | 4.0 (1.7–9.0) | 9.9 × 10−4 |
| MAF >0.01 (variants detected in >38 subjects in this sample) | |||||
| 0–20 | 13 | 640 | 1268 | 1.0 (1.0–1.1) | 0.54 |
| 20–30 | 1 | 48 | 103 | 1.0 (0.7–1.3) | 0.83 |
| >30 | 1 | 17 | 44 | 0.8 (0.5–1.3) | 0.29 |
| 0.001<MAF <0.01 (variants detected in 3–38 subjects in this sample | |||||
| 0–20 | 7 | 27 | 47 | 1.2 (0.7–1.9) | 0.54 |
| 20–30 | 3 | 18 | 24 | 1.5 (0.8–2.9) | 0.21 |
| >30 | 1 | 1 | 3 | 0.7 (0.1–5.5) | 0.73 |
| 0.0005<MAF <0.001 (variants detected in 2–3 subjects in this sample) | |||||
| 0–20 | 12 | 6 | 21 | 0.7 (0.3–1.4) | 0.33 |
| 20–30 | 5 | 3 | 9 | 0.7 (0.2–2.4) | 0.57 |
| >30 | 2 | 1 | 3 | 0.7 (0.1–5.2) | 0.70 |
| MAF <0.0005 (singletons in this sample) | |||||
| 0–20 | 36 | 9 | 26* | 0.7 (0.4–1.4) | 0.31 |
| 20–30 | 18 | 7 | 11 | 1.3 (0.5–3.3) | 0.65 |
| | |||||
| >30 missense | 11 | 9 | 2 | 8.7 (2.5–30.4) | 7.2 × 10−4 |
| >30 stop/frameshift | 5 | 5 | 0 | 19.8 (3.1–126.8) | 1.7 × 10−3 |
| VPS10 | |||||
| 0–20 | 5 | 1 | 4 | 0.6 (0.1–3.6) | 0.54 |
| | |||||
| >30 | 4 | 3 | 1 | 6.6 (0.8–53.1) | 7.6 × 10−2 |
| LDL A and B | |||||
| 0–20 | 10 | 3 | 7 | 0.9 (0.2–3.2) | 0.81 |
| 20–30 | 4 | 0 | 4 | 0.22 (0.0–1.7) | 0.14 |
| >30 | 8 | 7 | 1 | 10.8 (2.5–46.7) | 1.5 × 10−3 |
| Fibronectin | |||||
| 0–20 | 3 | 0 | 3 | 0.2 (0.0–2.5) | 0.23 |
| 20–30 | 5 | 0 | 5 | 0.2 (0.0–1.4) | 0.11 |
| >30 | 4 | 4 | 0 | 20.5 (2.6–164.5) | 4.5 × 10−3 |
Variants were categorized according to the associated minor allele frequency in the sample and CADD score. For each category, multiple variants per individual were collapsed into one variant such that an odds ratio and an associated P-value could be calculated using the score-based SeqMeta burden test, while using gender as a covariate. LDL-Receptor, low-density lipoprotein receptor A and BOR; odds ratio; VPS10, Vacuolar Protein Sorting domain 10. No subject carried more than one singleton, with the exception of one control subject who carried two intronic singletons with CADD 0–20 (*).
Associations that are significant after multiple testing correction are shown in bold (P<7.1 × 10−4).
Results for the odds ratios can be considered a one-step approximation to the maximum likelihood result, such that ORs of infinite have lower values when controls contribute no variants to the association.
Combined analysis: 181 unique SORL1 variants in 1895 cases and 3206 controls
| 0–10 | 8 | 1,895 | 3,206 | 1.0 | 1.0 |
| 10–20 | 4 | 1,895 | 3,206 | 1.0 | 1.0 |
| 20–30 | 1 | 203 | 300 | 1.2 (1.0–1.4) | 7.3 × 10−2 |
| >30 | 1 | 67 | 162 | 0.7 (0.5–0.9) | 1.0 |
| 0–10 | 5 | 93 | 158 | 1.0 (0.8–1.3) | 0.52 |
| 10–20 | 4 | 68 | 151 | 0.8 (0.6–1.0) | 0.98 |
| 20–30 | 3 | 37 | 40 | 1.6 (1.0–2.5) | 3.2 × 10−2 |
| >30 | 1 | 4 | 7 | 1.0 (0.3–3.3) | 0.63 |
| 0–10 | 5 | 10 | 12 | 1.4 (0.6–3.3) | 0.27 |
| 10–20 | 5 | 9 | 11 | 1.4 (0.6–3.4) | 0.30 |
| 20–30 | 11 | 15 | 21 | 1.2 (0.6–2.4) | 0.34 |
| >30 | 4 | 5 | 4 | 2.1 (0.6–7.9) | 0.21 |
| 0–10 | 5 | 7 | 10 | 1.2 (0.5–3.1) | 0.45 |
| 10–20 | 20 | 9 | 21 | 0.7 (0.3–1.6) | 0.84 |
| 20–30 | 22 | 9 | 23 | 0.7 (0.3–1.4) | 0.90 |
| >30 | 9 | 10 | 2 | 8.5 (1.9–38.8) | 1.4 × 10−3 |
| 0–10 | 7 | 2 | 8 | 0.4 (0.1–2.0) | 0.93 |
| 10–20 | 15 | 9 | 7 | 2.2 (0.8–5.9) | 9.4 × 10−2 |
| 20–30 | 22 | 13 | 11 | 2.0 (0.9–4.5) | 6.6 × 10−2 |
| | |||||
Variants were categorized according to the associated minor allele frequency reported in the ExAC database (ExAC-MAF) and CADD score.
In the replication sample, we were not able to account for the overlap when one subject carried multiple variants with the same MAF.
Associations that are significant after multiple testing correction are shown in bold (P<7.1 × 10−4).
For common variants derived form the replication sample, the variant MAF reported in cases and controls was used to estimate the number of case-carriers and control-carriers.
Figure 2Only the rarest variants with the highest CADD scores are associated with increased AD risk. The 181 SORL1 variants detected in 5101 AD cases and controls from the combined analysis were first separated by their ExAC-MAF, and then by their CADD values (see also Table 2).
Clinical selection criteria of variants
| P- | |||||||
|---|---|---|---|---|---|---|---|
| Maximum effect size | >30 | <1 × 10−5 | 29 | 28 | 4 | 12.0 (4.2–34.3) | |
| Maximum evidence for effect | >30 | <1 × 10−4 | 38 | 38 | 6 | 10.9 (4.6–25.9) | |
| Pathogenic | >30 (truncating) | <1 × 10−5 | 13 | 13 | 0 | inf (5.2– inf) | |
| Likely pathogenic | >30 (missense) | <1 × 10−4 | 25 | 25 | 6 | 7.1 (2.9–17.4) | |
| Uncertain significance | |||||||
| Possibly pathogenic | 10–30 | <1 × 10−5 | 33 | 21 | 15 | 2.4 (1.2–4.6) | 7.7 × 10−3 |
| Most likely not pathogenic | >30 | >1 × 10−4 | 6 | 76 | 173 | 0.73 (0.6–1.0) | 0.99 |
| Likely benign | 10–30 | >1 × 10−5 | 70 | 1,895 | 3,206 | 1.0 | 1.0 |
| Benign | 0–10 | 0–1 | 30 | 1,895 | 3,206 | 1.0 | 1.0 |
Associations that are significant after multiple testing correction are shown in bold (P<7.1 × 10−4).
Stop/frameshift mutations were all unknown to the ExAC database v.0.3.1.
Figure 3Protein position of SORL1 variants with MAF <1 × 10−4 in ExAC database. One hundred and twenty-one coding variants with ExAC-MAF <1 × 10−4 were detected the combined analysis of 5101 subjects (1895 cases and 3206 controls). Each symbol represents one case carrier (red) or control (green) carrier. Protein domains are depicted on the CADD=20 level, variants with CADD scores between 20 and 30 are considered ‘moderately damaging’ and variants with CADD scores >30 were considered ‘strongly damaging’. Markers outlined in black represent variants that were detected in multiple cases or in multiple controls.