| Literature DB >> 34681041 |
Anna Bocharova1, Kseniya Vagaitseva1, Andrey Marusin1, Natalia Zhukova2,3, Irina Zhukova2,3, Larisa Minaycheva1, Oksana Makeeva3, Vadim Stepanov1.
Abstract
Alzheimer's disease (AD) is a neurodegenerative disorder, and represents the most common cause of dementia. In this study, we performed several different analyses to detect loci involved in development of the late onset AD in the Russian population. DNA samples from 472 unrelated subjects were genotyped for 63 SNPs using iPLEX Assay and real-time PCR. We identified five genetic loci that were significantly associated with LOAD risk for the Russian population (TOMM40 rs2075650, APOE rs429358 and rs769449, NECTIN rs6857, APOE ε4). The results of the analysis based on comparison of the haplotype frequencies showed two risk haplotypes and one protective haplotype. The GMDR analysis demonstrated three significant models as a result: a one-factor, a two-factor and a three-factor model. A protein-protein interaction network with three subnetworks was formed for the 24 proteins. Eight proteins with a large number of interactions are identified: APOE, SORL1, APOC1, CD33, CLU, TOMM40, CNTNAP2 and CACNA1C. The present study confirms the importance of the APOE-TOMM40 locus as the main risk locus of development and progress of LOAD in the Russian population. Association analysis and bioinformatics approaches detected interactions both at the association level of single SNPs and at the level of genes and proteins.Entities:
Keywords: APOE; Alzheimer’s disease; GWAS; SNP; case–control study
Mesh:
Substances:
Year: 2021 PMID: 34681041 PMCID: PMC8535278 DOI: 10.3390/genes12101647
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Demographic parameters of the studied groups.
| Parameter | Patients with LOAD, | Control Group, |
|---|---|---|
| Gender: Women | 120 (64.86%) | 200 (69.69%) |
| Gender: Men | 65 (35.14%) | 87 (30.31%) |
| Mean age | 72.15 ± 7.87 | 71.8 ± 5.70 |
| Race | Caucasoid | Caucasoid |
| Population | Russians | Russians |
Frequencies of the 58 SNPs analyzed for LOAD.
| Gene. | SNP ID | Allele a | Functional Consequence | Position (GRCh38) | Minor Allele Frequency | MAF | |
|---|---|---|---|---|---|---|---|
| Cases | Controls | HapMap | |||||
|
| rs1010304 | G/A | intron variant | 20:41473007 | 0.06 | 0.05 | 0.06 |
|
| rs1031381 | T/C | intron variant | 11:134218788 | 0.40 | 0.43 | 0.42 |
|
| rs10489202 | T/G | intron variant | 1:167933841 | 0.21 | 0.20 | 0.22 |
|
| rs10503253 | A/C | intron variant | 8:4323322 | 0.23 | 0.27 | 0.26 |
|
| rs11064768 | G/A | intron variant | 12:119380704 | 0.08 | 0.07 | 0.07 |
|
| rs11191580 | C/T | intron variant | 10:103146454 | 0.09 | 0.09 | 0.11 |
|
| rs12125971 | T/C | intergenic variant | 1:106921021 | 0.08 | 0.08 | 0.07 |
|
| rs12140439 | A/C | intergenic variant | 1:177753772 | 0.29 | 0.30 | 0.34 |
|
| rs12290811 | A/T | intron variant | 11:79372576 | 0.12 | 0.12 | 0.14 |
|
| rs12361953 | G/T | intron variant | 11:24589584 | 0.14 | 0.14 | 0.15 |
|
| rs12494658 | C/T | intron variant | 3:85825326 | 0.24 | 0.30 | 0.28 |
|
| rs12922317 | G/A | intron variant | 16:11983775 | 0.40 | 0.35 | 0.35 |
|
| rs12989701 | A/C | intron variant | 2:127130409 | 0.10 | 0.15 | 0.15 |
|
| rs13034462 | G/T | intron variant | 2:79892368 | 0.03 | 0.04 | 0.05 |
|
| rs138880 | C/A | intron variant | 22:49824963 | 0.21 | 0.21 | 0.23 |
|
| rs1466662 | A/T | intron variant | 4:154426241 | 0.35 | 0.35 | 0.69 |
|
| rs1532278 | T/C | intron variant | 8:27608798 | 0.44 | 0.39 | 0.38 |
|
| rs157580 | G/A | intron variant | 19:44892009 | 0.34 | 0.36 | 0.35 |
|
| rs1606974 | A/G | intron variant | 2:51646461 | 0.08 | 0.06 | 0.07 |
|
| rs1635 | T/G | missense variant | 6:28259826 | 0.07 | 0.06 | 0.05 |
|
| rs16887244 | G/A | intron variant | 8:38173827 | 0.23 | 0.21 | 0.21 |
|
| rs16897515 | A/C | missense variant | 6:27310241 | 0.09 | 0.11 | 0.12 |
|
| rs17194490 | T/G | intron variant | 3:2506102 | 0.18 | 0.15 | 0.14 |
|
| rs17203055 | G/A | intron variant | 3:119365484 | 0.11 | 0.10 | 0.12 |
|
| rs17512836 | C/T | intron variant | 18:55527730 | 0.008 | 0.02 | 0.05 |
|
| rs17518584 | C/T | intron variant | 3:85555773 | 0.34 | 0.32 | 0.30 |
|
| rs17594526 | T/C | intron variant | 18:55391007 | 0.01 | 0.02 | 0.05 |
|
| rs17693963 | C/A | intergenic variant | 6:27742386 | 0.06 | 0.06 | 0.08 |
|
| rs2075650 | G/A | intron variant | 19:44892362 | 0.26 | 0.28 | 0.16 |
|
| rs2279590 | T/C | intron variant | 8:27598736 | 0.44 | 0.40 | 0.39 |
|
| rs2393895 | C/A | intron variant | 10:62579087 | 0.23 | 0.23 | 0.20 |
|
| rs2616984 | G/A | intron variant | 8:4625619 | 0.33 | 0.31 | 0.29 |
|
| rs368331 | G/A | intron variant | 7:21703356 | 0.07 | 0.06 | 0.06 |
|
| rs3772130 | G/A | intron variant | 3:121625293 | 0.20 | 0.25 | 0.25 |
|
| rs3783006 | C/G | intron variant | 13:98458955 | 0.46 | 0.48 | 0.47 |
|
| rs3818361 | A/G | intron variant | 1:207611623 | 0.33 | 0.25 | 0.25 |
|
| rs3826656 | G/A | intron variant | 19:51223357 | 0.24 | 0.24 | 0.23 |
|
| rs429358 | C/T | missense variant | 19:44908684 | 0.22 | 0.10 | 0.13 |
|
| rs433598 | T/C | intron variant | 16:20668884 | 0.35 | 0.37 | 0.34 |
|
| rs4420638 | G/A | 500B Downstream Variant | 19:44919689 | 0.18 | 0.14 | 0.18 |
|
| rs4629318 | A/G | intron variant | 3:2851590 | 0.15 | 0.15 | 0.14 |
|
| rs472926 | G/A | intron variant | 11:126035363 | 0.16 | 0.17 | 0.17 |
|
| rs4765905 | C/G | intron variant | 12:2240418 | 0.32 | 0.37 | 0.37 |
|
| rs530965 | T/C | intron variant | 11:79354056 | 0.49 | 0.47 | 0.49 |
|
| rs544991 | T/C | intron variant | 1:33723829 | 0.31 | 0.27 | 0.31 |
|
| rs561655 | G/A | intergenic variant | 11:86089237 | 0.30 | 0.35 | 0.33 |
|
| rs6129846 | T/C | intron variant | 20:41478674 | 0.06 | 0.05 | 0.06 |
|
| rs6574433 | G/A | intron variant | 14:78319816 | 0.45 | 0.42 | 0.42 |
|
| rs6656401 | A/G | intron variant | 1:207518704 | 0.31 | 0.23 | 0.24 |
|
| rs6857 | T/C | 3 Prime UTR Variant | 19:44888997 | 0.28 | 0.19 | 0.17 |
|
| rs6859 | A/G | intron variant | 19:44878777 | 0.50 | 0.42 | 0.44 |
|
| rs7004633 | G/A | intron variant | 8:88748082 | 0.21 | 0.19 | 0.20 |
|
| rs7341475 | A/G | intron variant | 7:103764368 | 0.16 | 0.19 | 0.16 |
|
| rs7561528 | A/G | intron variant | 2:127132061 | 0.31 | 0.31 | 0.34 |
|
| rs769449 | A/G | intron variant | 19:44906745 | 0.17 | 0.08 | 0.10 |
|
| rs7984606 | C/A | intron variant | 13:69881529 | 0.003 | 0.004 | 0.05 |
|
| rs9491140 | T/C | intron variant | 6:124370091 | 0.32 | 0.32 | 0.31 |
|
| rs7412 | T/C | missense variant | 19:44908822 | 0.08 | 0.07 | 0.07 |
a Minor allele/major allele.
Figure 1The structure of linkage disequilibrium in the genomic region of the NECTIN2-TOMM40-APOE locus: (a) tThe structure of LD for the cases; (b) the structure of LD for the controls. Linkage disequilibrium was measured by the D’ statistic: black—a strong link (D’ = 1, LOD > 2), grey—a significant link (D’ < 1, LOD > 2), white—poor link (D’ < 1, LOD < 2). A D’ value of 100 indicates a complete LD between 2 markers.
Association of statistically significant genetic markers with LOAD in the Russian population.
| Gene | Variant | Minor Allele | OR (95% CI) | Major Allele | OR (95% CI) | ||
|---|---|---|---|---|---|---|---|
| rs429358 | ε4 | 2.88 (1.95–4.24) | ε3 | 0.35 (0.24–0.51) | 5 × 10−8 | 8.62 × 10−4 | |
| rs769449 | A | 2.44 (1.62–3.66) | G | 0.41 (0.27–0.62) | 1 × 10−5 | 2.5 × 10−3 | |
| rs429358 | C | 2.53 (1.75–3.67) | T | 0.39 (0.27–0.57) | 5 × 10−7 | 1.7 × 10−3 | |
| rs2075650 | G | 1.67 (1.21–2.29) | A | 0.60 (0.44–0.82) | 2 × 10−3 | 3.4 × 10−3 | |
| rs6857 | T | 1.65 (1.21–2.26) | C | 0.61 (0.44–0.83) | 2 × 10−3 | 4.3 × 10−3 | |
|
| rs3818361 | A | 1.5 (1.12–2.00) | G | 0.67 (0.5–0.89) | 6 × 10−3 | 5.2 × 10−3 |
|
| rs6656401 | A | 1.46 (1.09–1.96) | G | 0.68 (0.51–0.92) | 0.01 | 6 × 10−3 |
|
| rs6859 | A | 1.34 (1.03–1.74) | G | 0.75 (0.57–0.97) | 0.03 | 6.8 × 10−3 |
|
| rs3772130 | G | 0.72 (0.52–0.99) | A | 1.39 (1.01–1.91) | 0.04 | 7.7 × 10−3 |
OR odds ratio, CI confidence interval, p values corrected based on Benjamini and Hochberg method, * Statistically significant results after multiple comparison correction.
Identical haplotypes identified in both groups.
| N | rs, Gene | Haplotype Frequencies (Cases/Controls) | |||||||
|---|---|---|---|---|---|---|---|---|---|
| rs6859 | rs6857 | rs157580 | rs2075650 | rs769449 | rs429358 | rs7412 | rs4420638 | ||
| 1 | G | C | G | A | G | T | C | A | 21.4/26.3 |
| 2 | A | C | A | A | G | T | C | A | 15.7/16.4 |
| 3 | G | C | A | A | G | T | C | A | 14.0/20.9 |
| 4 | A | T | A | G | A | C | C | G | 11.0/6.4 |
| 5 | A | T | A | G | G | T | C | A | 9.1/9.9 |
| 6 | A | C | G | A | G | T | C | A | 7.1/5.5 |
| 7 | G | C | G | A | G | T | T | A | 5.0/3.1 |
| 8 | G | C | A | A | G | T | T | A | 2.8/3.0 |
| 9 | A | C | A | A | G | T | C | G | 2.3/2.7 |
| 10 | G | T | A | G | A | C | C | G | 1.4/1.0 |
| 11 | G | T | A | A | G | C | C | G | 1.1/0.6 |
| 12 | G | C | A | A | A | C | C | G | 0.7/0.6 |
| 13 | A | T | A | A | G | C | C | G | 0.7/0.8 |
| 14 | G | C | A | A | G | T | C | G | 0.5/1.0 |
Figure 2Haplotype frequencies for the LOAD patients and the control group. * Statistically significant differences between the cases and the controls (p < 0.05).
Gene–gene interaction models by GMDR analysis.
| Best Interaction Model | TBA # | CVC | OR # (95% CI) # | |
|---|---|---|---|---|
| APOE | 0.59 | 6/10 | 0.0006 | 2.81 (1.54–5.13) |
|
| 0.66 | 10/10 | <0.0001 | 4.04 (2.27–7.17) |
|
| 0.66 | 10/10 | <0.0001 | 5.71 (3.03–10.78) |
CVC—cross validation consistency, TBA—testing balanced accuracy, # Values rounded up to 2 decimal places.
Figure 3Gene–gene interactions among TCF4 (rs17594526), APOE (rs429358) and CLU (rs1532278) loci in AD and healthy subjects: (a) two-factor model; (b) three-factor model. The score distribution of AD subjects (left black bar in boxes) and control subjects (right black bar in boxes) is shown for each genotype combination. High-risk genotype combinations are represented by dark gray shade cells, while light gray shade cells represent low-risk genotype combinations. Cells with no shading or white cells represent genotype combination for which no data is observed.
Figure 4Protein–protein interaction network of studied genes. The nodes and edges represent the proteins (genes) and their interactions, respectively. The PPI network contained 24 nodes and 39 edges. The STRING indicated that the proteins formed three subnetworks of interconnected gene products. Each subnetwork had a central protein: APOE, CACNA1C, CTNNA2. All these three subsystems are connected together by RELN as a bridge.
Significantly enriched pathways of studied genes.
| Pathway ID | Pathway Description | Observed Gene Count | False Discovery Rate | Matching Proteins in Your Network (Labels) |
|---|---|---|---|---|
| GO:0022008 | neurogenesis | 15 | 0.00028 | APOE,CDON,CLU,CNTN4,CNTNAP2,CTNNA2,KLHL1,LSM1,NRXN3,RELN,SORL1,STK24,TCF4,TENM4,ZNF365 |
| GO:0007155 | cell adhesion | 11 | 0.00036 | CADM2,CD33,CDON,CNTN4,CNTNAP2,CTNNA2,DCHS2,NRXN3,PVRL2,RELN,TENM4 |
| GO:0007399 | nervous system development | 17 | 0.00036 | APOE,CDON,CLU,CNTN4,CNTNAP2,CTNNA2,DNAH11,GRIN2B,KLHL1,LSM1,NRXN3,RELN,SORL1,STK24,TCF4,TENM4,ZNF365 |
| GO:0048699 | generation of neurons | 14 | 0.00036 | APOE,CDON,CNTN4,CNTNAP2,CTNNA2,KLHL1,LSM1,NRXN3,RELN,SORL1,STK24,TCF4,TENM4,ZNF365 |
| GO:0032989 | cellular component morphogenesis | 10 | 0.00044 | CLU,CNTN4,CNTNAP2,CTNNA2,NRXN3,PVRL2,RELN,STK24,TENM4,ZNF365 |
| GO:0050767 | regulation of neurogenesis | 10 | 0.00044 | APOE,CDON,CNTN4,LSM1,RELN,SORL1,STK24,TCF4,TENM4,ZNF365 |
| GO:0048666 | neuron development | 10 | 0.00045 | APOE,CNTN4,CNTNAP2,CTNNA2,KLHL1,NRXN3,RELN,STK24,TENM4,ZNF365 |
| GO:0031175 | neuron projection development | 9 | 0.00058 | APOE,CNTN4,CNTNAP2,CTNNA2,KLHL1,NRXN3,RELN,STK24,ZNF365 |
| GO:0000902 | cell morphogenesis | 9 | 0.00061 | CLU,CNTN4,CNTNAP2,CTNNA2,NRXN3,RELN,STK24,TENM4,ZNF365 |
| GO:0032990 | cell part morphogenesis | 8 | 0.00064 | CNTN4,CNTNAP2,CTNNA2,NRXN3,PVRL2,RELN,STK24,ZNF365 |
| GO:0007417 | central nervous system development | 10 | 0.00084 | CDON,CLU,CNTN4,CNTNAP2,CTNNA2,GRIN2B,KLHL1,RELN,TENM4,ZNF365 |
| GO:0007611 | learning or memory | 6 | 0.00089 | APOE,CNTNAP2,DNAH11,GRIN2B,NRXN3,RELN |