| Literature DB >> 34336000 |
Lanlan Li1, Yeying Yang2, Qi Zhang1, Jiao Wang3, Jiehui Jiang1, Alzheimer's Disease Neuroimaging Initiative4.
Abstract
OBJECTIVES: Alzheimer's disease (AD) is the most prevalent neurodegenerative disorder and the most common form of dementia in the elderly. Certain genes have been identified as important clinical risk factors for AD, and technological advances in genomic research, such as genome-wide association studies (GWAS), allow for analysis of polymorphisms and have been widely applied to studies of AD. However, shortcomings of GWAS include sensitivity to sample size and hereditary deletions, which result in low classification and predictive accuracy. Therefore, this paper proposes a novel deep-learning genomics approach and applies it to multitasking classification of AD progression, with the goal of identifying novel genetic biomarkers overlooked by traditional GWAS analysis.Entities:
Mesh:
Year: 2021 PMID: 34336000 PMCID: PMC8298161 DOI: 10.1155/2021/3359103
Source DB: PubMed Journal: Behav Neurol ISSN: 0953-4180 Impact factor: 3.342
Figure 1The flowchart of experimental procedures in this study.
Clinical and baseline demographic characteristics of all participants.
| Groups | Gender (M/F) | Age (years) | Education | MoCA |
|---|---|---|---|---|
|
| ||||
| AD ( | 320/240a | 74.42 ± 7.26 | 15.54 ± 2.85a | 17.18 ± 5.05a,b |
| MCI ( | 255/171c | 73.27 ± 7.39 | 15.98 ± 2.78c | 23.62 ± 2.95b,c |
| HC ( | 163/167a,c | 73.80 ± 5.84 | 16.46 ± 2.54a,c | 25.88 ± 2.42a,c |
|
| ||||
| AD ( | 41/21a | 75.71 ± 7.99 | 15.55 ± 3.32 | 13.91 ± 6.82b |
| MCI ( | 30/17c | 75.56 ± 7.94 | 14.81 ± 3.70 | 22.75 ± 3.31b |
| HC ( | 14/22a,c | 75.45 ± 3.49 | 15.58 ± 3.59 | — |
Data of age and education were presented as mean ± standard deviation. MoCA: Montreal Cognitive Assessment. Group-level two-sample t test is conducted for age, education, and MoCA. Group-level chi-square test is conducted for gender. ap value HC vs. AD; bp value AD vs. MCI; cp value HC vs. MCI.
Figure 2Manhattan plot of genome-wide association study (GWAS) between AD and HC groups. The y-axis shows the p value (on the –log10 scale) for each association test. The x-axis is the chromosomal position of each SNP. The horizontal lines in the Manhattan plot display the cutoffs for two significant levels: blue line for p < 10−5 (generally significant level) and red line for p < 1.66e−7.
Figure 3Q–Q plot of genome-wide association study (GWAS) between AD and HC groups. Genomic inflation factor is 1.084.
SNP summaries reaching genome-wide significance after GWAS.
| SNP | Position | Chr | Region or closest gene | Major/minor alleles |
| OR |
|---|---|---|---|---|---|---|
| rs429358 | 44908684 | 19 | APOE | C/T | 5.407 | 4.348 |
| rs2075650 | 50087459 | 19 | TOMM40 | G/A | 7.19 | 2.737 |
p < 1.66e−7 for SNPs listed above. Chr: chromosome.
Performance of different classification approaches in multitasking classification.
| Model | Accuracy (%) | Sensitivity (%) | Specificity (%) | AUC |
|---|---|---|---|---|
|
| ||||
| GWAS analysis | 71.38 ± 0.63 | 63.13 ± 2.87 | 85.59 ± 6.66 | 0.744 |
| CNN model | 92.45 ± 8.13 | 93.87 ± 12.26 | 90.00 ± 15.97 | 0.915 |
| ResNet18 | 97.96 ± 1.71 | 97.42 ± 3.16 | 98.89 ± 1.36 | 0.980 |
| ResNet34 | 98.78 ± 1.50 | 98.39 ± 2.50 | 99.44 ± 1.11 | 0.981 |
|
| ||||
| GWAS analysis | 56.99 ± 1.55 | 96.08 ± 13.92 | 5.94 ± 21.65 | 0.510 |
| CNN model | 87.47 ± 16.64 | 99.57 ± 0.85 | 71.67 ± 38.75 | 0.852 |
| ResNet18 | 97.59 ± 3.73 | 100.00 ± 0.00 | 94.44 ± 8.61 | 0.966 |
| ResNet34 | 99.52 ± 0.60 | 99.57 ± 0.85 | 99.44 ± 1.11 | 0.986 |
|
| ||||
| GWAS analysis | 58.97 ± 0.00 | 72.18 ± 0.01 | 41.54 ± 0.01 | 0.569 |
| CNN model | 86.42 ± 16.02 | 97.42 ± 4.40 | 71.91 ± 39.21 | 0.840 |
| ResNet18 | 97.80 ± 1.24 | 97.74 ± 2.41 | 97.87 ± 3.30 | 0.972 |
| ResNet34 | 98.90 ± 1.78 | 100.00 ± 0.00 | 97.45 ± 4.13 | 0.981 |
The methods are conducted with crossvalidation, and their results are given as mean ± standard deviation.
Figure 4ROC curve of the performance of different classification approaches in multitasking classification. (a) ROC curve between AD and HC groups. (b) ROC curve between MCI and HC groups. (c) ROC curve between AD and MCI groups.
Shared significant SNPs in AD multitasking classification at p threshold of 0.05.
| SNP loci | Chr | Position | Region or closest gene | Major/minor alleles |
|
|
|
|---|---|---|---|---|---|---|---|
| rs12091371 | 1 | 238671675 | FMN2 | A/G | 0.002227833 | 0.001509792 | 2.27881 |
| rs12129547 | 1 | 238761878 | GREM2 | T/C | 0.002767757 | 0.019531535 | 3.93396 |
| rs1801131 | 1 | 11777063 | MTHFR | C/A | 0.000154746 | 0.007333869 | 6.18756 |
| rs1801133 | 1 | 11778965 | MTHFR | T/C | 0.000192989 | 0.009566143 | 5.35043 |
| rs17034806 | 2 | 109002337 | RANBP2 | G/A | 0.002645921 | 0.028425309 | 3.04207 |
| rs243034 | 2 | 60456396 | MIR4432HG | G/A | 0.038985718 | 0.009492742 | 1.40355 |
| rs4676049 | 2 | 109001689 | RANBP2 | T/C | 0.002215778 | 0.034993277 | 4.51555 |
| rs6714710 | 2 | 97711518 | ZAP70 | G/T | 0.038909198 | 0.000768783 | 7.036 |
| rs1498853 | 3 | 69691797 | NANa | G/A | 0.000708166 | 0.012435495 | 6.43721 |
| rs2289506 | 3 | 101547592 | NIT2 | T/C | 0.000412705 | 0.004914541 | 1.07429 |
| rs288496 | 3 | 69714739 | NANa | T/C | 0.002172998 | 0.009409715 | 4.5892 |
| rs3864101 | 3 | 188862449 | NANa | T/G | 0.038241862 | 0.022506051 | 6.17468 |
| rs989692 | 3 | 156284059 | MME | T/C | 0.00243336 | 0.016131413 | 5.91293 |
| rs3796529 | 4 | 57492171 | REST | A/G | 0.048696066 | 0.004826601 | 7.08294 |
| rs753129 | 4 | 56363188 | NANa | C/T | 0.004469287 | 1.23015 | 0.013855564 |
| rs1925458 | 6 | 23486930 | LOC102724749; LOC105374976 | T/G | 0.000567167 | 0.000409483 | 3.10404 |
| rs1980493 | 6 | 32471193 | BTNL2; TSBP1-AS1 | G/A | 0.016858811 | 0.000751139 | 3.35304 |
| rs2651206 | 6 | 43321455 | TTBK1 | T/C | 0.034368574 | 2.23746 | 1.97506 |
| rs3734254 | 6 | 35502988 | PPARD | C/T | 0.001667544 | 0.000173601 | 2.21574 |
| rs3747742 | 6 | 41270496 | TREML2 | C/T | 6.7184 | 3.50103 | 1.19671 |
| rs6455128 | 6 | 62755705 | KHDRBS2 | A/C | 0.008767325 | 0.00013524 | 1.45915 |
| rs11767557 | 7 | 142819261 | EPHA1-AS1 | C/T | 0.001139233 | 1.20661 | 1.96401 |
| rs11771145 | 7 | 142820884 | EPHA1-AS1 | A/G | 0.001112928 | 9.62693 | 1.8557 |
| rs2227631 | 7 | 100556258 | SERPINE1 | G/A | 0.003024669 | 8.25065 | 7.24385 |
| rs6461569 | 7 | 21502301 | SP4 | C/T | 0.025204688 | 1.41338 | 0.001322394 |
| rs6966915 | 7 | 12232513 | TMEM106B | T/C | 0.031810788 | 0.004328052 | 6.57464 |
| rs11136000 | 8 | 27520436 | CLU | T/C | 0.000663408 | 8.36729 | 1.48921 |
| rs1975804 | 8 | 109360409 | EIF3E; LOC105375704 | C/T | 0.000833268 | 0.003779274 | 0.040135088 |
| rs1800977 | 9 | 106730271 | ABCA1; LOC105376196 | T/C | 0.013227434 | 5.61278 | 4.22923 |
| rs2007153 | 9 | 135493640 | DBH | A/G | 3.88584 | 0.000993637 | 4.15766 |
| rs2066715 | 9 | 106627854 | ABCA1 | A/G | 0.004157538 | 0.04271672 | 7.79217 |
| rs2283123 | 9 | 135505118 | DBH | T/C | 0.000105332 | 0.00221966 | 1.99182 |
| rs2740483 | 9 | 106730356 | ABCA1; LOC105376196 | C/G | 0.009678488 | 6.53287 | 3.81638 |
| rs4149313 | 9 | 106626574 | ABCA1 | G/A | 0.003318514 | 0.040059489 | 7.22211 |
| rs4878104 | 9 | 89382811 | DAPK1 | T/C | 0.002309724 | 6.11706 | 1.46237 |
| rs4548513 | 10 | 67710331 | CTNNA3 | T/C | 0.004692221 | 0.000992064 | 6.62732 |
| rs7070570 | 10 | 67534610 | LOC105378340; CTNNA3 | G/A | 0.000655491 | 2.34147 | 1.63609 |
| rs10501602 | 11 | 85359037 | PICALM | G/A | 0.004647595 | 3.42511 | 2.27238 |
| rs1133174 | 11 | 121006965 | SORL1 | A/G | 0.010544659 | 0.003255318 | 3.13279 |
| rs12805520 | 11 | 85308059 | CCDC83 | T/C | 0.007162755 | 4.02484 | 1.90295 |
| rs1354269 | 11 | 19330820 | NAV2 | C/T | 0.010833795 | 6.42781 | 2.65216 |
| rs1695 | 11 | 67109265 | GSTP1 | G/A | 0.006309008 | 6.1645 | 2.06528 |
| rs1699102 | 11 | 120962172 | SORL1 | C/T | 0.013141699 | 0.001477832 | 6.19383 |
| rs17571 | 11 | 1739170 | CTSD | T/C | 0.013910348 | 1.48442 | 1.58542 |
| rs1946518 | 11 | 111540668 | IL18 | T/G | 0.006486007 | 1.8075 | 7.21291 |
| rs2070045 | 11 | 120953300 | SORL1 | G/T | 0.014545915 | 0.001140944 | 7.97485 |
| rs3851179 | 11 | 85546288 | PICALM | A/G | 0.00599555 | 1.06305 | 1.50773 |
| rs543293 | 11 | 85497725 | PICALM | A/G | 0.004782682 | 1.26328 | 7.85454 |
| rs6265 | 11 | 27636492 | BDNF; BDNF-AS | A/G | 0.009460455 | 1.96067 | 1.42052 |
| rs7120118 | 11 | 47242866 | NR1H3 | C/T | 0.034010947 | 0.011720707 | 5.90652 |
| rs7943454 | 11 | 24478242 | LUZP2 | T/C | 0.01777448 | 1.41141 | 2.00064 |
| rs1799986 | 12 | 55821533 | LRP1 | T/C | 0.000875404 | 8.1991 | 0.000948497 |
| rs6311 | 13 | 46369479 | HTR2A | T/C | 1.95644 | 0.002519461 | 1.48261 |
| rs6313 | 13 | 46367941 | HTR2A | T/C | 3.21121 | 0.004551786 | 2.05426 |
| rs690705 | 13 | 33552918 | RFC3 | G/A | 9.15016 | 0.000123939 | 0.032894005 |
| rs7989332 | 13 | 19948575 | CRYL1 | A/C | 0.001947338 | 0.038324875 | 0.017831896 |
| rs10137185 | 14 | 63845529 | ESR2 | T/C | 0.000105484 | 0.00099597 | 2.08051 |
| rs1065778 | 15 | 49307498 | MIR4713HG; CYP19A1 | A/G | 7.88465 | 1.51664 | 2.28151 |
| rs2278317 | 15 | 31848032 | RYR3 | G/A | 0.019133823 | 5.18518 | 4.11594 |
| rs3751592 | 15 | 49393870 | CYP19A1; MIR7973-1; MIR7973-2 | G/A | 0.000333452 | 8.25536 | 5.77922 |
| rs11075996 | 16 | 52415525 | FTO | T/C | 0.021538201 | 9.92354 | 0.002856803 |
| rs11075997 | 16 | 52416413 | FTO | T/C | 0.023256237 | 1.14854 | 0.002461173 |
| rs6499640 | 16 | 52327178 | FTO | G/A | 0.016928027 | 4.22569 | 0.009148135 |
| rs1050565 | 17 | 25600202 | BLMH | G/A | 0.045066195 | 6.1136 | 1.19122 |
| rs391300 | 17 | 2163008 | SRR | A/G | 0.028223591 | 1.90848 | 0.000379785 |
| rs7946 | 17 | 17350285 | PEMT | C/T | 0.044709718 | 0.000412862 | 2.75631 |
SNP: single-nucleotide polymorphism. Chr: chromosome. NANa represents uncertain gene of the SNP loci. p value (HC vs. AD) represents the difference value of each SNP loci between AD and HC groups. All p value are calculated by a two-sample t test for false discovery rate correction.
Figure 5Visualization for part of shared significant SNPs in AD multitasking classification at p threshold of 0.05. Rs11136000 (CLU) on chromosome 8 (a); rs543293, rs10501602, and rs3851179 (PICALM) on chromosome 11 (b); rs2070045, rs1699102, and rs1133174 (SORL1) on chromosome 11 (c); and rs6311 and rs6313 (HTR2A) on chromosome 13 (d) are shown successively from left to right.