| Literature DB >> 33788842 |
Ray O Bahado-Singh1, Sangeetha Vishweswaraiah1, Buket Aydas2, Ali Yilmaz1, Raghu P Metpally3, David J Carey3, Richard C Crist4, Wade H Berrettini4, George D Wilson5, Khalid Imam6, Michael Maddens6, Halil Bisgin7, Stewart F Graham1, Uppala Radhakrishna1.
Abstract
We evaluated the utility of leucocyte epigenomic-biomarkers for Alzheimer's Disease (AD) detection and elucidates its molecular pathogeneses. Genome-wide DNA methylation analysis was performed using the Infinium MethylationEPIC BeadChip array in 24 late-onset AD (LOAD) and 24 cognitively healthy subjects. Data were analyzed using six Artificial Intelligence (AI) methodologies including Deep Learning (DL) followed by Ingenuity Pathway Analysis (IPA) was used for AD prediction. We identified 152 significantly (FDR p<0.05) differentially methylated intragenic CpGs in 171 distinct genes in AD patients compared to controls. All AI platforms accurately predicted AD with AUCs ≥0.93 using 283,143 intragenic and 244,246 intergenic/extragenic CpGs. DL had an AUC = 0.99 using intragenic CpGs, with both sensitivity and specificity being 97%. High AD prediction was also achieved using intergenic/extragenic CpG sites (DL significance value being AUC = 0.99 with 97% sensitivity and specificity). Epigenetically altered genes included CR1L & CTSV (abnormal morphology of cerebral cortex), S1PR1 (CNS inflammation), and LTB4R (inflammatory response). These genes have been previously linked with AD and dementia. The differentially methylated genes CTSV & PRMT5 (ventricular hypertrophy and dilation) are linked to cardiovascular disease and of interest given the known association between impaired cerebral blood flow, cardiovascular disease, and AD. We report a novel, minimally invasive approach using peripheral blood leucocyte epigenomics, and AI analysis to detect AD and elucidate its pathogenesis.Entities:
Year: 2021 PMID: 33788842 PMCID: PMC8011726 DOI: 10.1371/journal.pone.0248375
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Top 33 differentially methylated CpG markers—(Gene IDs, chromosome location, fold change, AUC, and percentage of methylation difference for each CpG).
| Target ID | CHR | Gene | FDR p-Val | Fold change | AUC | CI | % Methylation | % Methylation difference | ||
|---|---|---|---|---|---|---|---|---|---|---|
| Lower | Upper | Cases | Control | |||||||
| cg20008763 | 19 | ZNF667 | 2.75071E-41 | 1.57 | 0.66 | 0.51 | 0.81 | 44.46 | 28.33 | 16.13 |
| cg25755428 | 19 | MRI1 | 4.25655E-41 | 1.57 | 0.65 | 0.50 | 0.81 | 43.40 | 27.67 | 15.73 |
| cg21353034 | 12 | VPS33A | 1.11479E-38 | 1.93 | 0.65 | 0.50 | 0.81 | 21.49 | 11.13 | 10.36 |
| cg05706624 | 17 | WSCD1 | 3.0837E-38 | 1.81 | 0.68 | 0.53 | 0.83 | 20.84 | 11.53 | 9.31 |
| cg12949483 | 15 | TMEM85 | 4.00337E-38 | 1.66 | 0.68 | 0.53 | 0.83 | 22.62 | 13.58 | 9.03 |
| cg26856451 | 2 | THAP4 | 1.35084E-37 | 2.25 | 0.74 | 0.60 | 0.88 | 13.92 | 6.19 | 7.73 |
| cg26340737 | 6 | RNF5P1; RNF5; AGPAT1 | 1.52469E-37 | 2.84 | 0.60 | 0.44 | 0.76 | 11.73 | 4.13 | 7.60 |
| cg04515524 | 19 | PLVAP | 1.15699E-30 | 0.39 | 0.75 | 0.61 | 0.89 | 8.28 | 21.08 | -12.80 |
| cg02356786 | 1 | LOC731275 | 3.59905E-21 | 0.48 | 0.71 | 0.57 | 0.86 | 9.95 | 20.76 | -10.81 |
| cg05841700 | 1 | PM20D1 | 4.61313E-19 | 0.65 | 0.62 | 0.46 | 0.78 | 26.67 | 40.75 | -14.08 |
| cg16259859 | 1 | ZBTB8A | 2.79863E-17 | 0.60 | 0.66 | 0.51 | 0.82 | 18.09 | 29.95 | -11.86 |
| cg08829299 | 11 | ATHL1 | 2.25432E-16 | 0.62 | 0.67 | 0.52 | 0.83 | 18.59 | 30.21 | -11.63 |
| cg10326472 | 6 | MYB | 8.45895E-14 | 1.50 | 0.69 | 0.54 | 0.84 | 29.08 | 19.34 | 9.74 |
| cg00613827 | 1 | CR1L | 2.86297E-12 | 0.52 | 0.61 | 0.45 | 0.77 | 7.96 | 15.32 | -7.36 |
| cg07509935 | 14 | LTB4R; CIDEB | 4.66298E-12 | 0.53 | 0.68 | 0.52 | 0.83 | 8.45 | 15.88 | -7.43 |
| cg08611411 | 1 | LOR | 2.87911E-11 | 1.97 | 0.55 | 0.38 | 0.71 | 12.41 | 6.30 | 6.12 |
| cg18157505 | 1 | PTPRC | 5.51496E-11 | 1.71 | 0.63 | 0.47 | 0.78 | 16.62 | 9.73 | 6.89 |
| cg27119318 | 21 | WRB | 1.0259E-10 | 0.61 | 0.69 | 0.54 | 0.84 | 12.64 | 20.66 | -8.02 |
| cg01819759 | 13 | RNF219 | 1.02625E-10 | 1.54 | 0.61 | 0.45 | 0.77 | 22.24 | 14.45 | 7.80 |
| cg01887804 | 15 | IVD | 1.62867E-10 | 1.70 | 0.65 | 0.50 | 0.81 | 16.24 | 9.54 | 6.70 |
| cg23623880 | 1 | MACF1 | 2.95897E-10 | 1.52 | 0.69 | 0.54 | 0.84 | 22.53 | 14.83 | 7.70 |
| cg07469467 | 12 | APAF1 | 4.81236E-10 | 0.58 | 0.63 | 0.47 | 0.78 | 9.82 | 16.88 | -7.06 |
| ch.15.658653F | 15 | TMOD2 | 7.3646E-10 | 0.55 | 0.64 | 0.48 | 0.80 | 7.81 | 14.28 | -6.47 |
| cg17160660 | 8 | MYC | 1.15589E-09 | 1.89 | 0.72 | 0.58 | 0.87 | 12.06 | 6.39 | 5.67 |
| cg16251399 | 6 | GUSBL2 | 1.28686E-09 | 0.47 | 0.65 | 0.50 | 0.81 | 5.02 | 10.62 | -5.60 |
| cg17578275 | 2 | ADAM17 | 1.29935E-09 | 0.60 | 0.67 | 0.52 | 0.82 | 10.93 | 18.09 | -7.16 |
| cg05800065 | 4 | NSG1 | 1.93339E-09 | 1.99 | 0.71 | 0.57 | 0.86 | 10.78 | 5.43 | 5.36 |
| cg19819404 | 4 | ZNF718 | 1.99376E-09 | 1.61 | 0.68 | 0.53 | 0.83 | 17.41 | 10.80 | 6.61 |
| cg00106073 | 1 | LMNA | 7.50554E-09 | 1.93 | 0.69 | 0.53 | 0.84 | 10.85 | 5.62 | 5.23 |
| cg24368383 | 1 | MIB2 | 1.5593E-08 | 2.44 | 0.62 | 0.46 | 0.78 | 7.62 | 3.12 | 4.50 |
| cg02722613 | 4 | SEPSECS | 1.69962E-08 | 0.63 | 0.60 | 0.43 | 0.76 | 11.38 | 18.19 | -6.80 |
| cg00853940 | 2 | TRPM8 | 2.05842E-08 | 1.50 | 0.68 | 0.52 | 0.83 | 20.05 | 13.33 | 6.73 |
| cg14304349 | 11 | TRIM6 | 3.15543E-08 | 0.49 | 0.59 | 0.43 | 0.76 | 4.87 | 9.96 | -5.10 |
Top 25 intergenic/extragenic markers*.
| Target ID | FDR p-Val | Fold change | AUC | CI | % Methylation | % Methylation difference | ||
|---|---|---|---|---|---|---|---|---|
| Lower | Upper | Cases | Control | |||||
| cg04299067 | 1.11E-14 | 1.31 | 0.79 | 0.66 | 0.92 | 49.58 | 37.83 | 11.76 |
| cg02147364 | 1.94E-14 | 0.4 | 0.77 | 0.64 | 0.91 | 4.49 | 11.35 | -6.87 |
| cg15711973 | 3.93E-10 | 0.86 | 0.77 | 0.63 | 0.9 | 59.61 | 69.68 | -10.07 |
| cg23332294 | 4.00E-07 | 1.13 | 0.76 | 0.63 | 0.9 | 69.15 | 61.27 | 7.88 |
| cg11166167 | 4.70E-06 | 0.91 | 0.78 | 0.65 | 0.91 | 68.17 | 75.27 | -7.1 |
| cg22680058 | 5.03E-05 | 1.61 | 0.77 | 0.64 | 0.91 | 10.9 | 6.77 | 4.14 |
| cg05293897 | 7.50E-05 | 0.83 | 0.77 | 0.64 | 0.91 | 35.64 | 43.18 | -7.54 |
| cg00614617 | 0.000121 | 1.15 | 0.8 | 0.68 | 0.93 | 53.81 | 46.71 | 7.1 |
| cg12269972 | 0.000145 | 0.81 | 0.76 | 0.62 | 0.9 | 30.32 | 37.36 | -7.04 |
| cg08343820 | 0.000357 | 0.92 | 0.78 | 0.65 | 0.91 | 69.33 | 75.21 | -5.87 |
| cg06336897 | 0.000397 | 0.9 | 0.79 | 0.66 | 0.92 | 60.13 | 66.73 | -6.6 |
| cg23980569 | 0.000405 | 0.89 | 0.81 | 0.68 | 0.93 | 56.04 | 62.86 | -6.82 |
| cg16219773 | 0.000518 | 0.93 | 0.76 | 0.63 | 0.9 | 71.94 | 77.43 | -5.5 |
| cg13699771 | 0.000844 | 1.12 | 0.76 | 0.62 | 0.9 | 59.28 | 52.91 | 6.37 |
| cg24328568 | 0.001267 | 0.9 | 0.8 | 0.67 | 0.92 | 56.64 | 63.03 | -6.39 |
| cg11122899 | 0.003358 | 0.83 | 0.79 | 0.66 | 0.92 | 26.86 | 32.53 | -5.67 |
| cg22509132 | 0.019164 | 0.93 | 0.76 | 0.63 | 0.9 | 65.3 | 70.13 | -4.83 |
| cg00280895 | 0.023204 | 1.11 | 0.77 | 0.64 | 0.91 | 51.94 | 46.75 | 5.19 |
| cg26041076 | 0.024238 | 0.73 | 0.81 | 0.69 | 0.94 | 8.46 | 11.53 | -3.08 |
| cg06858692 | 0.026431 | 0.92 | 0.76 | 0.63 | 0.9 | 57.32 | 62.41 | -5.09 |
| cg08895936 | 0.028385 | 1.03 | 0.77 | 0.63 | 0.9 | 87.82 | 84.95 | 2.87 |
| cg12688483 | 0.031136 | 1.59 | 0.76 | 0.63 | 0.9 | 6.38 | 4.02 | 2.36 |
| cg25906247 | 0.033333 | 1.43 | 0.77 | 0.63 | 0.9 | 8.68 | 6.06 | 2.61 |
| cg00521380 | 0.036657 | 0.96 | 0.8 | 0.67 | 0.92 | 77.56 | 81.17 | -3.61 |
| cg23694799 | 0.043606 | 0.94 | 0.84 | 0.72 | 0.95 | 64.7 | 69.18 | -4.49 |
*Methylation difference defined as FDR p-value <0.05*.
Top 25 intergenic/extragenic markers: Genome-wide significance threshold*.
| Target ID | p-Val | Fold change | AUC | CI | % Methylation | % Methylation difference | ||
|---|---|---|---|---|---|---|---|---|
| Lower | Upper | Cases | Control | |||||
| rs4331560 | 3.39589E-45 | 1.84 | 0.68 | 0.53 | 0.83 | 51.47 | 27.97 | 23.50 |
| rs5926356 | 2.7196E-42 | 1.53 | 0.60 | 0.44 | 0.76 | 52.13 | 33.98 | 18.15 |
| rs10936224 | 1.42552E-40 | 1.35 | 0.59 | 0.43 | 0.76 | 56.30 | 41.68 | 14.62 |
| rs1040870 | 1.82909E-40 | 1.50 | 0.58 | 0.42 | 0.74 | 43.36 | 28.96 | 14.39 |
| cg11468315 | 4.90383E-40 | 1.67 | 0.66 | 0.50 | 0.81 | 33.67 | 20.22 | 13.46 |
| cg27128435 | 8.16168E-40 | 1.32 | 0.74 | 0.60 | 0.88 | 53.64 | 40.67 | 12.97 |
| rs348937 | 1.81674E-39 | 1.35 | 0.61 | 0.45 | 0.77 | 47.37 | 35.19 | 12.19 |
| cg00727777 | 9.80978E-39 | 1.15 | 0.58 | 0.42 | 0.74 | 79.50 | 69.01 | 10.50 |
| cg27055313 | 2.16905E-38 | 1.75 | 0.75 | 0.62 | 0.89 | 22.50 | 12.82 | 9.68 |
| cg19775763 | 1.4397E-37 | 1.09 | 0.62 | 0.46 | 0.78 | 90.66 | 83.00 | 7.66 |
| cg19432688 | 8.03822E-37 | 1.07 | 0.53 | 0.36 | 0.69 | 93.51 | 87.76 | 5.75 |
| rs264581 | 1.75175E-28 | 0.52 | 0.63 | 0.47 | 0.79 | 16.87 | 32.17 | -15.30 |
| rs1495031 | 9.34546E-28 | 0.63 | 0.66 | 0.51 | 0.82 | 42.09 | 67.03 | -24.93 |
| rs2032088 | 1.11017E-26 | 0.60 | 0.60 | 0.44 | 0.76 | 24.32 | 40.75 | -16.42 |
| rs6982811 | 1.87656E-26 | 0.60 | 0.61 | 0.45 | 0.77 | 24.88 | 41.32 | -16.44 |
| rs6626309 | 4.51648E-26 | 0.72 | 0.60 | 0.44 | 0.76 | 44.25 | 61.55 | -17.31 |
| cg27438152 | 9.37531E-21 | 0.84 | 0.67 | 0.51 | 0.82 | 65.54 | 78.42 | -12.88 |
| cg23155965 | 7.14189E-19 | 0.91 | 0.68 | 0.53 | 0.83 | 81.52 | 90.01 | -8.49 |
| cg16097834 | 2.76613E-18 | 0.87 | 0.50 | 0.34 | 0.67 | 73.12 | 83.68 | -10.56 |
| rs2208123 | 8.75703E-17 | 0.80 | 0.64 | 0.49 | 0.80 | 52.83 | 66.31 | -13.49 |
| cg25556225 | 2.17724E-16 | 0.86 | 0.73 | 0.59 | 0.88 | 66.98 | 78.32 | -11.33 |
| cg00224807 | 3.28345E-16 | 0.90 | 0.62 | 0.46 | 0.78 | 77.81 | 86.73 | -8.92 |
| cg03192273 | 1.66265E-15 | 0.61 | 0.63 | 0.47 | 0.78 | 17.06 | 28.01 | -10.95 |
| rs7746156 | 4.86899E-15 | 1.29 | 0.57 | 0.41 | 0.73 | 53.95 | 41.92 | 12.03 |
| rs5987737 | 4.89189E-15 | 1.28 | 0.60 | 0.44 | 0.76 | 54.30 | 42.28 | 12.02 |
* Stringent genome-wide significance threshold: p-value <5x10-8.
Fig 1Principal Component Analysis (PCA) and Partial Least Square Discriminant Analyses (PLS-DA) with intragenic markers.
Alzheimer’s disease prediction based on all intragenic* CpG markers only.
| SVM | GLM | PAM | RF | LDA | DL | |
|---|---|---|---|---|---|---|
| AUC 95% CI | 0.9898 (0.8000–1) | 0.9880 (0.8000–1) | 0.9877 (0.8000–1) | 0.9620 (0.8000–1) | 0.9325 (0.8000–1) | 0.9920 (0.8000–1) |
| Sensitivity | 0.9100 | 0.9500 | 0.9200 | 0.9100 | 0.9000 | 0.9750 |
| Specificity | 0.9700 | 0.9800 | 0.9400 | 0.9500 | 0.9000 | 0.9700 |
* based on analysis of 283,143 CpG loci.
Important predictors in order.
SVM: cg10304803, cg07589235, cg09991306, cg07773593, cg11035296.
GLM: cg02434121, cg27066201, cg14185918, cg07079724, cg04898026.
PAM: cg25179758, cg08086084, cg21027526, cg17840509, cg24644672.
RF: cg25179758, cg27066201, cg14185918, cg07773593, cg11035296.
LDA: cg09991306, cg07773593, cg27066201, cg14185918, cg24644672.
DL: cg10304803, cg07589235, cg09991306, cg07773593, cg11035296.
Support Vector Machine (SVM), Generalized Linear Model (GLM), Prediction Analysis for Microarrays (PAM), Random Forest (RF), Linear Discriminant Analysis (LDA), and Deep Learning (DL).
Alzheimer’s disease prediction based on intergenic (extragenic) CpG markers* only.
| SVM | GLM | PAM | RF | LDA | DL | |
|---|---|---|---|---|---|---|
| AUC 95% CI | 0.9970 (0.8000–1) | 0.9980 (0.8000–1) | 0.9977 (0.8000–1) | 0.9820 (0.8000–1) | 0.9725 (0.8000–1) | 0.9990 (0.8000–1) |
| Sensitivity | 0.9200 | 0.9400 | 0.9300 | 0.9200 | 0.9200 | 0.9750 |
| Specificity | 0.9860 | 0.9810 | 0.9580 | 0.9550 | 0.9100 | 0.9750 |
*—analysis based on 244,246 markers.
Important predictors in order.
SVM: cg01941243, cg09301498, cg27128435, cg03043243, cg09050832.
GLM: rs4331560, cg15410835, cg05477405, cg16818568, cg01938825.
PAM: cg19008148, cg02875416, cg18232989, cg25761791, cg06842409.
RF: cg19008148, cg15410835, cg05477405, cg03043243, cg09050832.
LDA: cg15410835, cg27128435, cg03043243, cg25761791, cg06842409.
DL: cg01941243, cg09301498, cg27128435, cg03043243, cg09050832.
Support Vector Machine (SVM), Generalized Linear Model (GLM), Prediction Analysis for Microarrays (PAM), Random Forest (RF), Linear Discriminant Analysis (LDA), and Deep Learning (DL).
Fig 2Epigenetically dysregulated molecular pathways in AD.
Fig 3Epigenetically dysregulated disease pathways in AD.