| Literature DB >> 35899035 |
Sher Li Oh1,2, Meikun Zhou3, Eunice W M Chin1, Gautami Amarnath1, Chee Hoe Cheah1, Kok Pin Ng1,4,5, Nagaendran Kandiah1, Eyleen L K Goh1, Keng-Hwee Chiam3.
Abstract
The definitive diagnosis of Alzheimer's Disease (AD) without the need for neuropathological confirmation remains a challenge in AD research today, despite efforts to uncover the molecular and biological underpinnings of the disease process. Furthermore, the potential for therapeutic intervention is limited upon the onset of symptoms, providing motivation for studying and treating the AD precursor mild cognitive impairment (MCI), the prodromal stage of AD instead. Applying machine learning classification to transcriptomic data of MCI, AD, and cognitively normal (CN) control patients, we identified differentially expressed genes that serve as biomarkers for the characterization and classification of subjects into MCI or AD groups. Predictive models employing these biomarker genes exhibited good classification performances for CN, MCI, and AD, significantly above random chance. The PI3K-Akt, IL-17, JAK-STAT, TNF, and Ras signaling pathways were also enriched in these biomarker genes, indicating their diagnostic potential and pathophysiological roles in MCI and AD. These findings could aid in the recognition of MCI and AD risk in clinical settings, allow for the tracking of disease progression over time in individuals as part of a therapeutic approach, and provide possible personalized drug targets for early intervention of MCI and AD.Entities:
Keywords: Alzheimer's Disease; biomarkers; gene expression; machine learning; mild cognitive impairment; neurodegeneration
Year: 2022 PMID: 35899035 PMCID: PMC9309434 DOI: 10.3389/fdgth.2022.875895
Source DB: PubMed Journal: Front Digit Health ISSN: 2673-253X
Figure 1An illustration of the overall framework for studying biomarkers from blood gene expression.
Demographics and Mini-Mental State Examination (MMSE) scores of cognitively normal (CN) controls, mild cognitive impairment (MCI), and Alzheimer's Disease (AD) subjects in the Alzheimer's Disease Neuroimaging Initiative (ADNI) and National Neuroscience Institute (NNI) datasets.
|
|
|
|
| |
|---|---|---|---|---|
|
| ||||
| Number of subjects (%) | 99 (39.0%) | 61 (24.0%) | 94 (37.0%) | 254 (100%) |
| Age | 63.35 ± 0.69 | 65.03 ± 0.86 | 71.18 ± 0.84 | 66.65 ± 0.51 |
| Gender | 42 male, | 24 male, | 48 male, | 114 male, |
| MMSE | 28.62 ± 0.15 | 27.49 ± 0.20 | 21.91 + 0.48 | 25.87 ± 0.27 |
|
| ||||
| Number of subjects (%) | 261 (35.1%) | 439 (59.0%) | 44 (5.9%) | 744 (100%) |
| Age | 75.56 ± 0.39 | 72.53 ± 0.38 | 75.20 ± 1.43 | 73.75 ± 0.28 |
| Gender | 125 male, | 256 male, | 27 male, | 408 male, |
| MMSE | 27.81 ± 0.21 | 24.51 ± 0.30 | 18.95 ± 0.82 | 25.34 ± 0.21 |
CN, cognitively normal; MCI, mild cognitive impairment; AD, Alzheimer's Disease.
Figure 2Uniform manifold approximation and projection (UMAP) plots of datasets used for analysis. (A) National Neuroscience Institute (NNI) data colored by diagnosis. Each point represents data from 176 genes from a single subject. (B) Alzheimer's Disease Neuroimaging Initiative (ADNI) data colored by diagnosis. Each point represents data from 20092 genes from a single subject. CN, cognitively normal; MCI, mild cognitive impairment; AD, Alzheimer's Disease.
Differentially expressed genes from pairwise comparisons using National Neuroscience Institute (NNI) data.
|
|
|
|---|---|
| CN vs. MCI | ABCA7, CA4, CCL3, CD31, CSF1, F5, FGF2, TNNT2, IKBKG, IL17A, ITGB3, KITLG, LPA, NOS2, OSM, SF3B1, TBP |
| CN vs. AD | CBL, CCL18, CCL27, DNMT3A, FGF1, IL23, IL4R, NFKB1, THPO, TNFB |
| MCI vs. AD | CA4, CCL3, CCL4, CCL5, CCL7, CRP, CSF1, EDN1, F5, IL13, IL4R, IL6, IL7, NOS2, NOTCH3, OCLN |
This table shows the list of differentially expressed genes for each pairwise comparison. Boruta demonstrates that 17 genes are important for distinguishing between cognitively normal (CN) control vs. mild cognitive impairment (MCI) subjects, 10 genes are important for CN vs. Alzheimer's Disease (AD), and 16 genes are important for MCI vs. AD, respectively.
Highest prediction accuracy for Alzheimer's Disease Neuroimaging Initiative (ADNI) data using differentially expressed genes from National Neuroscience Institute (NNI) data as predictive features.
|
|
|
|
|---|---|---|
| CN vs. MCI | 59.55 ± 0.24 | 8.78e-12 |
| CN vs. AD | 55.96 ± 0.13 | 2.46e-12 |
| MCI vs. AD | 56.65 ± 0.09 | 2.79e-14 |
This table shows the mean and standard error (s.e.) of the highest prediction percent accuracy when random forest models constructed using differentially expressed genes shown in .
indicates p < 0.001 according to a one-tailed t-test with the alternative hypothesis that the mean highest prediction percent accuracy is >50%.
Figure 3Protein-protein association networks constructed using the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) database. Proteins displayed are expressed from genes identified by Boruta from pairwise comparisons between (A) cognitively normal (CN) control and mild cognitive impairment (MCI), (B) CN and Alzheimer's Disease (AD), and (C) MCI and AD subjects.
Figure 4Top 20 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways from gene set enrichment analysis of differentially expressed genes from each pairwise comparison. All pathways shown have an adjusted p < 0.05 from gene set enrichment analysis. Pathways that are considered as potential biomarker pathways, specifically the PI3K-Akt, IL-17, JAK-STAT, TNF, and Ras signaling pathways, are shown in red. CN, cognitively normal; MCI, mild cognitive impairment; AD, Alzheimer's Disease.