| Literature DB >> 35418845 |
Xianglian Meng1, Yue Wu1, Wenjie Liu1, Ying Wang2, Zhe Xu1, Zhuqing Jiao3.
Abstract
Alzheimer's disease (AD) is a degenerative disease of the central nervous system characterized by memory and cognitive dysfunction, as well as abnormal changes in behavior and personality. The research focused on how machine learning classified AD became a recent hotspot. In this study, we proposed a novel voxel-based feature detection framework for AD. Specifically, using 649 voxel-based morphometry (VBM) methods obtained from MRI in Alzheimer's Disease Neuroimaging Initiative (ADNI), we proposed a feature detection method according to the Random Survey Support Vector Machines (RS-SVM) and combined the research process based on image-, gene-, and pathway-level analysis for AD prediction. Particularly, we constructed 136, 141, and 113 novel voxel-based features for EMCI (early mild cognitive impairment)-HC (healthy control), LMCI (late mild cognitive impairment)-HC, and AD-HC groups, respectively. We applied linear regression model, least absolute shrinkage and selection operator (Lasso), partial least squares (PLS), SVM, and RS-SVM five methods to test and compare the accuracy of these features in these three groups. The prediction accuracy of the AD-HC group using the RS-SVM method was higher than 90%. In addition, we performed functional analysis of the features to explain the biological significance. The experimental results using five machine learning indicate that the identified features are effective for AD and HC classification, the RS-SVM framework has the best classification accuracy, and our strategy can identify important brain regions for AD.Entities:
Keywords: Alzheimer’s disease; RS-SVM; gene-level; pathway-level; voxel-based features
Year: 2022 PMID: 35418845 PMCID: PMC8995748 DOI: 10.3389/fninf.2022.856295
Source DB: PubMed Journal: Front Neuroinform ISSN: 1662-5196 Impact factor: 4.081
FIGURE 1Study workflow. (A) We performed image preprocessing on voxel-based measures extracted from structural MRI (VBM-MRI) in the ADNI data set. (B) Feature extraction. (C) RS- SVM construction. The features were identified by RS-SVM. (D) Evaluation and analysis. We assessed the biological significance using gene- and pathway-level analysis.
Participant characteristics.
| Subjects | HC | EMCI | LMCI | AD | p |
| Number | 353 | 273 | 504 | 296 | – |
| Gender (M/F) | 187/166 | 153/120 | 309/195 | 166/130 | <0.001 |
| Age (mean ± sd) | 72.2 ± 7.6 | 71.3 ± 7.1 | 74.0 ± 7.6 | 75.1 ± 5.5 | <0.001 |
| Edu (mean ± sd) | 16.1 ± 2.7 | 16.1 ± 2.6 | 16.0 ± 2.9 | 16.3 ± 2.6 | <0.001 |
HC, healthy control; EMCI, early mild cognitive impairment; LMCI, late mild cognitive impairment; AD, Alzheimer’s disease; Edu, education.
FIGURE 2The accuracy curves were obtained through ten experiments for five methods in three groups. (A) Prediction accuracy of EMCI-HC group. (B) Prediction accuracy of LMCI-HC group. (C) Prediction accuracy of AD-HC group.
Test results of different models.
| Group | Model | Validation set | Test set | ||||||
| Accuracy | Precision | Recall | F-Measure | Accuracy | Precision | Recall | F-Measure | ||
| EMCI-HC | Linear regression | 0.67 | 0.67 | 0.67 | 0.67 | 0.73 | 0.73 | 0.73 | 0.73 |
| Lasso | 0.79 | 0.79 | 0.79 | 0.79 | 0.80 | 0.80 | 0.80 | 0.80 | |
| PLS | 0.8 | 0.8 | 0.8 | 0.8 | 0.82 | 0.81 | 0.81 | 0.81 | |
| SVM | 0.73 | 0.73 | 0.73 | 0.73 | 0.76 | 0.76 | 0.76 | 0.76 | |
|
|
|
|
|
|
|
|
|
| |
| LMCI-HC | Linear regression | 0.62 | 0.62 | 0.62 | 0.62 | 0.78 | 0.78 | 0.77 | 0.77 |
| Lasso | 0.80 | 0.80 | 0.80 | 0.80 | 0.81 | 0.81 | 0.81 | 0.81 | |
| PLS | 0.65 | 0.64 | 0.65 | 0.64 | 0.66 | 0.65 | 0.66 | 0.65 | |
| SVM | 0.73 | 0.73 | 0.73 | 0.73 | 0.74 | 0.74 | 0.74 | 0.74 | |
|
|
|
|
|
|
|
|
|
| |
| AD-HC | Linear regression | 0.85 | 0.85 | 0.84 | 0.84 | 0.84 | 0.85 | 0.84 | 0.84 |
| Lasso | 0.85 | 0.85 | 0.85 | 0.85 | 0.85 | 0.85 | 0.85 | 0.85 | |
| PLS | 0.91 | 0.92 | 0.91 | 0.91 | 0.91 | 0.92 | 0.91 | 0.91 | |
| SVM | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | 0.87 | |
|
|
|
|
|
|
|
|
|
| |
Bold fonts represented the model and experimental results in this paper.
Top 10 conditionally significant genes were obtained. Chr represents Chromosome; Gene represents the gene name; CorrectedP represents P-value generated by Bonferroni correction.
| No. | Chr | Gene | CorrectedP |
| 1 | 8 | CSMD1 | 1.74556E-36 |
| 2 | 16 | RBFOX1 | 3.18755E-23 |
| 3 | 16 | CDH13 | 1.07119E-20 |
| 4 | 9 | PTPRD | 1.92988E-19 |
| 5 | 8 | DLGAP2 | 3.74049E-17 |
| 6 | 11 | CNTN5 | 4.81385E-16 |
| 7 | 7 | MAGI2 | 5.93057E-16 |
| 8 | 20 | MACROD2 | 1.50704E-14 |
| 9 | 16 | WWOX | 1.64798E-14 |
| 10 | 3 | CNTN4 | 1.87567E-13 |
Top 10 significant pathways.
| NO. | Pathways | Corrected | Gene |
| 1 | Insulin secretion | 1.01E-06 | PLCB1, PRKCB, PRKCA, CREB5, RYR2, CHRM3, KCNMA1, RAPGEF4, CACNA1C |
| 2 | Oxytocin signaling pathway | 4.80E-06 | PLCB1, PRKAG2, PRKCA, CACNB2, RYR3, RYR2, PRKCB, CACNA1C, ITPR2, CACNA2D3 |
| 3 | Salivary secretion | 7.70E-06 | PLCB1, PRKCA, RYR3, PRKCB, CHRM3, KCNMA1, PRKG1, ITPR2 |
| 4 | Vascular smooth muscle contraction | 7.94E-06 | PLCB1, CACNA1C, PRKCH, PRKCA, PRKCB, PRKCE, KCNMA1, PRKG1, ITPR2 |
| 5 | Calcium signaling pathway | 1.48E-05 | PLCB1, PRKCB, ERBB4, PRKCA, RYR3, RYR2, CHRM3, CACNA1C, ITPR2, PDE1A |
| 6 | Glutamatergic synapse | 2.08E-05 | PLCB1, CACNA1C, PRKCA, GRIK2, PRKCB, DLGAP1, ITPR2, GRM7 |
| 7 | Morphine addiction | 5.00E-05 | PRKCA, PDE1A, PRKCB, PDE3A, GABRB3, PDE4D, PDE10A |
| 8 | Circadian entrainment | 5.56E-05 | PLCB1, PRKCB, PRKCA, RYR3, RYR2, CACNA1C, PRKG1 |
| 9 | Pancreatic secretion | 5.56E-05 | PLCB1, PRKCB, PRKCA, RYR2, CHRM3, KCNMA1, ITPR2 |
| 10 | Aldosterone synthesis and secretion | 5.56E-05 | PLCB1, CACNA1C, PRKCA, CREB5, PRKCB, PRKCE, ITPR2 |
FIGURE 3ROC curve of five classification methods for three groups. (A) Prediction ROC of EMCI-HC group. (B) Prediction ROC of LMCI-HC group. (C) Prediction ROC of AD-HC group.
FIGURE 4An image showing the relation of genes and pathways.