| Literature DB >> 35386189 |
Xiong Li1, Yangping Qiu1, Juan Zhou1, Ziruo Xie1.
Abstract
Background: Recent development in neuroimaging and genetic testing technologies have made it possible to measure pathological features associated with Alzheimer's disease (AD) in vivo. Mining potential molecular markers of AD from high-dimensional, multi-modal neuroimaging and omics data will provide a new basis for early diagnosis and intervention in AD. In order to discover the real pathogenic mutation and even understand the pathogenic mechanism of AD, lots of machine learning methods have been designed and successfully applied to the analysis and processing of large-scale AD biomedical data. Objective: To introduce and summarize the applications and challenges of machine learning methods in Alzheimer's disease multi-source data analysis.Entities:
Keywords: Alzheimer's disease; association analysis; disease diagnosis; genome-wide; machine learning; multi-modal data fusion
Year: 2021 PMID: 35386189 PMCID: PMC8922327 DOI: 10.2174/1389202923666211216163049
Source DB: PubMed Journal: Curr Genomics ISSN: 1389-2029 Impact factor: 2.689
KEGG pathway analysis results.
| ID |
|
|
|
|---|---|---|---|
| 1 | Oxidative phosphorylation | 2.38E-30 | NDUFB7,NDUFS7,NDUFS4,NDUFB8,MT-ND6,MT-ND1,MT-ND4L,MT-ND5,MT-ND4,MT-ND2,MT-ND3,NDUFA8,NDUFS1,NDUFA3,NDUFA6 |
| 2 | Retrograde endocannabinoid signaling | 5.61E-30 | NDUFB7,NDUFS7,NDUFS4,NDUFB8,MT-ND6,MT-ND1,MT-ND4L,MT-ND5,MT-ND4,MT-ND2,MT-ND3,NDUFA8,NDUFS1,NDUFA3,NDUFA6 |
| 3 | Thermogenesis | 2.66E-27 | NDUFB7,NDUFS7,NDUFS4,NDUFB8,MT-ND6,MT-ND1,MT-ND4L,MT-ND5,MT-ND4,MT-ND2,MT-ND3,NDUFA8,NDUFS1,NDUFA3,NDUFA6 |
| 4 | Parkinson disease | 3.94E-27 | NDUFB7,NDUFS7,NDUFS4,NDUFB8,MT-ND6,MT-ND1,MT-ND4L,MT-ND5,MT-ND4,MT-ND2,MT-ND3,NDUFA8,NDUFS1,NDUFA3,NDUFA6 |
| 5 | Prion disease | 1.33E-26 | NDUFB7,NDUFS7,NDUFS4,NDUFB8,MT-ND6,MT-ND1,MT-ND4L,MT-ND5,MT-ND4,MT-ND2,MT-ND3,NDUFA8,NDUFS1,NDUFA3,NDUFA6 |
| 6 | Huntington disease | 6.15E-26 | NDUFB7,NDUFS7,NDUFS4,NDUFB8,MT-ND6,MT-ND1,MT-ND4L,MT-ND5,MT-ND4,MT-ND2,MT-ND3,NDUFA8,NDUFS1,NDUFA3,NDUFA6 |
| 7 | Alzheimer disease | 6.04E-25 | NDUFB7,NDUFS7,NDUFS4,NDUFB8,MT-ND6,MT-ND1,MT-ND4L,MT-ND5,MT-ND4,MT-ND2,MT-ND3,NDUFA8,NDUFS1,NDUFA3,NDUFA6 |
| 8 | Amyotrophc lateral sclerosis | 6.04E-25 | NDUFB7,NDUFS7,NDUFS4,NDUFB8,MT-ND6,MT-ND1,MT-ND4L,MT-ND5,MT-ND4,MT-ND2,MT-ND3,NDUFA8,NDUFS1,NDUFA3,NDUFA6 |
| 9 | Metabolic pathways | 5.90E-16 | NDUFB7,NDUFS7,NDUFS4,NDUFB8,MT-ND6,MT-ND1,MT-ND4L,MT-ND5,MT-ND4,MT-ND2,MT-ND3,NDUFA8,NDUFS1,NDUFA3,NDUFA6 |
| 10 | Non-alcoholic fatty liver disease | 3.77E-12 | NDUFB7,NDUFS7,NDUFS4,NDUFB8,NDUFA8,NDUFS1,NDUFA3,NDUFA6 |
The comparison of multigene association analysis methods.
| Methods | Method Principle | Data | Results and Conclusion | References |
|---|---|---|---|---|
| Gene-wide association analysis of SORL1 | Genotype by an array-based method, gene-wide associations | 936 samples, saliva and sMRI data | The SORLl gene is associated with differences in hippocampal volume in young, healthy adults. | [ |
| G-SMuRFS | Group-sparse multi-task regression and feature selection | 632 samples, genetic and sMRI data from the ADNI-1 dataset | Simulation studies demonstrate that the interval estimates obtained using the approach achieve adequate coverage probabilities that outperform those obtained from the nonparametric bootstrap. | [ |
| bgsmtr | Bayesian group sparse multi- task regression | 632 samples, SNP and sMRI data from ADNI dataset | The prediction performance of the G-SMuRFS method was consistently better than conventional multi-variate linear regression and ridge regression, and these selected SNPs could predict the responses of multiple imaging phenotypes at the same time. | [ |
| TGSCCA | Temporally constrained group sparse canonical correlation analysis framework | 114 samples, the genotyping and longitudinal imaging data from ADNI | The method can achieve strong associations and discover phenotypic biomarkers across multiple time points to guide disease-progressive interpretation. | [ |
The comparison of multi-modal data fusion for subtypes identification and classification diagnosis.
| Methods | Method Principle | Data | Results and Conclusion | References |
|---|---|---|---|---|
| ssCCA | Structured and sparse CCA | fMRI and sMRI data of AD patients (n=34) and NC subjects (n=42) from the ADNI database. | The unsupervised method differentiates the transition pattern between the subject-course of AD patients and NC subjects. | [ |
| DCN | Integration of temporal and spatial properties of dynamic connectivity networks | 149 subjects, including 50 NCs, | Simulation studies demonstrate that the interval estimates obtained using the approach achieve adequate coverage probabilities that outperform those obtained from the nonparametric bootstrap. | [ |
| mCCA+jICA | Canonical correlation analysis plus joint | 35 healthy controls, 24 AD subjects, and 23 VCI subjects. | Results showed that | [ |
The comparison of diagnosis of Alzheimer's disease methods based on multi-modal image data fusion.
| Method | Method Principle | Data | Result and Conclusion | References |
|---|---|---|---|---|
| rMLTFL | Robust multi-label transfer feature learning. | 406 samples, MRI and CSF data from the ADNI database. | rMLTFL model that can simultaneously utilize the multi-bit label coding vectors and the original class labels for subjects to capture a common set of features among multiple relevant domains, identify the unrelated domains, and improve the performance of AD diagnosis. | [ |
| Discriminative sparse learning | Discriminative sparse learning method with relational regularization. | 805 subjects, including 226 AD patients, 393 MCI subjects, and 186 NC subjects from ADNI database. | Obtain a classification accuracy of 94.68% for AD | [ |
| -MKMTL | Multikernel-based MTL method. | The MRI features used are based on the imaging data from the ADNI database processed by a team from UCSF. | the multi-kernel multitask learning method not only yields superior performance on regression performance but also is a powerful tool for fusing multimodalities data. | [ |
| MKL model | Integrate complementary | T1-weighted MRI and Mean Diffusivity (MD) maps from the DTI modality of 45 AD patients, 52 NC, and 58 MCI subjects from the ADNI database. | The classification accuracies obtained by the method are 90.2%, 79.42%, and 76.63% for respectively AD | [ |
| MM-SDPN | SDPN is first used to learn high- level features of MRI and PET, which are then fed to another SDPN to fuse multimodal neuroimaging information. The MM-SDPN | 202 samples, MRI and PET images from | The model has better performance than the existing AD diagnostic methods for multi-modal feature learning. | [ |