| Literature DB >> 31797582 |
Lodewijk Brand1, Kai Nichols, Hua Wang, Heng Huang, Li Shen.
Abstract
Alzheimer's disease (AD) is a serious neurodegenerative condition that affects millions of people across the world. Recently machine learning models have been used to predict the progression of AD, although they frequently do not take advantage of the longitudinal and structural components associated with multi-modal medical data. To address this, we present a new algorithm that uses the multi-block alternating direction method of multipliers to optimize a novel objective that combines multi-modal longitudinal clinical data of various modalities to simultaneously predict the cognitive scores and diagnoses of the participants in the Alzheimer's Disease Neuroimaging Initiative cohort. Our new model is designed to leverage the structure associated with clinical data that is not incorporated into standard machine learning optimization algorithms. This new approach shows state-of-the-art predictive performance and validates a collection of brain and genetic biomarkers that have been recorded previously in AD literature.Entities:
Mesh:
Year: 2020 PMID: 31797582 PMCID: PMC6948350
Source DB: PubMed Journal: Pac Symp Biocomput ISSN: 2335-6928
Fig. 1.Visualization of the input (X), coefficient (V) and output (Y) tensors. In each time-point of X the K modalities (MRI, SNP, etc.) are explicitly defined to facilitate the calculation of the group l1-norm. The goal of the proposed method is to learn a joint model V that can effectively map X to the cognitive scores and diagnoses encoded in Y.
Root mean-squared error values and standard deviations between the true and predicted RAVLT scores for the proposed method compared against an array of widely used machine learning algorithms. RAVLT scores vary between 0 and 74.
| Model | RAVLTTOT | RAVLT30 | RAVLT30-RECOG |
|---|---|---|---|
| 4.19e11±7.20e11 | 1.06e12±1.34e12 | 8.85e11±1.06e12 | |
| 18.9±0.888 | 20.5±1.17 | ||
| 19.4±0.913 | 21.1±1.29 | 20.0±0.957 | |
| 19.2±0.961 | 20.7±1.25 | 19.8±1.05 | |
| 19.8±0.928 | |||
Multi-class F scores and their standard deviations, of the iterated five-fold cross validation experiments, for predicting the cognitive status of ADNI participants averaged over each time-point.
| Model | ||||
|---|---|---|---|---|
| 0.265±0.0276 | 0.313±0.0562 | 0.396±0.0299 | ||
| 0.325±0.0201 | 0.415±0.0466 | 0.401±0.0308 | 0.386±0.0325 | |
| 0.289±0.0341 | 0.474±0.0450 | 0.363±0.0254 | 0.396±0.0286 | |
| 0.330±0.0415 | 0.472±0.0524 | 0.410±0.0388 | 0.420±0.0332 | |
| 0.312±0.0588 | 0.475±0.0523 | 0.341±0.0737 | 0.400±0.0366 | |
| 0.255±0.070 | 0.447±0.0485 | 0.405±0.0655 | 0.390±0.0284 | |
| 0.308±0.038 | 0.448±0.0381 | 0.332±0.0364 | 0.378±0.0311 | |
| 0.415±0.0222 | ||||
The solution algorithm to optimize Eq. (2).
Fig. 2.Left: The proposed objective in Eq. 7 plotted over one-hundred iterations of Algorithm 1. In each run the primal and dual variables are randomly re-initialized. Right: The difference between the introduced variables designed to decouple the terms in Eq. (2).
Fig. 3.Top-5 ordered biomarkers in the FreeSurfer modality at each time-point. The identified biomarkers, listed on the far-left and far-right, are ordered from largest coefficient (top) to smallest (bottom) derived from V.
The top-30 SNPs identified by our algorithm.
| 1. rs429358[ | 7. rs17477673 | 13. rs7894245 | 19. rs2994978 | 25. rs212525[ |
| 2. rs7870463 | 8. rs11218301 | 14. rs4310446[ | 20. rs6746923 | 26. rs17477827 |
| 3. rs9461735 | 9. rs11687624 | 15. rs439401[ | 21. rs1801133[ | 27. rs2177828 |
| 4. rs6139494 | 10. rs405509[ | 16. rs1556758 | 22. rs7945931[ | 28. rs7036781[ |
| 5. rs17561 | 11. rs17123514 | 17. rs2248478 | 23. rs4631890 | 29. rs2627641 |
| 6. rs749008[ | 12. rs10512186 | 18. rs6037894 | 24. rs4713432[ | 30. rs17209374 |