| Literature DB >> 27610127 |
Yan Jin1, Yi Su2, Xiao-Hua Zhou3, Shuai Huang1.
Abstract
By 2050, it is estimated that the number of worldwide Alzheimer's disease (AD) patients will quadruple from the current number of 36 million, while no proven disease-modifying treatments are available. At present, the underlying disease mechanisms remain under investigation, and recent studies suggest that the disease involves multiple etiological pathways. To better understand the disease and develop treatment strategies, a number of ongoing studies including the Alzheimer's Disease Neuroimaging Initiative (ADNI) enroll many study participants and acquire a large number of biomarkers from various modalities including demographic, genotyping, fluid biomarkers, neuroimaging, neuropsychometric test, and clinical assessments. However, a systematic approach that can integrate all the collected data is lacking. The overarching goal of our study is to use machine learning techniques to understand the relationships among different biomarkers and to establish a system-level model that can better describe the interactions among biomarkers and provide superior diagnostic and prognostic information. In this pilot study, we use Bayesian network (BN) to analyze multimodal data from ADNI, including demographics, volumetric MRI, PET, genotypes, and neuropsychometric measurements and demonstrate our approach to have superior prediction accuracy.Entities:
Keywords: ADNI; Alzheimer’s disease; Bayesian network; Heterogeneous; Multimodal biomarkers
Year: 2016 PMID: 27610127 PMCID: PMC4992017 DOI: 10.1186/s13637-016-0046-9
Source DB: PubMed Journal: EURASIP J Bioinform Syst Biol ISSN: 1687-4145
Subject information at baseline
| AD ( | MCI ( | HC ( | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Mean | SD | Range | Mean | SD | Range | Mean | SD | Range | |
| Age | 74.6 | 8.1 | 56.5–89.6 | 73.9 | 6.7 | 58.5–90.6 | 73.4 | 7.3 | 55.0–89.6 |
| Edu | 16.0 | 2.6 | 8.0–20.0 | 16.4 | 2.7 | 9.0–20.0 | 16.8 | 2.6 | 9.0–20.0 |
| MMSE | 23.8 | 1.6 | 20.0–26.0 | 27.0 | 2.1 | 24.0–30.0 | 28.8 | 1.9 | 24.0–30.0 |
| ADAS | 15.5 | 7.8 | 4.0–51.0 | 14.6 | 9.5 | 0.0–51.0 | 10.8 | 8.8 | 3.0–31.0 |
Description of heterogeneous multimodal biomarkers
| Biomarker | Description |
|---|---|
| Age | Age |
| Sex | Gender |
| Edu | Years of education |
| FDG | Average FDG-PET |
| AV45 | Average AV45 SUVR |
| HippoNV | The normalized hippocampus volume |
| APOE4 | Apolipoprotein E4 polymorphism |
| rs3818361 | CR1 gene rs3818361 polymorphism |
| rs744373 | BIN1 gene rs744373 polymorphism |
| rs11136000 | Clusterin CLU gene rs11136000 polymorphism |
| rs610932 | MS4A6A gene rs610932 polymorphism |
| rs3851179 | PICALM gene rs3851179 polymorphism |
| rs3764650 | ABCA7 gene rs3764650 polymorphism |
| rs3865444 | CD33 gene rs3865444 polymorphism |
| MMSE | Mini-mental state examination |
| ADAS-cog | Alzheimer’s Disease Assessment Scale |
Fig. 1Learn mixed type Bayesian network using heterogeneous multimodality data at baseline
Ten fold cross validation MSE result
| Mean (SD) | ||
|---|---|---|
| MMSE | ADAS-cog | |
| Bayesian network | 2.810 (0.441) | 35.380 (3.244) |
| Linear regression | 3.125 (0.439) | 38.748 (4.364) |
| Decision tree | 3.758 (0.552) | 42.195 (4.306) |
| Random forest | 2.914 (0.330) | 35.218 (4.932) |
Fig. 2Visualization for heterogeneous correlation matrix
RuleFit: 10 most important rules
| Impo. | Supp. | Rule |
|---|---|---|
| y: MMSE | ||
| 100 | 0.78 | 61.85 < |
| 91.3 | 0.81 | AGE <85.75 and FDG >5.78 |
| 74.6 | 0.15 | FDG <5.85 and AV45 >1.11 |
| 68.2 | 0.06 | EDU <19.5 and HippoNV <0.38 and APOE4=1 |
| 46.9 | 0.75 | 5.76 < |
| y: ADAS-cog | ||
| 100 | 0.73 | FDG >5.75 and HippoNV >0.39 |
| 62.5 | 0.65 | FDG >4.9 and 1.02 <AV45 <1.51 |
| 41.2 | 0.72 | EDU <19.5 and HippoNV <0.55 |
| 41 | 0.44 | FDG >6.34 and rs3764650=0 |
| 35.9 | 0.17 | 1.23 <AV45 <1.63 and rs744373=0 |