| Literature DB >> 28291242 |
Xiaoke Hao1, Chanxiu Li1, Lei Du2, Xiaohui Yao3, Jingwen Yan3,4, Shannon L Risacher3, Andrew J Saykin3, Li Shen3, Daoqiang Zhang1.
Abstract
Neuroimaging genetics is an emerging field that aims to identify the associations between genetic variants (e.g., single nucleotide polymorphisms (SNPs)) and quantitative traits (QTs) such as brain imaging phenotypes. In recent studies, in order to detect complex multi-SNP-multi-QT associations, bi-multivariate techniques such as various structured sparse canonical correlation analysis (SCCA) algorithms have been proposed and used in imaging genetics studies. However, associations between genetic markers and imaging QTs identified by existing bi-multivariate methods may not be all disease specific. To bridge this gap, we propose an analytical framework, based on three-way sparse canonical correlation analysis (T-SCCA), to explore the intrinsic associations among genetic markers, imaging QTs, and clinical scores of interest. We perform an empirical study using the Alzheimer's Disease Neuroimaging Initiative (ADNI) cohort to discover the relationships among SNPs from AD risk gene APOE, imaging QTs extracted from structural magnetic resonance imaging scans, and cognitive and diagnostic outcomes. The proposed T-SCCA model not only outperforms the traditional SCCA method in terms of identifying strong associations, but also discovers robust outcome-relevant imaging genetic patterns, demonstrating its promise for improving disease-related mechanistic understanding.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28291242 PMCID: PMC5349597 DOI: 10.1038/srep44272
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
5-fold cross-validation results on ADNI: The model learned from the training data is used to estimate the correlation coefficients on the training set.
| Method | Correlation Coefficient on Training Set | |||||
|---|---|---|---|---|---|---|
| F1 | F2 | F3 | F4 | F5 | Mean + Std | |
| BM-SCCA | 0.2619 | 0.2810 | 0.1846 | 0.2679 | 0.2755 | 0.2542 ± 0.0396 |
| CS-SCCA | 0.3436 | 0.3819 | 0.3743 | 0.3536 | 0.3798 | 0.3667 ± 0.0171 |
| DS-SCCA | 0.3519 | 0.3843 | 0.3848 | 0.3584 | 0.3822 | 0.3723 ± 0.0159 |
5-fold cross-validation results on ADNI: The model learned from the training data is used to estimate the correlation coefficients on the testing set.
| Method | Correlation Coefficient on Test Set | |||||
|---|---|---|---|---|---|---|
| F1 | F2 | F3 | F4 | F5 | Mean + Std | |
| BM-SCCA | 0.1996 | 0.1848 | −0.0250 | 0.2845 | 0.2320 | 0.1752 ± 0.1183 |
| CS-SCCA | 0.3328 | 0.2126 | 0.2258 | 0.3275 | 0.2180 | 0.2633 ± 0.0612 |
| DS-SCCA | 0.3566 | 0.2173 | 0.2200 | 0.3139 | 0.2474 | 0.2711 ± 0.0616 |
Figure 1Heat map of average estimated canonical loadings on 85 APOE SNPs associated with 116 brain ROIs across 5-fold cross-validation respect to different methods.
Figure 2Visualization of mapping top 10 average estimated canonical loadings generated by T-SCCA (combination of CS-SCCA and DS-SCCA) onto the brain.
Characteristics of the subjects.
| Subjects | NC | SMC | EMCI | LMCI | AD |
|---|---|---|---|---|---|
| Number | 211 | 82 | 273 | 187 | 160 |
| Gender(M/F) | 109/102 | 33/49 | 153/120 | 108/79 | 95/65 |
| Age | 76.14 ± 6.53 | 72.45 ± 5.67 | 71.48 ± 7.12 | 73.86 ± 8.44 | 75.18 ± 7.88 |
| Education | 16.45 ± 2.62 | 16.78 ± 2.67 | 16.08 ± 2.62 | 16.38 ± 2.81 | 15.86 ± 2.75 |
| MMSE | 29.01 ± 1.23 | 29.00 ± 1.22 | 28.38 ± 1.54 | 27.71 ± 1.73 | 24.00 ± 2.62 |
| CDR | 0.01 ± 0.07 | 0.00 ± 0.00 | 0.49 ± 0.08 | 0.49 ± 0.07 | 0.72 ± 0.27 |
| ADNI-MEM | 1.02 ± 0.58 | 1.12 ± 0.57 | 0.60 ± 0.60 | 0.07 ± 0.67 | −0.76 ± 0.61 |
| ADNI-EF | 0.85 ± 0.69 | 0.73 ± 0.81 | 0.51 ± 0.74 | 0.18 ± 0.81 | −0.53 ± 0.91 |
Note: NC = Normal Control, SMC = Significant Memory Concern, ECMI = Early Mild Cognitive Impairment, LCMI = Late Mild Cognitive Impairment, AD = Alzheimer’s disease.