| Literature DB >> 30010762 |
Hongjiu Zhang1, Fan Zhu1,2, Hiroko H Dodge3,4,5, Gerald A Higgins1, Gilbert S Omenn1,6,7,8, Yuanfang Guan1,6,9.
Abstract
Motivation: Heterogeneous diseases such as Alzheimer's disease (AD) manifest a variety of phenotypes among populations. Early diagnosis and effective treatment offer cost benefits. Many studies on biochemical and imaging markers have shown potential promise in improving diagnosis, yet establishing quantitative diagnostic criteria for ancillary tests remains challenging.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30010762 PMCID: PMC6054197 DOI: 10.1093/gigascience/giy085
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:An overview of the AD diagnostic model. AD: Alzheimer's disease; CN: cognitive normal; MCI: mild cognitive impairment.
Figure 2:Violin plots of performance of different models estimated by cross-validation. (A) Performance of MMSE regression evaluated in terms of Pearson product-moment correlation coefficient. (B) Performance of MMSE regression evaluated in terms of Lin concordance correlation coefficient. (C) Performance of diagnostic predictions evaluated in terms of area under the curve. The average scores are labeled correspondingly. The final model performance is marked with a white body. GB: gradient boosting regression tree; GPR: Gaussian process regression with custom kernel; L: linear regression without regularization; L1: LASSO regression; L2: ridge regression; RF: random forest; SVM: kernel support vector regressor; XGB: XGBoost.
Figure 3:PCA over the dot-product similarity matrix (A and B) and the custom kernel similarity matrix (C and D). PCA on kernel matrix revealed patterns of different disease progressions in the transformed feature space.
Figure 4:Similarity networks of subjects in the ADNI dataset, decomposited by Girvan-Newman algorithm. (A) Visualization of the clusters. Subjects are colored according to baseline diagnosis. (B) The distribution of three diagnostic types in two clusters. (C) The distribution of MCI-to-AD conversion/nonconversion subjects in two clusters.
Figure 5:Violin plots of performance of different diagnostic models estimated on Parkinson's disease dataset by cross-validation. Performance is evaluated in terms of F1 scores, ratio of correct predictions, and AUC.The average scores are labeled correspondingly. GB: gradient boosting regression tree; GPR: Gaussian process regression with custom kernel; Linear: linear regression without regularization; RF: random forest; Ridge: ridge regression; SVM: kernel support vector regressor.