| Literature DB >> 30403676 |
Sylvester Olubolu Orimaye1, Jojo Sze-Meng Wong2, Chee Piau Wong3.
Abstract
It has been quite a challenge to diagnose Mild Cognitive Impairment due to Alzheimer's disease (MCI) and Alzheimer-type dementia (AD-type dementia) using the currently available clinical diagnostic criteria and neuropsychological examinations. As such we propose an automated diagnostic technique using a variant of deep neural networks language models (DNNLM) on the verbal utterances of affected individuals. Motivated by the success of DNNLM on natural language tasks, we propose a combination of deep neural network and deep language models (D2NNLM) for classifying the disease. Results on the DementiaBank language transcript clinical dataset show that D2NNLM sufficiently learned several linguistic biomarkers in the form of higher order n-grams to distinguish the affected group from the healthy group with reasonable accuracy on very sparse clinical datasets.Entities:
Year: 2018 PMID: 30403676 PMCID: PMC6221274 DOI: 10.1371/journal.pone.0205636
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Deep-deep neural network language models.
Details of n-gram vocabularies from the MCI and AD-type dementia datasets.
| Vocabulary | 4-grams | 5-grams |
|---|---|---|
| MCI | 841 | 748 |
| Control | 915 | 806 |
| Total | 1756 | 1554 |
| Unique | 1642 | 1508 |
| AD-type dementia | 3781 | 3264 |
| Control | 4054 | 3533 |
| Total | 7835 | 6797 |
| Unique | 6948 | 6475 |
Percentages of transcript files for training, test, and validation sets for the MCI and AD-type dementia datasets.
| Dataset | MCI/Control | AD-type dementia/Control |
|---|---|---|
| Training | 50% | 50% |
| Test | 25% | 25% |
| Validation | 25% | 25% |
| Total (size) | 100% (38) | 100% (198) |
% error and perplexity on MCI held-out test set.
(h = Hidden layer size; Bz = Batch size).
| Models | (%) Error | Perplexity |
|---|---|---|
| D2NNLM-4n (n = 4,h = 11,Bz = 9) | ||
| D2NNLM-5n (n = 5,h = 19,Bz = 6) | ||
| DNNLM (n = 4,h = 300,Bz = 5) | 20.0 | 1.6 |
| NNLM (n = 4,h = 150,Bz = 4) | 25.0 | 1.6 |
% error and perplexity on AD-type dementia held-out test set (h = Hidden layer size; Bz = Batch size).
| Models | (%) Error | Perplexity |
|---|---|---|
| D2NNLM-4n (n = 4,h = 7,Bz = 29) | ||
| D2NNLM-5n (n = 5,h = 127,Bz = 35) | ||
| DNNLM (n = 4,h = 300,Bz = 34) | 26.5 | 1.6 |
| NNLM (n = 4,h = 300,Bz = 20) | 27.5 | 1.5 |
Fig 2% Error of the D2NNLMs vs. DNNLM on MCI dataset with smaller number of hidden layers.
Fig 3% Error of the D2NNLMs vs. DNNLM on AD-type dementia dataset with smaller number of hidden layers.
Fig 4Perplexity comparison between the D2NNLMs and DNNLM on the MCI dataset with smaller number of hidden layers.
Fig 5Perplexity comparison between the D2NNLMs and DNNLM on the AD-type dementia dataset with smaller number of hidden layers.
Performance comparison with the LPOCV AUC on the MCI dataset, N = 38.
| Models | AUC | s.d | SE | 95% CI of AUC | |
|---|---|---|---|---|---|
| D2NNLM-4n | |||||
| D2NNLM-5n | |||||
| DNNLM | 0.68 | 1.60 | 0.26 | 0.009 | 0.171 to 1.000 |
| 4-gram LM | 0.61 | 1.70 | 0.28 | 0.027 | 0.069 to 1.000 |
| Speech Measures | 0.47 | 1.80 | 0.29 | 0.107 | -0.102 to 1.000 |
| Word Alignment | 0.63 | 1.70 | 0.28 | 0.022 | 0.089 to 1.000 |
Performance comparison with the LPOCV AUC on the AD-type dementia dataset, N = 198.
| Models | AUC | s.d | SE | 95% CI of AUC | |
|---|---|---|---|---|---|
| D2NNLM-4n | |||||
| D2NNLM-5n | |||||
| DNNLM | 0.73 | 1.60 | 0.11 | <0.001 | 0.507 to 0.953 |
| 4-gram LM | 0.72 | 3.60 | 0.26 | 0.005 | 0.218 to 1.000 |
| Speech Measures | 0.73 | 3.50 | 0.25 | 0.003 | 0.242 to 1.000 |
| Word Alignment | 0.68 | 3.70 | 0.26 | 0.010 | 0.165 to 1.000 |