| Literature DB >> 31427959 |
Kathleen C Fraser1,2, Kristina Lundholm Fors2, Marie Eckerström3, Fredrik Öhman3,4, Dimitrios Kokkinakis2.
Abstract
Recent work has indicated the potential utility of automated language analysis for the detection of mild cognitive impairment (MCI). Most studies combining language processing and machine learning for the prediction of MCI focus on a single language task; here, we consider a cascaded approach to combine data from multiple language tasks. A cohort of 26 MCI participants and 29 healthy controls completed three language tasks: picture description, reading silently, and reading aloud. Information from each task is captured through different modes (audio, text, eye-tracking, and comprehension questions). Features are extracted from each mode, and used to train a series of cascaded classifiers which output predictions at the level of features, modes, tasks, and finally at the overall session level. The best classification result is achieved through combining the data at the task level (AUC = 0.88, accuracy = 0.83). This outperforms a classifier trained on neuropsychological test scores (AUC = 0.75, accuracy = 0.65) as well as the "early fusion" approach to multimodal classification (AUC = 0.79, accuracy = 0.70). By combining the predictions from the multimodal language classifier and the neuropsychological classifier, this result can be further improved to AUC = 0.90 and accuracy = 0.84. In a correlation analysis, language classifier predictions are found to be moderately correlated (ρ = 0.42) with participant scores on the Rey Auditory Verbal Learning Test (RAVLT). The cascaded approach for multimodal classification improves both system performance and interpretability. This modular architecture can be easily generalized to incorporate different types of classifiers as well as other heterogeneous sources of data (imaging, metabolic, etc.).Entities:
Keywords: early detection; eye-tracking; language; machine learning; mild cognitive impairment; multimodal; speech
Year: 2019 PMID: 31427959 PMCID: PMC6688130 DOI: 10.3389/fnagi.2019.00205
Source DB: PubMed Journal: Front Aging Neurosci ISSN: 1663-4365 Impact factor: 5.750
Demographic information and neuropsychological test scores, by group (mean and standard deviation).
| Age (years) | 67.8 (7.7) | 70.6 (5.8) | * | |
| Education (years) | 13.3 (3.7) | 14.3 (3.6) | n.s. | |
| Sex (F/M) | 21/8 | 14/12 | n.s. | |
| MMSE (/30) | 29.6 (0.6) | 28.2 (1.4) | *** | |
| Memory/Learning | RAVLT (total) | 45.5 (11.1) | 37.6 (10.7) | * |
| RAVLT (delayed) | 9.2 (3.6) | 5.8 (3.5) | *** | |
| RAVLT (immediate) | 9.5 (3.5) | 6.1 (3.1) | *** | |
| RCF (3 min) | 18.8 (5.1) | 15.8 (6.8) | n.s. | |
| RCF (20 min) | 18.6 (4.4) | 14.3 (7.0) | * | |
| WLM (delayed) | 21.9 (8.1) | 16.0 (10.5) | * | |
| WLM (immediate) | 25.8 (6.3) | 21.3 (7.6) | * | |
| Language | BNT | 53.3 (4.6) | 50.2 (7.6) | n.s. |
| Verbal fluency (F-A-S) | 47.2 (11.5) | 43.6 (11.1) | n.s. | |
| Similarities | 24.6 (4.7) | 24.0 (5.2) | n.s. | |
| Token test (Part 5) | 20.9 (1.4) | 20.0 (1.8) | n.s. | |
| Attention | Digit span | 13.1 (3.5) | 12.4 (2.8) | n.s. |
| Digit-symbol | 62.9 (12.3) | 54.2 (10.8) | ** | |
| TMT A | 34.1 (11.9) | 39.5 (13.3) | n.s. | |
| TMT B | 79.8 (32.9) | 97.8 (49.4) | n.s. | |
| Spatial | Block design | 40.6 (9.5) | 35.5 (12.2) | n.s. |
| RCF (copy) | 33.6 (2.4) | 32.4 (3.4) | n.s. | |
| Silhouettes | 22.4 (4.2) | 19.3 (3.3) | *** | |
| Executive | Letter-digit | 9.5 (2.3) | 8.7 (2.6) | n.s. |
| PaSMO | 68.2 (21.5) | 86.8 (29.1) | * | |
| Stroop (trial 1) | 13.2 (2.4) | 14.6 (3.1) | n.s. | |
| Stroop (trial 2) | 17.6 (3.4) | 19.4 (5.4) | n.s. | |
| Stroop (trial 3) | 24.1 (6.6) | 27.6 (6.6) | * | |
| Stroop effect | 1.8 (0.4) | 1.9 (0.5) | n.s. |
* indicates p < 0.05, ** indicates p < 0.01, *** indicates p < 0.005.
Figure 1The different levels of analysis in the multimodal dataset.
Language features (26 features).
Speech features (12 features).
Eye-movement features (22 features).
Comprehension features (11 features).
Figure 2Fusion at the feature level (early fusion).
Figure 3Fusion at the mode, task, and session levels (late fusion).
Figure 4Area under the ROC curve (AUC) for each classification configuration, estimated using leave-pair-out cross-validation. Error bars represent standard deviation over each fold in the cross-validation. PD, picture description; RS, reading silently; RA, reading aloud.
Classification results for the baseline classifier trained on neuropsychological test scores, followed by the results at each level of the cascaded classifier.
| Neuropsych. | 0.73 | 0.75 | 0.62 | 0.65 | 0.53 | 0.56 | 0.70 | 0.74 |
| PD: speech | 0.31 | 0.40 | 0.46 | 0.51 | 0.09 | 0.15 | 0.83 | 0.87 |
| PD: language | 0.70 | 0.73 | 0.69 | 0.69 | 0.52 | 0.54 | 0.86 | 0.84 |
| PD: combined | 0.71 | 0.71 | 0.67 | 0.63 | 0.55 | 0.55 | 0.79 | 0.72 |
| RS: eyes | 0.82 | 0.81 | 0.72 | 0.72 | 0.70 | 0.66 | 0.75 | 0.79 |
| RS: comp | 0.60 | 0.58 | 0.57 | 0.58 | 0.40 | 0.41 | 0.73 | 0.75 |
| RS: combined | 0.82 | 0.79 | 0.70 | 0.68 | 0.68 | 0.65 | 0.72 | 0.72 |
| RA: eyes | 0.73 | 0.71 | 0.66 | 0.65 | 0.64 | 0.62 | 0.69 | 0.68 |
| RA: comp | 0.37 | 0.38 | 0.48 | 0.41 | 0.18 | 0.06 | 0.77 | 0.76 |
| RA: speech | 0.45 | 0.64 | 0.43 | 0.56 | 0.25 | 0.38 | 0.62 | 0.73 |
| RA: combined | 0.72 | 0.72 | 0.65 | 0.65 | 0.63 | 0.62 | 0.67 | 0.67 |
| Feature fusion | 0.79 | 0.77 | 0.69 | 0.70 | 0.74 | 0.70 | 0.64 | 0.71 |
| Mode fusion | 0.83 | 0.85 | 0.75 | 0.76 | 0.56 | 0.59 | ||
| Task fusion | 0.78 | 0.89 | 0.85 | |||||
| Session fusion | 0.87 | 0.86 | 0.80 | 0.78 | 0.79 | 0.79 | 0.77 | |
PD, picture description; RS, reading silently; RA, reading aloud. The best result for each classifier and metric is indicated in boldface.
Figure 5Predictions made by the LR classifier using feature fusion, averaged across all test folds. The solid gray line indicates the default threshold of 0.5. By changing the threshold, we can adjust the sensitivity and specificity of the classification; e.g., if the threshold is increased to 0.6 (upper dashed line), only one control is mis-classified as MCI; if the threshold is decreased to 0.4 (lower dashed line), only four MCI patients are mis-classified.
Figure 6Spearman correlations between classifier predictions and neuropsychological test scores. PD, picture description; RS, reading silently; RA, reading aloud. Asterisks indicate the sign has been flipped so that all test scores correlate positively with diagnosis.