| Literature DB >> 29780871 |
Laura Hernández-Domínguez1, Sylvie Ratté1, Gerardo Sierra-Martínez2, Andrés Roche-Bergua3.
Abstract
INTRODUCTION: We present a methodology to automatically evaluate the performance of patients during picture description tasks.Entities:
Keywords: Alzheimer's disease (AD); Automatic assessment; Information coverage; Linguistic analysis; Machine learning; Mild cognitive impairment (MCI); Phonetic features; Picture description task
Year: 2018 PMID: 29780871 PMCID: PMC5956933 DOI: 10.1016/j.dadm.2018.02.004
Source DB: PubMed Journal: Alzheimers Dement (Amst) ISSN: 2352-8729
Linguistic characteristics selected to evaluate patients' language functions
| Measure | Equation | Interpretation |
|---|---|---|
| Text size | Number of words used in a text | |
| Vocabulary size | Number of different lemmas | |
| Hapax legomena | Number of lemmas mentioned only once | |
| Hapax dislegomena | Number of lemmas mentioned exactly twice | |
| Rationalization of the size of the vocabulary and the length of the text. | ||
| Honoré's | R = 100 · log | A measure based on the ratio of hapax legomena, vocabulary size, and the length of the text |
| Type-token ratio (TTR) | TTR measures the ratio of hapax legomena and the size of the vocabulary. It can be sensitive to the size of the sample | |
| Sichel's | Similar to TTR, but using hapax dislegomena, being more robust against samples of different sizes | |
| Yule's characteristic | Yule's is a measure of lexical repetition considered to be text length independent. In this measure, the number of lemmas of frequency | |
| Entropy | Entropy measures the uniformity of the vocabulary. In the equation, |
Lemmas refer to words without inflections (in their canonical form).
Distribution of interviews used for experimentation
| Variable | All (n = 517) | AD (n = 257) | HC (n = 217) | MCI (n = 43) |
|---|---|---|---|---|
| Participants | 262 | 169 | 74 | 19 |
| Gender | ||||
| Male | 189 | 87 | 75 | 27 |
| Female | 328 | 170 | 142 | 16 |
| Education (years) | ||||
| 6–9 | 55 | 51 | 2 | 2 |
| 10–12 | 200 | 112 | 79 | 9 |
| 13–16 | 209 | 76 | 111 | 22 |
| 17+ | 53 | 18 | 25 | 10 |
| Age (years) | ||||
| Under 50 | 6 | 0 | 5 | 1 |
| 50–59 | 81 | 21 | 57 | 3 |
| 60–69 | 188 | 81 | 94 | 13 |
| 70–79 | 190 | 111 | 57 | 22 |
| 80+ | 52 | 44 | 4 | 4 |
Abbreviations: AD, Alzheimer's disease; HC, healthy elderly control; MCI, mild cognitive impairment; n, number of tests.
Active voice linguistic patterns used for the coverage measure (reproduced from Velazquez [27])
| Interpretation | Example | ||
|---|---|---|---|
| N-V | N-V | Subject + action | “boy stealing” |
| V-N | V-N | Action over an object | “stealing cookies” |
| P-N | P-N | Locations + indirect objects | “in kitchen” |
| N-V | V-N | Subject + action + object | “woman washing dishes” |
Abbreviations: p, pattern; R, referent; S, subject of comparison; N, noun; V, verb; P, preposition.
Fig. 1Linguistic variations of ICUs in the Cookie Theft picture description task. The standardized name of each group is shown. Abbreviation: ICUs, information content units.
Fig. 2Data set partitioning during a 10-fold cross-validation process to evaluate classifiers. The blue section indicates the part of the dataset that is being used as test, while the remaining gray area indicates the part of the dataset being used as training set in each fold.
Correlations∗ of features with the severity of cognitive impairment and with the MMSE
| Correlation to cognitive impairment | Correlation to MMSE score | ||
|---|---|---|---|
| Variable | Corr. | Variable | Corr. |
| Informativeness t = 100% | −0.408 | Informativeness t = 100% | 0.443 |
| Informativeness t = 80% | −0.388 | Informativeness t = 80% | 0.437 |
| Informativeness score | −0.334 | Informativeness score | 0.429 |
| Informativeness t = 60% | −0.333 | Informativeness t = 60% | 0.372 |
| Informativeness variance | −0.257 | Informativeness variance | 0.338 |
| Hapax legomena | −0.254 | Auxiliary verb frequency | 0.305 |
| Pertinence t = 100% | −0.222 | Hapax legomena | 0.265 |
| Auxiliary verb frequency | −0.216 | Auxiliary verb rate | 0.241 |
| MFCC-12 kurtosis | 0.205 | Noun frequency | 0.226 |
| Pertinence t = 80% | −0.201 | Preposition rate | 0.194 |
| MFCC-8 kurtosis | 0.198 | Pertinence t = 100% | 0.192 |
| MFCC-12 skewness | −0.185 | General entropy | 0.189 |
| Noun frequency | −0.183 | Vocabulary size | 0.187 |
| Honoré's | −0.180 | Pertinence t = 80% | 0.183 |
| MFCC-10 kurtosis | 0.163 | Honoré's | 0.177 |
| Conjunction rate | 0.163 | Preposition frequency | 0.175 |
| Vocabulary size | −0.156 | MFCC-12 skewness | 0.173 |
Abbreviations: MMSE, Mini–Mental State Examination; Corr, correlation; t, threshold; MFCC, Mel-Frequency Cepstral Coefficient.
All correlations presented with P value < .001. Variables are shown in descending order with respect to the strength of their correlation.
Controlled for education, age, and gender.
Correlations∗ of features with socioeconomic variables
| Age | Gender | Education | |||
|---|---|---|---|---|---|
| Variable | Corr. | Variable | Corr. | Variable | Corr. |
| MFCC-3 kurtosis | −0.200 | MFCC-10 kurtosis | −0.239 | Preposition freq. | 0.230 |
| Conjunction freq. | 0.182 | MFCC-12 variance | 0.181 | Hapax legomena | 0.222 |
| Brunet's | 0.179 | MFCC-13 variance | 0.179 | Vocabulary size | 0.219 |
| General entropy | 0.177 | MFCC-5 skewness | 0.175 | Text size | 0.207 |
| Auxiliary verb freq. | 0.175 | MFCC-5 variance | 0.174 | General entropy | 0.201 |
| MFCC-1 variance | 0.172 | MFCC-8 skewness | −0.169 | Adjective freq. | 0.200 |
| MFCC-6 kurtosis | −0.168 | MFCC-10 skewness | 0.163 | Conjunction freq. | 0.191 |
| MFCC-5 kurtosis | −0.164 | Noun freq. | 0.190 | ||
| MFCC-9 kurtosis | −0.158 | Informativeness t = 60% | 0.190 | ||
| Informativeness score | 0.156 | Auxiliary verb freq. | 0.184 | ||
| Verb freq. | 0.172 | ||||
| Informativeness score | 0.169 | ||||
| Brunet's | 0.167 | ||||
Abbreviations: Corr, correlation; MFCC, Mel-Frequency Cepstral Coefficient; freq., frequency.
All correlations presented with P value < .001. Variables are shown in descending order with respect to the strength of their correlation.
Controlled for education, gender, and cognitive status.
Controlled for age, education, and cognitive status.
Controlled for age, gender, and cognitive status.
Performance of classifiers separating HCs from AD patients
| Learner | Features | Accuracy | Sensitivity | Specificity | Precision | F-score | AUC |
|---|---|---|---|---|---|---|---|
| Average performance | |||||||
| RFC | Ling | 0.72 | 0.76 | 0.67 | 0.74 | 0.75 | 0.72 |
| SVM | Ling | 0.75 | 0.75 | 0.74 | 0.77 | 0.76 | 0.75 |
| RFC | Cov | 0.73 | 0.78 | 0.67 | 0.73 | 0.75 | 0.72 |
| SVM | Cov | 0.74 | 0.80 | 0.67 | 0.74 | 0.77 | 0.74 |
| RFC | Phon | 0.59 | 0.66 | 0.52 | 0.62 | 0.64 | 0.59 |
| SVM | Phon | 0.62 | 0.70 | 0.52 | 0.63 | 0.66 | 0.61 |
| RFC | Cov + Ling | 0.78 | 0.84 | 0.72 | 0.78 | 0.81 | 0.78 |
| SVM | Cov + Ling | ||||||
| RFC | Best | 0.75 | 0.78 | 0.71 | 0.76 | 0.77 | 0.74 |
| SVM | Best | ||||||
| Best model | |||||||
| RFC | Ling | 0.81 | 0.77 | 0.86 | 0.87 | 0.82 | 0.82 |
| SVM | Ling | 0.85 | 0.85 | 0.86 | 0.88 | 0.86 | 0.85 |
| RFC | Cov | 0.85 | 0.88 | 0.82 | 0.85 | 0.87 | 0.85 |
| SVM | Cov | 0.85 | 0.88 | 0.82 | 0.85 | 0.87 | 0.85 |
| RFC | Phon | 0.67 | 0.65 | 0.68 | 0.71 | 0.68 | 0.67 |
| SVM | Phon | 0.72 | 0.84 | 0.57 | 0.70 | 0.76 | 0.71 |
| RFC | Cov + Ling | ||||||
| SVM | Cov + Ling | 0.88 | 0.81 | 0.95 | 0.95 | 0.88 | 0.88 |
| RFC | Best | 0.85 | 0.85 | 0.86 | 0.88 | 0.86 | 0.85 |
| SVM | Best | 0.87 | 0.80 | 0.95 | 0.95 | 0.87 | 0.88 |
Abbreviations: AD, Alzheimer's disease; AUC, area under the curve of receiver operating characteristics; HCs, healthy elderly controls; RFC, Random Forests Classifier; SVM, Support Vector Machine classifier; Ling, set of all linguistic features; Cov, set of all information coverage features; Phon, set of all phonetic features; Cov + Ling, a combination of all linguistic and information coverage features.
NOTE. The best results are indicated in bold.
A combination of all features with P value < .001 when correlating with cognitive impairment.
Performance of classifiers separating HCs from cognitively impaired patients (AD or MCI, indistinctly)
| Learner | Features | Accuracy | Sensitivity | Specificity | Precision | F-score | AUC |
|---|---|---|---|---|---|---|---|
| Average performance | |||||||
| RFC | Ling | 0.70 | 0.78 | 0.59 | 0.73 | 0.75 | 0.69 |
| SVM | Ling | 0.72 | 0.80 | 0.61 | 0.74 | 0.77 | 0.70 |
| RFC | Cov | 0.74 | 0.83 | 0.61 | 0.75 | 0.79 | 0.72 |
| SVM | Cov | 0.73 | 0.86 | 0.56 | 0.73 | 0.79 | 0.71 |
| RFC | Phon | 0.59 | 0.79 | 0.31 | 0.61 | 0.69 | 0.55 |
| SVM | Phon | 0.61 | 0.81 | 0.33 | 0.62 | 0.70 | 0.57 |
| RFC | Cov + Ling | 0.76 | 0.84 | 0.66 | 0.77 | 0.81 | 0.75 |
| SVM | Cov + Ling | ||||||
| RFC | Best | 0.77 | 0.82 | 0.69 | 0.78 | 0.80 | 0.75 |
| SVM | Best | 0.75 | 0.82 | 0.65 | 0.76 | 0.79 | 0.73 |
| Best model | |||||||
| RFC | Ling | 0.78 | 0.80 | 0.76 | 0.83 | 0.81 | 0.78 |
| SVM | Ling | 0.87 | 0.87 | 0.86 | 0.90 | 0.88 | 0.87 |
| RFC | Cov | 0.83 | 0.87 | 0.77 | 0.84 | 0.85 | 0.82 |
| SVM | Cov | 0.83 | 0.90 | 0.73 | 0.82 | 0.86 | 0.81 |
| RFC | Phon | 0.67 | 0.90 | 0.36 | 0.66 | 0.76 | 0.63 |
| SVM | Phon | 0.65 | 0.90 | 0.29 | 0.64 | 0.75 | 0.59 |
| RFC | Cov + Ling | 0.85 | 0.87 | 0.82 | 0.87 | 0.87 | 0.84 |
| SVM | Cov + Ling | 0.85 | 0.87 | 0.82 | 0.87 | 0.87 | 0.84 |
| RFC | Best | ||||||
| SVM | Best | 0.83 | 0.83 | 0.82 | 0.86 | 0.85 | 0.83 |
Abbreviations: AD, Alzheimer's disease; AUC, area under the curve of receiver operating characteristics; HCs, healthy elderly controls; RFC, Random Forests Classifier; SVM, Support Vector Machine classifier; Ling, set of all linguistic features; Cov, set of all information coverage features; Phon, set of all phonetic features; Cov + Ling, a combination of all linguistic and information coverage features; MCI, mild cognitive impairment.
NOTE. The best results are indicated in bold.
A combination of all features with P value < .001 when correlating with cognitive impairment.