| Literature DB >> 32134942 |
Sylvester Olubolu Orimaye1,2, Karl Goodkin2, Ossama Abid Riaz2, Jean-Maurice Miranda Salcedo2, Thabit Al-Khateeb2, Adeola Olubukola Awujoola2, Patrick Olumuyiwa Sodeke2.
Abstract
There is a limited evaluation of an independent linguistic battery for early diagnosis of Mild Cognitive Impairment due to Alzheimer's disease (MCI-AD). We hypothesized that an independent linguistic battery comprising of only the language components or subtests of popular test batteries could give a better clinical diagnosis for MCI-AD compared to using an exhaustive battery of tests. As such, we combined multiple clinical datasets and performed Exploratory Factor Analysis (EFA) to extract the underlying linguistic constructs from a combination of the Consortium to Establish a Registry for Alzheimer's disease (CERAD), Wechsler Memory Scale (WMS) Logical Memory (LM) I and II, and the Boston Naming Test. Furthermore, we trained a machine-learning algorithm that validates the clinical relevance of the independent linguistic battery for differentiating between patients with MCI-AD and cognitive healthy control individuals. Our EFA identified ten linguistic variables with distinct underlying linguistic constructs that show Cronbach's alpha of 0.74 on the MCI-AD group and 0.87 on the healthy control group. Our machine learning evaluation showed a robust AUC of 0.97 when controlled for age, sex, race, and education, and a clinically reliable AUC of 0.88 without controlling for age, sex, race, and education. Overall, the linguistic battery showed a better diagnostic result compared to the Mini-Mental State Examination (MMSE), Clinical Dementia Rating Scale (CDR), and a combination of MMSE and CDR.Entities:
Year: 2020 PMID: 32134942 PMCID: PMC7058300 DOI: 10.1371/journal.pone.0229460
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary statistics for the combined dataset before multiple imputations.
The number of observations is shown for each category of sex and race. Mean (standard deviation) is shown for all other variables. n excludes observations with missing values.
| Variable | MCI-AD (n = 187) | Control (n = 261) | |
|---|---|---|---|
| SEX (Male/Female) | 77/110 | 109/152 | 0.9015 |
| RACE(White/Black/Asian) | 170/16/0 | 243/17/1 | 0.3143 |
| EDUCATION (Years) | 16.21(9.05) | 15.50(6.07) | 0.3514 |
| AGE (Years) | 85.39(7.56) | 84.18(6.91) | 0.0795 |
| CDR | 0.12(0.22) | 0.05(0.17) | <0.0000 |
| MMSE | 27.18(2.05) | 27.56(2.85) | 0.1000 |
| LM I | 10.64(4.39) | 11.51(4.76) | 0.0500 |
| LM II | 8.88(4.82) | 9.89(5.27) | 0.0400 |
| BOSTON | 25.35(3.43) | 25.85(3.74) | 0.0900 |
Summary statistics of the imputed datasets for the MCI-AD and control groups from the ten iterations of multiple imputations.
Mean (standard deviation) is shown for all other variables.
| Variable | MCI-AD (n = 1870) | Control (n = 2610) | |
|---|---|---|---|
| SEX | 1.59 (0.49) | 1.58(0.49) | 0.6948 |
| RACE | 1.61(7.15) | 1.08(0.35) | 0.0014 |
| EDUCATION (Years) | 16.21(9.03) | 15.50(6.06) | 0.0031 |
| AGE (Years) | 85.39(7.54) | 84.18(6.90) | <0.0001 |
| CDR | 0.12(0.22) | 0.05(0.17) | <0.0001 |
| MMSE | 27.18(2.05) | 27.56(2.85) | <0.0001 |
| LM I | 10.68(4.38) | 11.51(4.75) | <0.0001 |
| LM II | 8.93(4.81) | 9.89(5.26) | <0.0001 |
| BOSTON | 25.38(3.41) | 25.94(3.73) | <0.0001 |
| WORDLISTUSED | 1.43(0.74) | 1.48 (0.59) | 0.0169 |
| WORDLISTCANTREAD | 0.12(2.00) | 0.05(1.58) | 0.2012 |
| WORDLISTTRIALI | 4.51(2.73) | 4.83(1.99) | <0.0001 |
| WORDLISTTRIALII | 6.16(2.40) | 6.51(1.89) | <0.0001 |
| WORDLISTTRIALIII | 6.93(2.68) | 7.27(1.88) | <0.0001 |
| WORDLISTACQUISITION | 17.67(6.12) | 18.67(4.81) | <0.0001 |
| WORDLISTINTRUSIONS | 0.61(2.85) | 0.64(1.42) | 0.6174 |
| WORDLISTDELAYEDRECALL | 5.37(3.35) | 5.89(2.53) | <0.0001 |
| WORDLISTDELAYEDINTRUSIONS | 0.36(1.04) | 0.21(0.80) | <0.0001 |
| WORDLISTRECOGNITION | 19.16(4.80) | 19.08(1.64) | 0.4592 |
EFA structure matrix for the MCI-AD group.
Uniquely loaded variables (>0.40) are marked with asterisks. Important r and communality (Comm.) values are boldfaced.
| Variable | Factor1 | Factor2 | Factor3 | Comm. | |
|---|---|---|---|---|---|
| SEX | 0.17 | 14 | 6 | 7 | 0.03 |
| RACE | 0.10 | -33 | -22 | ||
| EDUCATION (Years) | -0.27 | -39 | -23 | -19 | 0.24 |
| AGE (Years) | 0.24 | -15 | -16 | 25 | 0.11 |
| CDR | -0.41 | -9 | -25 | -5 | 0.08 |
| MMSE | 0.30 | 0 | 21 | 36 | 0.17 |
| LM I | 4 | 8 | |||
| LM II | 5 | 7 | |||
| BOSTON | 0.30 | 0 | 23 | 22 | 0.10 |
| WORDLISTUSED | 0.26 | -2 | 2 | 22 | 0.05 |
| WORDLISTCANTREAD | -0.27 | -39 | 26 | -2 | 0.22 |
| WORDLISTTRIALI | 0.71 | 73 | 51 | 6 | 0.80 |
| WORDLISTTRIALII | -23 | 12 | |||
| WORDLISTTRIALIII | 0 | 1 | |||
| WORDLISTACQUISITION | 25 | 8 | |||
| WORDLISTINTRUSIONS | -6 | 6 | |||
| WORDLISTDELAYEDRECALL | 35 | 11 | |||
| WORDLISTDELAYEDINTRUSIONS | -0.33 | -15 | -3 | ||
| WORDLISTRECOGNITION | -11 | -10 |
EFA structure matrix for the control group.
Uniquely loaded variables (>0.40) are marked with asterisks. Important r and communality (Comm.) values are boldfaced.
| Variable | Factor1 | Factor2 | Factor3 | Comm. | |
|---|---|---|---|---|---|
| SEX | -0.18 | 3 | -9 | -16 | 0.04 |
| RACE | -0.26 | -12 | -22 | -12 | 0.08 |
| EDUCATION (Years) | 0.43 | -1 | 45 | 75 | 0.76 |
| AGE (Years) | -0.08 | -25 | 2 | -9 | 0.07 |
| CDR | -0.36 | -37 | -39 | 11 | 0.30 |
| MMSE | 32 | -22 | |||
| LM I | 13 | -7 | |||
| LM II | 11 | -9 | |||
| BOSTON | 37 | -7 | |||
| WORDLISTUSED | 3 | 0 | |||
| WORDLISTCANTREAD | -0.11 | 2 | -12 | -3 | 0.01 |
| WORDLISTTRIALI | 0.73 | 72 | 16 | -42 | 0.73 |
| WORDLISTTRIALII | 23 | 4 | |||
| WORDLISTTRIALIII | -1 | -17 | |||
| WORDLISTACQUISITION | 15 | -23 | |||
| WORDLISTINTRUSIONS | 0.50 | -19 | -23 | 51* | |
| WORDLISTDELAYEDRECALL | 29 | -36 | |||
| WORDLISTDELAYEDINTRUSIONS | 6 | -13 | |||
| WORDLISTRECOGNITION | -0.59 | 43 | 21 | -64 | 0.64 |
Fraction of Missing Information (FMI) and Relative Efficiency (RE) for linguistic variables.
| Variable | FMI | RE |
|---|---|---|
| LM I | 0.0156 | 0.9984 |
| LM II | 0.0094 | 0.9991 |
| Boston | 0.0229 | 0.9977 |
| WORDLISTUSED | 0.8723 | 0.9198 |
| WORDLISTTRIALI | 0.7855 | 0.9272 |
| WORDLISTTRIALII | 0.8466 | 0.9220 |
| WORDLISTTRIALIII | 0.8550 | 0.9212 |
| WORDLISTACQUISITION | 0.7969 | 0.9262 |
| WORDLISTDELAYEDRECALL | 0.7293 | 0.9320 |
| WORDLISTRECOGNITION | 0.6322 | 0.9405 |
a Component of Linguistic Battery II in Tables 7 & 8.
Machine learning diagnostic performance of models without covariates using the Area Under the ROC Curve (AUC)—(No covariates used in the models).
| Model | AUC | CI | p-value |
|---|---|---|---|
| Linguistic Battery I | 0.72 | 0.70-0.73 | <0.0001 |
| Linguistic Battery II | 0.88 | 0.86-0.89 | <0.0001 |
| MMSE | 0.59 | 0.57-0.62 | <0.0001 |
| CDR | 0.55 | 0.51-0.58 | <0.0001 |
| MMSE & CDR | 0.64 | 0.62-0.66 | <0.0001 |
| All combined | 0.98 | 0.97-0.98 | <0.0001 |
* Linguistic Battery I is with CERAD wordlist. Linguistic Battery II excludes CERAD wordlist.
Machine learning diagnostic performance of models with covariates using the Area Under the ROC Curve (AUC)—(Models include covariates).
| Model | AUC | CI | p-value |
|---|---|---|---|
| Linguistic Battery I w/ covariates | 0.84 | 0.83-0.86 | <0.0001 |
| Linguistic Battery II w/ covariates | 0.97 | 0.96-0.97 | <0.0001 |
| MMSE w/ covariates | 0.77 | 0.75-0.78 | <0.0001 |
| CDR w/ covariates | 0.68 | 0.66-0.71 | <0.0001 |
| MMSE & CDR w/ covariates | 0.86 | 0.85-0.88 | <0.0001 |
| All combined w/ covariates | 0.99 | 0.99-1.00 | <0.0001 |
* Linguistic Battery I is with CERAD wordlist. Linguistic Battery II excludes CERAD wordlist.
Identified optimal SVM parameters for each model.
| Model | Kernel | Cost | Gamma |
|---|---|---|---|
| Linguistic Battery I | Radial | 10 | 2 |
| Linguistic Battery II | Radial | 1 | 1 |
| MMSE | Radial | 0.1 | 0.5 |
| CDR | Radial | 100 | 1 |
| MMSE & CDR | Radial | 100 | 1 |
| All combined | Radial | 1 | 2 |
| Linguistic Battery I w/ covariates | Radial | 10 | 1 |
| Linguistic Battery II w/ covariates | Radial | 1 | 0.5 |
| MMSE w/ covariates | Radial | 1 | 1 |
| CDR w/ covariates | Radial | 0.1 | 2 |
| MMSE & CDR w/ covariates | Radial | 1 | 2 |
| All combined w/ covariates | Radial | 10 | 2 |
* Linguistic Battery I is with CERAD wordlist. Linguistic Battery II excludes CERAD wordlist.
Fig 1Comparison of the underlying patterns of the linguistic battery between the MCI-AD and healthy control groups.
(a) A. MCI-AD group. (b) B. Healthy control group.
Fig 2Comparison of the underlying patterns of combined MMSE and CDR test battery between the MCI-AD and healthy control groups.
(a) A. MCI-AD group. (b) B. Healthy control group.