| Literature DB >> 30483116 |
Daniela Beltrami1,2, Gloria Gagliardi1,3, Rema Rossini Favretti3, Enrico Ghidoni2, Fabio Tamburini3, Laura Calzà1,4.
Abstract
Background: The discovery of early, non-invasive biomarkers for the identification of "preclinical" or "pre-symptomatic" Alzheimer's disease and other dementias is a key issue in the field, especially for research purposes, the design of preventive clinical trials, and drafting population-based health care policies. Complex behaviors are natural candidates for this. In particular, recent studies have suggested that speech alterations might be one of the earliest signs of cognitive decline, frequently noticeable years before other cognitive deficits become apparent. Traditional neuropsychological language tests provide ambiguous results in this context. In contrast, the analysis of spoken language productions by Natural Language Processing (NLP) techniques can pinpoint language modifications in potential patients. This interdisciplinary study aimed at using NLP to identify early linguistic signs of cognitive decline in a population of elderly individuals.Entities:
Keywords: Natural Language Processing; cognitive decline; language; mild cognitive impairment; preclinical Alzheimer; speech analysis
Year: 2018 PMID: 30483116 PMCID: PMC6243042 DOI: 10.3389/fnagi.2018.00369
Source DB: PubMed Journal: Front Aging Neurosci ISSN: 1663-4365 Impact factor: 5.750
Inclusion criteria for participant enrollment in control and pathological groups.
| - MMSE Raw Score (RS) ≥ 24; | - MMSE ≥ 18; |
Level of education and demographic characteristics of participants.
| Control group | CG | 48 | 61.60 ± 6.93 | 13.00 ± 3.92 |
| Pathological group | a-MCI | 16 | 64.19 ± 7.44 | 11.00 ± 4.00 |
| md-MCI | 16 | 64.50 ± 7.47 | 11.56 ± 4.79 | |
| e-D | 16 | 66.38 ± 6.70 | 9.38 ± 4.01* |
The statistical analysis was performed by comparing CG vs. aMCI, mdMCI, and eD by the non-parametric Kruskal-Wallis tests; .
Figure 1Results of the conventional neuropsycological test performed at the enrolment of the study. The graphs report median and interquartile range. The statistical analysis was performed by the non-parametric Kruskal-Wallis test with Dunn's multiple comparison having CG as control, where *p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001. MMSE, Mini Mental State Examination; MoCA, Montreal Cognitive Assessment; CDT, Clock Drawing Test; GPCog-a, General Practitioner Assessment of Cognition; PF, Phonemic verbal fluency tests; SF, Semantic verbal fluency tests.
The table reports the results of the spontaneous speech analysis.
| Content density | LEX_ContDens | 0.0007 | 0.711 | 0.9741 | |||||||||
| Part-of-speech rate: Adj | LEX_PoS_ADJ | 0.0086 | 0.0124 | 0.5366 | |||||||||
| Part-of-speech rate: Adv | LEX_PoS_ADV | 0.0968 | 0.4451 | 0.2955 | |||||||||
| Part-of-speech rate: art | LEX_PoS_ART | 0.4329 | 0.9572 | 0.7976 | |||||||||
| Part-of-speech rate: conj | LEX_PoS_CONJ | 0.5617 | 0.1266 | 0.0290 | |||||||||
| Part-of-speech rate: interj | LEX_PoS_INTERJ | 0.7401 | 0.2145 | 0.8830 | |||||||||
| Part-of-speech rate: noun | LEX_PoS_NOUN | 0.1714 | 0.1274 | 0.2230 | |||||||||
| Part-of-speech rate: num | LEX_PoS_NUM | 0.4855 | 0.2719 | 0.9803 | |||||||||
| Part-of-speech rate: phras | LEX_PoS_PHRAS | 0.2351 | 0.2214 | 0.0403 | |||||||||
| Part-of-speech rate: predet | LEX_PoS_PREDET | 0.9299 | 0.5257 | 0.7294 | |||||||||
| Part-of-speech rate: prep | LEX_PoS_PREP | 0.5495 | 0.2078 | 0.2338 | |||||||||
| Part-of-speech rate: pron | LEX_PoS_PRON | 0.4161 | 0.1433 | 0.4150 | |||||||||
| Part-of-speech rate: Vrb | LEX_PoS_VERB | 0.1434 | 0.1103 | 0.0821 | |||||||||
| Reference rate to reality | LEX_RefRReal | 0.1089 | 0.0258 | 0.3954 | |||||||||
| Personal Deixis rate | LEX_PDEIXIS | 0.7031 | 0.1610 | 0.5622 | |||||||||
| Spatial Deixis rate | LEX_SDEIXIS | 0.5189 | 0.5346 | 0.1813 | |||||||||
| Temporal Deixis rate | LEX_TDEIXIS | 0.7165 | 0.5619 | 0.1717 | |||||||||
| Action verbs rate | LEX_ACTVRB | 0.4493 | 0.6362 | 0.3406 | |||||||||
| Propositional idea density | LEX_IDEAD | 0.0125 | 0.8162 | 0.3547 | |||||||||
| Frequency-of-use tagging | LEX_DM_F | 0.2147 | 0.9209 | 0.3132 | |||||||||
| Lexical richness: type-token ratio | LEX_TTR | 0.0617 | 0.4601 | 0.336 | |||||||||
| Lexical richness: Brunét's index | LEX_BrunetW | 0.0144 | 0.3496 | 0.1255 | |||||||||
| Lexical richness: Honoré's statistic | LEX_HonoreR | 0.5666 | 0.8246 | 0.7209 | |||||||||
| Mean number of words in utterances | LEX_NW | 0.0071 | 0.2203 | 0.0670 | |||||||||
| Percent. of vocalic intervals | RHY_%V | 0.1749 | 0.7722 | 0.4195 | |||||||||
| Std. deviation of vocalic interval durations | RHY_DeltaV | 0.0709 | 0.2127 | 0.2484 | |||||||||
| Std. deviation of consonantal interval durations | RHY_DeltaC | 0.3722 | 0.8292 | 0.0011 | |||||||||
| Pairwise variability index, raw | RHY_VnPVI | 0.8350 | 0.5988 | 0.0314 | |||||||||
| Pairwise variability index, normalized | RHY_CrPVI | 0.4644 | 0.6684 | 0.0152 | |||||||||
| Variation coefficient for ΔV | RHY_VarcoV | 0.0079 | 0.2779 | 0.3225 | |||||||||
| Variation coefficient for ΔC | RHY_VarcoC | 0.2960 | 0.9642 | 0.0146 | |||||||||
| Silence segments duration | SPE_SILMEAN | 0.0028 | <0.0001 | 0.0837 | |||||||||
| Speech segments duration | SPE_SPEMEAN | 0.0012 | 0.0016 | 0.0001 | |||||||||
| Temporal regularity of voiced segment | SPE_TRVSD | 0.7356 | 0.4391 | 0.4000 | |||||||||
| Verbal rate | SPE_VR | 0.0944 | 0.0548 | 0.0597 | |||||||||
| Transformed phonation rate | SPE_TPR | 0.0008 | 0.0002 | 0.0003 | |||||||||
| Standardized phonation time | SPE_SPT | 0.2824 | 0.1591 | 0.0840 | |||||||||
| Standardized pause rate | SPE_SPR | 0.0134 | 0.0037 | 0.0002 | |||||||||
| Root mean square energy | SPE_RMSEM | 0.2812 | 0.2035 | 0.2089 | |||||||||
| Pitch | SPE_PITCHM | 0.4208 | 0.3924 | 0.3847 | |||||||||
| Spectral centroid | SPE_SPCENTRM | 0.0162 | 0.1505 | 0.1047 | |||||||||
| Higuchi fractal dimension | SPE_HFractDM | 0.0022 | 0.0022 | 0.0046 | |||||||||
| Utterance length | SYN_SLENM | 0.0596 | <0.0001 | **** | 0.0012 | ||||||||
| Number of dependent elements linked to the noun | SYN_NPLENM | 0.6967 | 0.3866 | 0.3754 | |||||||||
| Global dependency distance | SYN_GRAPHDISTM | 0.0275 | 0.0106 | 0.0010 | |||||||||
| Syntactic embeddedness: maximum depth of the structure | SYN_MAXDEPTHM | 0.1168 | 0.0002 | 0.0191 | |||||||||
| Syntactic complexity | SYN_ISynCompl | 0.3602 | 0.8648 | 0.6019 | |||||||||
See Table in Supplementary Materials for abbreviations and references. For each group of features (lexical, rhythmic, acoustic, syntactic), the statistical analysis was performed by comparing aMCI and mdMCI vs. CG by the non-parametric Kruskal-Wallis tests and Dunn's multiple comparison, having the CG as “control group,” and by comparing CG vs. eD by the non-parametric Mann-Whitney U-test. The Kruskal-Wallis p-value, the significant p-values from the Dunn's test (aMCI and mdMCI) and Mann-Whitney U-test (eD) are reported in the table, where
p < 0.05;
p < 0.01;
p < 0.001. aMCI, amnesic mild cognitive impairment; CG: control group; eD, early dementia; mdMCI, multiple domain mild cognitive impairment.