| Literature DB >> 35229018 |
Gabriela Stegmann1,2, Shira Hahn1,2, Samarth Bhandari2, Kan Kawabata2, Jeremy Shefner3, Cayla Jessica Duncan3, Julie Liss1,2, Visar Berisha1,2, Kimberly Mueller4,5,6.
Abstract
We developed and evaluated an automatically extracted measure of cognition (semantic relevance) using automated and manual transcripts of audio recordings from healthy and cognitively impaired participants describing the Cookie Theft picture from the Boston Diagnostic Aphasia Examination. We describe the rationale and metric validation. We developed the measure on one dataset and evaluated it on a large database (>2000 samples) by comparing accuracy against a manually calculated metric and evaluating its clinical relevance. The fully automated measure was accurate (r = .84), had moderate to good reliability (intra-class correlation = .73), correlated with Mini-Mental State Examination and improved the fit in the context of other automatic language features (r = .65), and longitudinally declined with age and level of cognitive impairment. This study demonstrates the use of a rigorous analytical and clinical framework for validating automatic measures of speech, and applied it to a measure that is accurate and clinically relevant.Entities:
Keywords: algorithm; automatic; cognition; digital; language; longitudinal; speech
Year: 2022 PMID: 35229018 PMCID: PMC8865737 DOI: 10.1002/dad2.12294
Source DB: PubMed Journal: Alzheimers Dement (Amst) ISSN: 2352-8729
Description of the evaluation sample
| Evaluation data | ||||
|---|---|---|---|---|
| Demographic | CU | CU‐D | MCI | Dementia |
| Age mean (SD) | 58.5 (10.5) | 63.6 (6.0) | 66.7 (6.4) | 71.2 (8.6) |
| Sex (% female) | 58% F | 61% F | 73% F | 65% F |
| Race (%White) | 93% W | 84% W | 78% W | 97% W |
| Education (% less than high school, % completed high school, % more than high school) |
1% < HS, 16% HS, 83% > HS |
2% < HS, 10% HS, 88% > HS |
12% < HS, 15% HS, 73% > HS |
33% < HS, 31% HS, 38% > HS |
| Number of observations | 2,610 | 327 | 64 | 311 |
| Number of participants | 1258 | 180 | 26 | 195 |
Abbreviations: CU, cognitively unimpaired; CU‐D, cognitive unimpaired showing atypical decline; HS, high school; MCI, mild cognitive impairment; SD, standard deviation.
Number of observations for each sample characteristic
| Sample characteristics | Number of observations |
|---|---|
| Speech was manually transcribed | 2716 |
| Manual transcription was manually annotated to manually calculate SemR | 2163 |
| Speech was transcribed using ASR | 2921 |
| Speech was collected in clinic | 2716 |
| Speech was collected remotely | 595 |
| Speech sample was collected with paired MMSE | 2564 |
| Speech was collected in close temporal proximity (separated by ≈1 week) | 319 |
Abbreviations: ASR, automatic speech recognition; MMSE, Mini‐Mental State Examination; SemR, semantic relevance.
FIGURE 1Scatterplots showing: (A) the manually annotated SemR values versus manually transcribed algorithmically computed SemR values, (B) manually transcribed algorithmically computed SemR values versus ASR‐transcribed algorithmically computed SemR values, and (C) manually annotated SemR values versus ASR‐transcribed algorithmically computed SemR values. SemR, semantic relevance
Correlations and differences between the manually annotated, manually transcribed algorithmically computed, and ASR‐transcribed algorithmically computed SemR values
| Analysis | Correlation | MAE |
|---|---|---|
| Human‐transcript‐and‐SemR versus | ||
| Human‐transcript‐automatic‐SemR | 0.87 | 0.04 |
| Human‐transcript‐automatic‐SemR versus | ||
| ASR‐transcript‐automatic‐SemR | 0.95 | 0.01 |
| Human‐transcript‐and‐SemR versus | ||
| ASR‐transcript‐automatic‐SemR | 0.84 | 0.03 |
Abbreviations: ASR, automatic speech recognition; MAE, mean absolute error; SemR, semantic relevance.
FIGURE 2Boxplots of SemR scores for at‐home (unsupervised) and in‐clinic (supervised) samples. SemR, semantic relevance
FIGURE 3Test‐retest reliability plot for SemR. SemR, semantic relevance
FIGURE 4Scatterplot showing the predicted and observed Mini‐Mental State Examination (MMSE) values
Parameter estimates for the GCMs for each cognitive group
| Parameter | CU estimate (S.E.) | CU‐D estimate (S.E.) | MCI estimate (S.E.) | Dementia estimate (S.E.) |
|---|---|---|---|---|
|
| ||||
| Intercept (centered at age 65) | 0.158 (.002) | 0.167 (.004) | .163 (.01) | .132 (.005) |
| Slope | −.0004 (.0002) | −.0014 (.0006) | −.0026 (.0015) | −.0005 (.0004) |
|
| ||||
| Participant intercepts SD | 0.03 | 0.05 | 0.03 | 0.03 |
| Residuals SD | 0.04 | 0.04 | 0.04 | 0.05 |
Abbreviations: CU, cognitively unimpaired; CU‐D, cognitive unimpaired showing atypical decline; GCM, growth curve model; MCI, mild cognitive impairment; SD, standard deviation; S.E., standard error.
FIGURE 5Longitudinal plots showing the SemR values as a function of age for (A) cognitively unimpaired participants and (B) cognitively unimpaired declining, mild cognitive impairment, and dementia participants. The dark solid lines are based on the fixed effects of the growth curve model, and the shaded areas show the 95% confidence bands