| Literature DB >> 35224534 |
Fengyi Tang1, Jun Chen2, Hiroko H Dodge3, Jiayu Zhou1.
Abstract
In recent years, behavioral markers such as spoken language and lexical preferences have been studied in the early detection of mild cognitive impairment (MCI) using conversations. While the combination of linguistic and acoustic signals have been shown to be effective in detecting MCI, they have generally been restricted to structured conversations in which the interviewee responds to fixed prompts. In this study, we show that linguistic and acoustic features can be combined synergistically to identify MCI in semi-structured conversations. Using conversational data from an on-going clinical trial (Clinicaltrials.gov: NCT02871921), we find that the combination of linguistic and acoustic features on semi-structured conversations achieves a mean AUC of 82.7, significantly (p < 0.01) out-performing linguistic-only (74.9 mean AUC) or acoustic-only (65.0 mean AUC) detections on hold-out data. Additionally, features (linguistic, acoustic and combination) obtained from semi-structured conversations outperform their counterparts obtained from structured weekly conversations in identifying MCI. Some linguistic categories are significantly better at predicting MCI status (e.g., death, home) than others.Entities:
Keywords: Alzheimer's disease; I-CONECT project; audio and linguistic markers; behavioral intervention; conversations; mild cognitive impairment (MCI)
Year: 2022 PMID: 35224534 PMCID: PMC8878676 DOI: 10.3389/fdgth.2021.702772
Source DB: PubMed Journal: Front Digit Health ISSN: 2673-253X
Figure 1Distribution of cluster centroids on LIWC question vectors (1,000 EM runs).
Demographic characteristics by baseline cognitive status.
|
|
|
|
|
|---|---|---|---|
| Age | 79.3 (3.7) | 80.0 (4.3) | 0.72 |
| Gender (% Women) | 53.3% | 82.4% | 0.08 |
| Years of Education | 14.8 (2.8) | 15.8 (3.0) | 0.36 |
| Race (% White) | 86.7% | 94.1% | 0.49 |
| MoCA Score ( | 21.3 (2.9) | 25.7 (2.5) | 0.00018 |
MCI, mild cognitive impairment; NC, those with normal cognition. Two-sample student t-tests for continuous variables and Pearson chi-squared test for categorical variables were used to calculate p-values.
Comparisons of behavioral marker performances on 100 subtopic-stratified shuffle splits using semi-structured conversations.
|
|
|
|
|
|---|---|---|---|
| Linguistic | 74.9 (3.27) | - | - |
| Acoustic | 64.9 (4.66) | −3.52 | 5.37e-4 |
| Combo | 79.9 (4.37) | 1.78 | 0.077 |
| Ensemble |
|
|
|
Features are obtained from video chats (semi-structured conversations). Variance across different splits is reported in parenthesis. Bolded: the best performing model by AUC Score.
Figure 2Feature coefficients β across various train-test splits. Colors: yellow = overlapping feature weights, purple = non overlapping feature weights.
Top 10 LIWC feature coefficients correlated with MCI compared with top 10 coefficients correlated with NC.
|
|
|
|
| ||
|---|---|---|---|---|---|
| Death | 0.50 | 1.67 (1.6–1.7) | Swear | −0.74 | 0.48 (0.4–0.5) |
| They | 0.49 | 1.67 (1.6–1.7) | Feel | −0.61 | 0.55 (0.5–0.6) |
| Home | 0.38 | 1.48 (1.4–1.5) | Percept | −0.56 | 0.58 (0.5–0.6) |
| Ingest | 0.38 | 1.47 (1.4–1.5) | Nonfl | −0.54 | 0.59 (0.5–0.6) |
| Number | 0.32 | 1.40 (1.4–1.4) | Insight | −0.53 | 0.59 (0.5–0.6) |
| Friend | 0.30 | 1.37 (1.3–1.4) | Leisure | −0.49 | 0.62 (0.6–0.6) |
| You | 0.24 | 1.28 (1.2–1.3) | Assent | −0.46 | 0.64 (0.6–0.6) |
| Social | 0.24 | 1.27 (1.2–1.3) | Anger | −0.44 | 0.65 (0.6–0.7) |
| We | 0.21 | 1.25 (1.2–1.3) | Money | −0.39 | 0.68 (0.6–0.8) |
| Bio | 0.20 | 1.22 (1.2–1.2) | Time | −0.31 | 0.73 (0.7–0.8) |
Ninety five percent confidence interval is given for odds ratio.
Figure 3Feature importance rankings for MFCC coefficient weights. Elastic net was used to stabilize the L1 path across different train-test splits.
Top five MFCC feature coefficients associated with MCI compared to the top five feature coefficients associated with NC.
|
|
|
|
| ||
|---|---|---|---|---|---|
| AVG, min, δ0-8 | 0.15 | 1.16 (1.16–1.17) | MAX, min, δ1-13 | −0.15 | 0.86 (0.86–0.87) |
| AVG, max, δ0-8 | 0.14 | 1.15 (1.15 - 1.15) | MAX, min, δ0-7 | −0.15 | 0.86 (0.86–0.87) |
| AVG, std, δ0-1 | 0.13 | 1.14 (1.13 - 1.14) | MAX, min, δ1-11 | −0.15 | 0.86 (0.86–0.87) |
| MAX, std, δ0-12 | 0.12 | 1.13 (1.13 - 1.14) | STD, min, δ2-8 | −0.15 | 0.86 (0.86–0.87) |
| STD, max, δ0-2 | 0.12 | 1.13 (1.12 - 1.13) | MAX, avg, δ1-10 | −0.14 | 0.87 (0.86–0.87) |
Ninety five percent confidence interval is given for the odds ratio.
Comparisons of behavioral marker performances on 100 subtopic-stratified shuffle splits using structured conversations.
|
|
|
|
|
|---|---|---|---|
| Linguistic | 56.8 (6.3) | −7.30 | 7.37e-12 |
| Acoustic | 55.8 (6.3) | −6.58 | 4.2e-10 |
| Combo | 49.5 (4.4) | −10.08 | 1.7e-19 |
| Ensemble | 56.0 (7.2) | −6.24 | 2.64e-9 |
Features obtained from Weekly Check-ins (structured conversations). Variance across different splits is reported in parenthesis.