| Literature DB >> 34641989 |
Alexandra König1,2,3, Elisa Mallick1,2,3, Johannes Tröger1,2,3, Nicklas Linz1,2,3, Radia Zeghari1,2,3, Valeria Manera1,2,3, Philippe Robert1,2,3.
Abstract
BACKGROUND: Certain neuropsychiatric symptoms (NPS), namely apathy, depression, and anxiety demonstrated great value in predicting dementia progression, representing eventually an opportunity window for timely diagnosis and treatment. However, sensitive and objective markers of these symptoms are still missing. Therefore, the present study aims to investigate the association between automatically extracted speech features and NPS in patients with mild neurocognitive disorders.Entities:
Keywords: apathy; depression; mild neurocognitive disorders; neuropsychiatric symptoms; speech analysis; vocal parameters
Mesh:
Year: 2021 PMID: 34641989 PMCID: PMC8581700 DOI: 10.1192/j.eurpsy.2021.2236
Source DB: PubMed Journal: Eur Psychiatry ISSN: 0924-9338 Impact factor: 5.361
Overview and explanation of extracted speech features.
| Speech features | Explanation | Category |
|---|---|---|
| Speech ratio | Prosodic feature for verbal fluency from speech to nonspeech proportion in the audio | Temporal |
| Speech interval | Speech segments uninterrupted by pauses between syllables to measure speech production efficiency | Temporal |
| Total phonation time | Total time duration of all words across all sentences | Temporal |
| Number of pauses | Total number of pauses longer than a time threshold between the syllables | Temporal |
| Harmonics to noise ratio | Ratio between periodic and nonperiodic components of speech which reflects voice quality | Source |
| Sound to noise ratio | Ratio between the power of speech signal and the power of background noise that reflects voice quality | Source |
| Jitter | Jitter is a measure of random perturbation in signal periodicity more representative when examining long vowels. Jitter calculation is based on relative jitter | Source |
| Espinola zero crossing metric | Measure for the rate at which speech signal crosses the zero reference and its deviation from the reference | Source |
| Average amplitude change | Variation of the signal amplitude over time | Source |
| Amplitude mean absolute value | Variation of the signal amplitude over time without change direction | Source |
| Amplitude third moment | Skewness of the signal amplitude over time | Source |
| Amplitude fourth moment | Kurtosis of the signal amplitude over time | Source |
| Max amplitude | Measure for the maximum disturbance of the air caused by the speech signal | Source |
| Mean power | Mean power transmitted by the signal | Source |
| Total power | Total power transmitted by the signal | Source |
| Mean value of the fundamental frequency in voiced parts of the audio. Fundamental frequency or pitch quantifies speech signal’s periodic components for speech production assessment | Prosodic | |
| Standard deviation of fundamental frequency in voiced parts of the audio | Prosodic | |
| Average MFCC | Decomposition of MFCC into a range of spectrum coefficients 1 through 20 MFC represents the short-term power spectrum of a sound | Spectral |
| Deltas | First derivative of the average MFCC values that presents the change in the power spectrum | Spectral |
| Delta deltas | First derivative of the average MFCC values that presents the rate of change in the power spectrum | Spectral |
| Peak frequency | Frequency of the maximum power value in signal frequency spectrum | Spectral |
| Power spectrum ratio | Power of the most powerful frequency relative to all other frequencies | Spectral |
Abbreviation: MFCC, mel-frequency cepstral coefficient.
Demographic data for included participants, split by gender.
| Females | Males | |
|---|---|---|
| 92 | 49 | |
| Age | 75.01 (7.88) | 76.08 (6.75) |
| NPI anxiety | 1.84 (2.64) | 1.86 (2.67) |
| NPI depression | 0.85 (1.90) | 0.84 (1.69) |
| NPI apathy | 1.64 (2.74) | 3.37 (3.81) |
| NPI total | 7.10 (9.23) | 9.82 (8.96) |
| MMSE | 24.01 (4.95) | 23.69 (4.34) |
Note: Mean and standard deviation are reported.
Abbreviations: MMSE, Mini-Mental State Examination; NPI, neuropsychiatric inventory.
Spearman rank correlations between MMSE and NPI subscales for females and males.
| NPI score | Females | Males | ||
|---|---|---|---|---|
| Correlation |
| Correlation |
| |
| Anxiety | −0.335 | <0.01 | −0.268 | >0.05 |
| Depression | −0.148 | >0.05 | −0.163 | >0.05 |
| Apathy | −0.584 | <0.001 | −0.486 | <0.001 |
Abbreviations: MMSE, Mini-Mental State Examination; NPI, neuropsychiatric inventory.
Figure 1.Plot between (left) Spearman rank correlations and (right) spearman rank partial correlations corrected for Mini-Mental State Examination (MMSE), between audio features and neuropsychiatric inventory (NPI) subscales, separated by gender and voice task. Only significant correlations are reported. Absolute value of correlation is reflected in the size and color (positive correlations in blue; negative correlations in red) of the dot.
Mean absolute error of regression methods (linear regression L1 penalization and SVM) and of the baseline, for males and females separately.
| Male | Female | |||||
|---|---|---|---|---|---|---|
| Baseline | Linear regression L1 penalization | SVM | Baseline | Linear regression L1 penalization | SVM | |
| NPI anxiety | 2.05232 | 2.05232 | 2.17857 | 2.22331 | ||
| NPI depression | 1.26708 | 1.26708 | 1.08078 | 1.08078 | ||
| NPI apathy | 2.13903 | 3.22024 | 3.37315 | |||
Note: Results better than baseline are marked in bold.
Abbreviation: NPI, neuropsychiatric inventory.