Qing Zhao, Hong-Zhen Fan, Yan-Li Li, Lei Liu, Ya-Xue Wu, Yan-Li Zhao, Zhan-Xiao Tian, Zhi-Ren Wang, Yun-Long Tan, Shu-Ping Tan.
Abstract
Background: At present, there is no established biomarker for the diagnosis of depression. Meanwhile, studies show that acoustic features convey emotional information. Therefore, this study explored differences in acoustic characteristics between depressed patients and healthy individuals to investigate whether these characteristics can identify depression.
Keywords: MFCC; acoustic characteristics; biomarker; depression; zero-crossing rate
Year: 2022 PMID: 35573349 PMCID: PMC9095973 DOI: 10.3389/fpsyt.2022.815678
Source DB: PubMed Journal: Front Psychiatry ISSN: 1664-0640 Impact factor: 5.435
Demographic and clinical characteristics of all participants (N = 133).
| Characteristics | Depression group (Mean ± SD) | Control group (Mean ± SD) | Group comparison: test statistic | Group comparison: p |
| Age (years) | 34.90 ± 9.32 | 36.67 ± 8.56 | 1.15 | 0.25 |
| Gender (Male,%) | 33.80% | 31.30% | 0.1 | 0.85 |
| Education (years) | 16.25 ± 2.05 | 15.86 ± 1.83 | −1.17 | 0.24 |
| Age of onset (years) | 27.59 ± 6.25 | NA | NA | NA |
| Disease duration (years) | 7.31 ± 6.02 | NA | NA | NA |
| HAMA score | 21.3 ± 9.93 | NA | NA | NA |
| HAMD score | 16.09 ± 6.24 | NA | NA | NA |
| PHQ-9 score | 13.13 ± 6.61 | NA | NA | NA |
HAMA, Hamilton Anxiety Scale; HAMD, Hamilton Depression Scale; PHQ-9, Patient Health Questionnaire.
Differences in acoustic characteristics between the depression and control groups under different emotional tasks (N = 133).
| Acoustic feature | Depression group: Positive | Depression group: Neutral | Depression group: Negative | Control group: Positive | Control group: Neutral | Control group: Negative | Group effect: test statistic | Group effect: p | Group effect: η² |
| ZCR (×10⁻²) | 4.91 (1.78) | 5.08 (1.76) | 5.01 (1.82) | 6.34 (0.82) | 6.67 (0.80) | 6.63 (0.82) | 128.07 | <0.0001 | 0.25 |
| HNR (×10⁻²) | 34.59 (8.97) | 35.78 (9.03) | 35.14 (9.10) | 31.30 (3.46) | 31.78 (3.36) | 31.52 (3.88) | 34.09 | <0.0001 | 0.08 |
| F0 | 38.68 (38.55) | 44.23 (41.60) | 41.26 (39.33) | 13.61 (11.84) | 17.60 (12.77) | 15.67 (14.99) | 85.13 | <0.0001 | 0.18 |
| MFCC1 | −6.01 (5.25) | −6.54 (5.07) | −6.64 (5.13) | −10.88 (1.49) | −11.73 (1.45) | −11.68 (1.46) | 181.35 | <0.0001 | 0.32 |
| MFCC2 | −0.84 (4.20) | 0.81 (3.99) | 0.70 (3.94) | −6.23 (1.84) | −4.23 (1.61) | −4.46 (1.63) | 265.72 | <0.0001 | 0.41 |
| MFCC3 | −2.69 (2.72) | −1.24 (3.01) | −1.27 (2.76) | −5.99 (2.26) | −4.16 (2.30) | −4.28 (2.34) | 158.39 | <0.0001 | 0.29 |
| MFCC4 | −3.56 (4.07) | −4.47 (4.30) | −3.77 (4.06) | −7.33 (2.24) | −8.06 (2.23) | −7.49 (2.24) | 119.36 | <0.0001 | 0.23 |
| MFCC5 | −5.92 (4.79) | −6.50 (4.55) | −6.04 (4.65) | −9.26 (2.37) | −9.55 (2.32) | −9.32 (2.31) | 68.58 | <0.0001 | 0.15 |
| MFCC6 | −7.13 (5.64) | −6.74 (5.46) | −6.78 (5.37) | −9.55 (2.31) | −9.00 (2.01) | −9.19 (2.14) | 28.66 | <0.0001 | 0.07 |
| MFCC7 | −4.49 (3.08) | −5.14 (2.95) | −4.80 (2.98) | −8.19 (2.68) | −8.84 (2.27) | −8.51 (2.32) | 171.70 | <0.0001 | 0.31 |
| MFCC8 | −3.36 (3.60) | −4.93 (3.73) | −4.47 (3.64) | −6.36 (2.16) | −8.08 (2.09) | −7.55 (2.01) | 99.82 | <0.0001 | 0.20 |
| MFCC9 | −3.14 (3.24) | −3.95 (2.63) | −3.78 (3.05) | −5.11 (2.07) | −5.97 (1.59) | −5.83 (1.67) | 67.17 | <0.0001 | 0.15 |
| MFCC10 | −4.62 (4.14) | −3.90 (3.78) | −4.50 (4.15) | −8.14 (2.93) | −7.42 (2.26) | −8.08 (2.68) | 130.32 | <0.0001 | 0.25 |
| MFCC11 | −3.98 (2.57) | −3.68 (2.95) | −4.02 (2.94) | −5.55 (2.30) | −5.00 (2.42) | −5.40 (2.42) | 32.37 | <0.0001 | 0.08 |
| MFCC12 | −2.07 (1.86) | −2.18 (1.78) | −2.40 (1.94) | −3.41 (2.01) | −3.39 (1.89) | −3.75 (2.02) | 48.35 | <0.0001 | 0.11 |
*Indicates results that survived FDR correction.
ZCR, zero-crossing rate; HNR, harmonic-to-noise ratio; F0, fundamental frequency; MFCC, mel-frequency cepstral coefficient.
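To make the first feature in the table concrete, the zero-crossing rate is commonly defined as the fraction of adjacent sample pairs whose signs differ. The sketch below assumes a mono signal held in a NumPy array; `zero_crossing_rate` is a hypothetical helper, and the extraction toolkit, frame length, and scaling used in the study are not stated here, so its output should not be compared directly with the tabulated values.

```python
import numpy as np

def zero_crossing_rate(signal):
    """Fraction of adjacent sample pairs whose signs differ (hypothetical helper)."""
    x = np.asarray(signal, dtype=float)
    if x.size < 2:
        return 0.0
    negative = np.signbit(x)  # True for negative samples; +0.0 counts as non-negative
    crossings = np.count_nonzero(negative[1:] != negative[:-1])
    return crossings / (x.size - 1)

# Synthetic example (not study data): a 100 Hz sine sampled at 8 kHz for one second.
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 100 * t)
print(zero_crossing_rate(tone))
```

A pure tone crosses zero about twice per cycle, so the rate rises with the dominant frequency and with noisiness of the signal.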
The correlation between depressive symptom severity and acoustic characteristics.
| Acoustic feature | PHQ-9 score | HAMA total score | HAMA somatic anxiety score | HAMD anxiety/somatization score |
| ZCR-positive | −0.13 | 0.31 | 0.34 | 0.18 |
| MFCC4-positive | 0.39 | −0.06 | 0.02 | −0.05 |
| MFCC4-neutral | 0.42 | −0.02 | 0.01 | 0.02 |
| MFCC4-negative | 0.36 | −0.10 | −0.09 | −0.09 |
| MFCC7-positive | 0.43 | 0.10 | 0.11 | −0.11 |
| MFCC7-neutral | 0.41 | 0.14 | 0.17 | −0.14 |
| MFCC7-negative | 0.46 | 0.09 | 0.12 | −0.05 |
| MFCC9-neutral | −0.02 | −0.09 | −0.07 | −0.34 |
*p < 0.05.
PHQ-9, Patient Health Questionnaire; HAMA, Hamilton Anxiety Scale; HAMD, Hamilton Depression Scale; ZCR, zero-crossing rate; MFCC, mel-frequency cepstral coefficient.
Linear regression analysis between PHQ-9 score and MFCC7-negative.
| Predictor | β | SE | t | p | 95% CI of β: lower bound | 95% CI of β: upper bound |
| (constant) | 17.00 | 1.79 | 9.49 | < 0.001 | 13.34 | 20.65 |
| MFCC7-negative | 0.90 | 0.33 | 2.69 | 0.01 | 0.22 | 1.58 |
| (constant) | 4.33 | 1.11 | 3.92 | < 0.001 | 2.09 | 6.58 |
| MFCC9-neutral | −0.45 | 0.22 | −2.04 | 0.049 | −0.91 | −0.001 |
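The regression rows can be checked for internal consistency with simple arithmetic. The sketch below assumes ordinary least squares conventions: the statistic reported after SE is the ratio β/SE, and the 95% confidence interval is symmetric, β ± c·SE, for some critical value c. These conventions are assumptions, not something the table states, and `check_row` is a hypothetical helper.

```python
# Rows copied from the regression table:
# (label, beta, SE, reported statistic, CI lower bound, CI upper bound).
rows = [
    ("(constant), block 1", 17.00, 1.79, 9.49, 13.34, 20.65),
    ("MFCC7-negative", 0.90, 0.33, 2.69, 0.22, 1.58),
    ("(constant), block 2", 4.33, 1.11, 3.92, 2.09, 6.58),
    ("MFCC9-neutral", -0.45, 0.22, -2.04, -0.91, -0.001),
]

def check_row(beta, se, ci_low, ci_high):
    """Return beta/SE and the critical value implied by a symmetric interval."""
    ratio = beta / se
    implied_critical = (ci_high - ci_low) / (2 * se)
    return ratio, implied_critical

for label, beta, se, reported, ci_low, ci_high in rows:
    ratio, critical = check_row(beta, se, ci_low, ci_high)
    print(f"{label}: beta/SE = {ratio:.2f} (reported {reported}), "
          f"implied critical value = {critical:.2f}")
```

Under these assumptions every ratio lands within rounding distance of the reported statistic, and the implied critical values all fall slightly above 2, as expected for a two-sided 95% interval with a modest number of degrees of freedom.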
The discriminant result of grouping by different classifiers.
| Indicator | LR: Depression group | LR: Control group | SVM: Depression group | SVM: Control group |
| Precision | 0.89 | 0.91 | 0.93 | 0.79 |
| Recall | 0.94 | 0.83 | 0.82 | 0.92 |
| F1 score | 0.91 | 0.87 | 0.87 | 0.85 |
LR, Logistic Regression; SVM, support vector machine.
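The F1 scores in the table follow from the precision and recall rows, since F1 is the harmonic mean of the two. The short check below recomputes it from the tabulated values; `f1_score` is a hypothetical helper written for this illustration, not a call into any particular library.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# (classifier, group) -> (precision, recall, reported F1), copied from the table.
reported = {
    ("LR", "depression"): (0.89, 0.94, 0.91),
    ("LR", "control"): (0.91, 0.83, 0.87),
    ("SVM", "depression"): (0.93, 0.82, 0.87),
    ("SVM", "control"): (0.79, 0.92, 0.85),
}

for (model, group), (precision, recall, f1) in reported.items():
    print(model, group, round(f1_score(precision, recall), 2), f1)
```

Each recomputed value matches the reported one after rounding to two decimals.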
FIGURE 1. ROC curves of acoustic features in the LR and SVM models. The red line represents the LR model; the green line represents the SVM model; the horizontal axis is the false positive rate, and the vertical axis is the true positive rate.
FIGURE 2. The 20 acoustic features with the largest contribution in the LR model. pos, positive emotion task; neg, negative emotion task; neu, neutral emotion task.
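The axes of Figure 1 can be illustrated with a minimal sketch of how ROC points arise from classifier scores: the decision threshold is swept from high to low, and the true positive rate is recorded against the false positive rate at each step. The helpers `roc_points` and `area_under_curve` are hypothetical, the scores are made up for illustration, and the sketch assumes both classes are present and no two scores are tied.

```python
import numpy as np

def roc_points(labels, scores):
    """Return (fpr, tpr) arrays from sweeping a threshold over distinct scores."""
    labels = np.asarray(labels, dtype=bool)
    scores = np.asarray(scores, dtype=float)
    order = np.argsort(-scores, kind="stable")  # highest score first
    ranked = labels[order]
    true_pos = np.cumsum(ranked)    # positives accepted at each threshold
    false_pos = np.cumsum(~ranked)  # negatives accepted at each threshold
    tpr = np.concatenate(([0.0], true_pos / ranked.sum()))
    fpr = np.concatenate(([0.0], false_pos / (~ranked).sum()))
    return fpr, tpr

def area_under_curve(fpr, tpr):
    """Trapezoidal area under the ROC curve."""
    return float(np.sum(np.diff(fpr) * (tpr[1:] + tpr[:-1]) / 2))

# Made-up scores: 1 = depression group, 0 = control group.
labels = [1, 0, 1, 0]
scores = [0.9, 0.8, 0.7, 0.1]
fpr, tpr = roc_points(labels, scores)
print(area_under_curve(fpr, tpr))
```

In this toy case three of the four positive/negative pairs are ranked correctly, which is what the area under the curve measures.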