Jingying Wang, Lei Zhang, Tianli Liu, Wei Pan, Bin Hu, Tingshao Zhu.
Abstract
BACKGROUND: Abnormalities in vocal expression during a depressive episode have frequently been reported in people with depression, but less is known about whether these abnormalities exist only in particular situations. In addition, previous studies did not control for the effects of irrelevant demographic variables on voice. Therefore, this study compares the vocal differences between depressed and healthy people across various situations, treating irrelevant variables as covariates.
Keywords: Acoustic feature; Cross-situation; Major depressive disorder; Voice analysis
Year: 2019 PMID: 31615470 PMCID: PMC6794822 DOI: 10.1186/s12888-019-2300-7
Source DB: PubMed Journal: BMC Psychiatry ISSN: 1471-244X Impact factor: 3.630
Demographic characteristics of the sample
| | Depressed | Healthy |
|---|---|---|
| Age | 34.3 ± 10.3 | 31.9 ± 8.4 |
| Gender | | |
| Female | 26 | 27 |
| Male | 21 | 30 |
| Educational level | | |
| Primary school | 1 | 0 |
| Middle school | 7 | 4 |
| High school | 5 | 8 |
| Secondary school | 2 | 1 |
| Junior college | 9 | 1 |
| Bachelor | 17 | 11 |
| Master | 6 | 22 |
| Doctor | 0 | 10 |
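As a consistency check, the gender counts and the education counts in each column should add up to the same group size. A minimal sketch in Python, with the counts copied from the table; the dictionary and function names are chosen only for illustration:

```python
# Counts copied from the demographics table; names are illustrative.
gender = {
    "depressed": {"female": 26, "male": 21},
    "healthy": {"female": 27, "male": 30},
}
# Education counts in table order: primary school ... doctor.
education = {
    "depressed": [1, 7, 5, 2, 9, 17, 6, 0],
    "healthy": [0, 4, 8, 1, 1, 11, 22, 10],
}


def group_size(group: str) -> int:
    """Return the group size after checking that both tallies agree."""
    by_gender = sum(gender[group].values())
    by_education = sum(education[group])
    if by_gender != by_education:
        raise ValueError(f"inconsistent counts for {group}")
    return by_gender


sizes = {group: group_size(group) for group in gender}
print(sizes)  # {'depressed': 47, 'healthy': 57}
```

Both tallies agree, giving 47 depressed and 57 healthy participants.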
Acoustic features
| Name of feature | Explanation |
|---|---|
| | subjective perception of sound volume |
| | lowest frequency of a periodic waveform |
| | the envelope of the smoothed F0 contour |
| | the rate of sign-changes along a signal |
| | the rate of voicing in one speech |
| | vocal tract changes in a certain voice spectral energy |
| | quantization of linear prediction coefficients (LPC) for transmission over a channel |
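To make one of these definitions concrete, "the rate of sign-changes along a signal" can be computed directly from raw samples. The sketch below is a plain-Python illustration, not the extraction pipeline used in the study; the function name is hypothetical, and real toolkits typically compute this per short frame and may treat zero-valued samples differently.

```python
from typing import Sequence


def zero_crossing_rate(samples: Sequence[float]) -> float:
    """Fraction of adjacent sample pairs whose signs differ.

    Assumption: a sample equal to zero is counted as non-negative.
    """
    if len(samples) < 2:
        return 0.0
    crossings = sum(
        1
        for prev, cur in zip(samples, samples[1:])
        if (prev >= 0) != (cur >= 0)
    )
    return crossings / (len(samples) - 1)


# A signal that flips sign at every step crosses zero between every pair.
print(zero_crossing_rate([1.0, -1.0, 1.0, -1.0]))  # 1.0
# A signal that never changes sign has no crossings.
print(zero_crossing_rate([0.5, 0.7, 0.9]))  # 0.0
```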
The main effect of group in each scenario
| Scenario a | Group: Wilks’ Lambda (λ) | Group: P value | Group: ηp² | Educational level: Wilks’ Lambda (λ) | Educational level: P value | Educational level: ηp² |
|---|---|---|---|---|---|---|
| VW-pos | 4.556 | .000 | .603 | 1.177 | .289 | .282 |
| VW-neu | 5.894 | .000 | .666 | 1.168 | .297 | .283 |
| VW-neg | 4.839 | .000 | .620 | 1.683 | .045 | .362 |
| QA-pos | 5.007 | .000 | .625 | 1.337 | .168 | .308 |
| QA-neu | 4.659 | .000 | .608 | 2.111 | .007 | .413 |
| QA-neg | 5.468 | .000 | .646 | 1.579 | .068 | .345 |
| TR-pos | 5.185 | .000 | .637 | 1.428 | .122 | .325 |
| TR-neu | 5.369 | .000 | .645 | 1.526 | .084 | .340 |
| TR-neg | 5.568 | .000 | .650 | 1.559 | .073 | .342 |
| PD-pos | 5.238 | .000 | .636 | 0.993 | .487 | .249 |
| PD-neu | 5.427 | .000 | .644 | 1.179 | .287 | .282 |
| PD-neg | 4.491 | .000 | .600 | 1.387 | .141 | .316 |

a VW, video watching; QA, question answering; TR, text reading; PD, picture describing; pos, positive; neu, neutral; neg, negative
Positive emotion: the different acoustic features between depressed and healthy people under different tasks
| Feature | Video watching: healthy | Video watching: depressed | Video watching: F | Video watching: ηp² | Question answering: healthy | Question answering: depressed | Question answering: F | Question answering: ηp² | Text reading: healthy | Text reading: depressed | Text reading: F | Text reading: ηp² | Picture describing: healthy | Picture describing: depressed | Picture describing: F | Picture describing: ηp² |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| loudness | 0.38 ± 0.17 | 0.16 ± 0.16 | 34.07*** | | 0.38 ± 0.16 | 0.17 ± 0.17 | 30.92*** | | 0.48 ± 0.2 | 0.23 ± 0.23 | 24.49*** | | 0.35 ± 0.16 | 0.16 ± 0.16 | 24.61*** | |
| | −0.32 ± 4.18 | 0.58 ± 3.81 | 2.92 | .03 | 0.08 ± 3.34 | 0.79 ± 3.32 | 1.30 | .01 | 2.67 ± 3.26 | 2.69 ± 3.1 | 0.00 | .00 | −0.83 ± 3.76 | 0.59 ± 3.19 | 5.44* | .05 |
| | 7.81 ± 3.66 | 8.63 ± 2.70 | 1.93 | .02 | 8.07 ± 2.93 | 8.68 ± 2.71 | 2.07 | .02 | 5.39 ± 4.13 | 8.35 ± 4.57 | 13.87*** | .12 | 8.29 ± 2.81 | 9.36 ± 3.01 | 3.06 | .03 |
| | 6.19 ± 4.83 | 3.28 ± 3.40 | 9.31** | .09 | 6.98 ± 4.46 | 3.17 ± 3.35 | 18.87*** | | 4.82 ± 5.53 | 0.34 ± 4.6 | 14.17*** | .13 | 7.27 ± 4.69 | 3.89 ± 3.73 | 12.77** | .11 |
| | 5.90 ± 4.23 | 3.94 ± 4.22 | 6.57* | .06 | 5.04 ± 4.41 | 3.13 ± 4.25 | 3.23 | .03 | 0.91 ± 6.13 | −0.61 ± 6.71 | 0.24 | .00 | 6.2 ± 3.95 | 3.78 ± 4.59 | 7.56** | .07 |
| mfcc5 | 3.23 ± 6.12 | −3.88 ± 6.79 | 27.60*** | | 1.67 ± 5.25 | −4.93 ± 6.3 | 26.56*** | | −1.98 ± 6.35 | −10.39 ± 8.97 | 20.80*** | | 2.75 ± 5.61 | −3.52 ± 5.12 | 27.07*** | |
| | 3.83 ± 6.88 | 5.78 ± 6.49 | 1.10 | .01 | 3.17 ± 5.41 | 5.95 ± 6.56 | 3.07 | .03 | 0.34 ± 6.63 | 5.04 ± 8.16 | 8.20** | .08 | 3.67 ± 5.34 | 5.42 ± 6.68 | 0.68 | .01 |
| mfcc7 | −0.21 ± 5.32 | −7.25 ± 4.69 | 47.63*** | | −0.27 ± 5.06 | −7.6 ± 74.11 | 57.35*** | | −2.12 ± 5.86 | −10.77 ± 4.51 | 55.24*** | | 0.19 ± 4.96 | −7.33 ± 3.72 | 64.00*** | |
| | 2.17 ± 5.51 | 1.90 ± 4.47 | 0.18 | .00 | 0.76 ± 5.43 | 1.87 ± 4.59 | 1.18 | .01 | 0.44 ± 7.11 | 1.53 ± 5.63 | 1.36 | .01 | 1.85 ± 4.92 | 1.42 ± 4.09 | 0.43 | .00 |
| | 0.33 ± 4.37 | 2.37 ± 4.15 | 4.51* | .04 | −0.36 ± 5.41 | 1.67 ± 3.93 | 3.86 | .04 | −1.59 ± 6.55 | 1.01 ± 5.46 | 6.99** | .07 | 0.39 ± 4.79 | 2.23 ± 3.57 | 2.83 | .03 |
| | 1.58 ± 5.48 | 1.29 ± 5.16 | 0.09 | .00 | 0.83 ± 6.04 | 0.34 ± 4.99 | 0.23 | .00 | −1.42 ± 7.99 | −4.13 ± 7.04 | 2.21 | .02 | 1.3 ± 5.85 | 1.01 ± 4.9 | 0.09 | .00 |
| | −0.75 ± 5.08 | −0.56 ± 4.20 | 0.07 | .00 | −0.73 ± 4.51 | −0.84 ± 4.14 | 0.17 | .00 | −2.87 ± 5.23 | −3.13 ± 4.71 | 0.00 | .00 | −0.64 ± 4.03 | −0.01 ± 3.85 | 0.16 | .00 |
| | −2.17 ± 3.67 | −1.05 ± 2.84 | 1.10 | .01 | −1.61 ± 3.56 | −1.54 ± 3.05 | 0.02 | .00 | −3.02 ± 3.98 | −3.18 ± 3.25 | 0.52 | .01 | −2.21 ± 3.65 | −1.22 ± 2.46 | 0.58 | .01 |
| | 0.2 ± 0.04 | 0.21 ± 0.04 | 0.08 | .00 | 0.2 ± 0.03 | 0.2 ± 0.03 | 0.54 | .01 | 0.19 ± 0.03 | 0.2 ± 0.02 | 0.46 | .00 | 0.21 ± 0.04 | 0.2 ± 0.03 | 0.02 | .00 |
| | 0.62 ± 0.06 | 0.62 ± 0.07 | 0.36 | .00 | 0.61 ± 0.05 | 0.61 ± 0.06 | 0.02 | .00 | 0.55 ± 0.04 | 0.56 ± 0.05 | 4.42* | .04 | 0.63 ± 0.06 | 0.62 ± 0.06 | 1.14 | .01 |
| | 0.97 ± 0.07 | 0.98 ± 0.06 | 0.38 | .00 | 0.97 ± 0.06 | 0.98 ± 0.06 | 1.68 | .02 | 0.92 ± 0.07 | 0.95 ± 0.06 | 5.72* | .05 | 0.98 ± 0.06 | 0.98 ± 0.05 | 0.00 | .00 |
| | 1.33 ± 0.07 | 1.3 ± 0.09 | 4.70* | .05 | 1.33 ± 0.07 | 1.29 ± 0.09 | 3.42 | .03 | 1.28 ± 0.07 | 1.24 ± 0.08 | 3.53 | .03 | 1.34 ± 0.07 | 1.3 ± 0.08 | 7.39** | .07 |
| | 1.66 ± 0.08 | 1.61 ± 0.1 | 8.55** | .08 | 1.66 ± 0.07 | 1.6 ± 0.1 | 10.60** | .10 | 1.62 ± 0.07 | 1.54 ± 0.1 | 16.49*** | .14 | 1.67 ± 0.07 | 1.61 ± 0.09 | 13.44*** | .12 |
| | 1.99 ± 0.07 | 1.96 ± 0.11 | 3.24 | .03 | 1.99 ± 0.06 | 1.94 ± 0.11 | 4.38* | .04 | 1.95 ± 0.07 | 1.89 ± 0.11 | 7.36** | .07 | 2.0 ± 0.06 | 1.95 ± 0.1 | 5.50* | .05 |
| | 2.36 ± 0.07 | 2.31 ± 0.11 | 5.18* | .05 | 2.36 ± 0.06 | 2.3 ± 0.11 | 9.27** | .09 | 2.33 ± 0.08 | 2.23 ± 0.13 | 15.00*** | .13 | 2.37 ± 0.06 | 2.31 ± 0.1 | 11.37** | .10 |
| | 2.72 ± 0.04 | 2.7 ± 0.05 | 3.13 | .03 | 2.72 ± 0.04 | 2.69 ± 0.05 | 6.98* | .07 | 2.7 ± 0.05 | 2.65 ± 0.07 | 16.99*** | | 2.72 ± 0.04 | 2.7 ± 0.05 | 7.71** | .07 |
| | 0.03 ± 0.01 | 0.03 ± 0.01 | 1.77 | .02 | 0.03 ± 0.01 | 0.03 ± 0.01 | 7.95** | .07 | 0.03 ± 0.01 | 0.04 ± 0.01 | 13.76*** | .12 | 0.03 ± 0.01 | 0.03 ± 0.01 | 3.53 | .03 |
| | 0.55 ± 0.08 | 0.51 ± 0.06 | 8.43** | .08 | 0.56 ± 0.06 | 0.51 ± 0.05 | 17.95*** | | 0.59 ± 0.07 | 0.57 ± 0.07 | 7.63** | .07 | 0.55 ± 0.07 | 0.51 ± 0.05 | 7.66** | .07 |
| | 126.5 ± 54.73 | 89.62 ± 41.16 | 10.48** | .10 | 128.32 ± 41.95 | 90.72 ± 36.79 | 20.13*** | | 140.33 ± 39.77 | 109.61 ± 37.3 | 18.58*** | | 124.69 ± 48.05 | 89.01 ± 37.79 | 11.59** | .10 |
| | 299.33 ± 38.45 | 279.74 ± 48.49 | 4.74* | .05 | 296.98 ± 36.47 | 274.64 ± 9.57 | 5.57* | .05 | 266.53 ± 42.48 | 230.66 ± 43.66 | 12.78*** | .11 | 298.89 ± 37.26 | 271.72 ± 45.88 | 9.99** | .09 |
* p < 0.05; ** p < 0.01; *** p < 0.001. In the ηp² column, bold marks features with large effect sizes. Feature names set in upright type are significant across all tasks.
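For a univariate test, partial eta squared follows from the F statistic and its degrees of freedom: ηp² = (F · df_effect) / (F · df_effect + df_error). The sketch below illustrates that relationship only. The degrees of freedom are hypothetical (1 and 100), since this excerpt does not report them, and the function name is illustrative.

```python
def partial_eta_squared(f_value: float, df_effect: int, df_error: int) -> float:
    """Partial eta squared: (F * df_effect) / (F * df_effect + df_error)."""
    if f_value < 0 or df_effect <= 0 or df_error <= 0:
        raise ValueError("F must be non-negative and both df must be positive")
    numerator = f_value * df_effect
    return numerator / (numerator + df_error)


# Hypothetical degrees of freedom; the study's actual values are not given here.
DF_EFFECT = 1
DF_ERROR = 100

# Three F values taken from the positive-emotion table.
for f_value in (2.92, 5.44, 13.87):
    effect = partial_eta_squared(f_value, DF_EFFECT, DF_ERROR)
    print(f"F = {f_value}: partial eta squared = {effect:.2f}")
```

Under these assumed degrees of freedom the three results round to .03, .05 and .12, the values tabulated next to those F statistics. That agreement does not confirm the degrees of freedom used in the study.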
Neutral emotion: the different acoustic features between depressed and healthy people under different tasks
| Feature | Video watching: healthy | Video watching: depressed | Video watching: F | Video watching: ηp² | Question answering: healthy | Question answering: depressed | Question answering: F | Question answering: ηp² | Text reading: healthy | Text reading: depressed | Text reading: F | Text reading: ηp² | Picture describing: healthy | Picture describing: depressed | Picture describing: F | Picture describing: ηp² |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| loudness | 0.37 ± 0.17 | 0.17 ± 0.17 | 27.22*** | | 0.38 ± 0.16 | 0.17 ± 0.17 | 26.13*** | | 0.49 ± 0.21 | 0.24 ± 0.24 | 21.20*** | | 0.34 ± 0.13 | 0.17 ± 0.17 | 22.20*** | |
| | 0.06 ± 3.79 | 0.97 ± 3.63 | 1.86 | .02 | 0.09 ± 3.45 | 0.57 ± 3.63 | 0.92 | .01 | 1.16 ± 3.24 | 1.63 ± 2.86 | 0.83 | .01 | −0.46 ± 3.16 | 1.02 ± 3.22 | 7.16** | .07 |
| | 8.62 ± 2.96 | 9.22 ± 3 | 0.79 | .01 | 8.75 ± 2.73 | 9.45 ± 2.83 | 1.08 | .01 | 8.37 ± 3.44 | 10.93 ± 4.02 | 11.05*** | .10 | 8.35 ± 2.5 | 8.99 ± 2.25 | 1.75 | .02 |
| | 7.33 ± 4.78 | 3.8 ± 3.39 | 15.82*** | .14 | 8.09 ± 4.57 | 3.34 ± 3.49 | 29.18*** | | 7.52 ± 5.94 | 2.46 ± 4.6 | 21.24*** | | 7.32 ± 3.83 | 4.08 ± 3.08 | 17.81*** | |
| | 6.29 ± 4.65 | 3.84 ± 4.24 | 8.64** | .08 | 5.09 ± 4.32 | 2.98 ± 4.46 | 5.83* | .06 | −0.46 ± 6.69 | −2.1 ± 6.39 | 0.87 | .01 | 6.21 ± 3.53 | 3.93 ± 3.96 | 7.44** | .07 |
| mfcc5 | 2.35 ± 6.43 | −4.5 ± 6.77 | 24.71*** | | 1.77 ± 5.49 | −5.83 ± 7.31 | 31.73*** | | −3.6 ± 5.9 | −12.42 ± 8.17 | 28.81*** | | 3.32 ± 5.21 | −3.2 ± 4.83 | 34.84*** | |
| | 4.09 ± 6.64 | 6.16 ± 6.28 | 0.83 | .01 | 3.54 ± 5.64 | 5.5 ± 6.7 | 0.63 | .01 | 0.34 ± 6.62 | 5.66 ± 7.66 | 10.12** | .09 | 4.23 ± 4.42 | 5.55 ± 6.14 | 0.24 | .00 |
| mfcc7 | −0.21 ± 6.08 | −7.21 ± 4.63 | 40.80*** | | −1 ± 5.28 | −8.08 ± 4.45 | 49.00*** | | −3.81 ± 5.68 | −11.79 ± 4.65 | 51.12*** | | −0.31 ± 4.3 | −7.06 ± 3.59 | 64.20*** | |
| | 1.85 ± 5.25 | 1.97 ± 4.73 | 0.01 | .00 | 0.35 ± 5.82 | 1.73 ± 4.65 | 1.86 | .02 | −2.78 ± 7.51 | −1.11 ± 5.86 | 2.25 | .02 | 1.96 ± 3.86 | 1.85 ± 3.26 | 0.14 | .00 |
| | 0.16 ± 5.01 | 2.66 ± 3.95 | 5.91* | .06 | −0.59 ± 5.51 | 1.67 ± 4.05 | 3.78 | .04 | −3.27 ± 5.9 | −0.37 ± 4.72 | 6.10* | .06 | 0.57 ± 4.18 | 2.4 ± 3.19 | 3.62 | .04 |
| | 1.9 ± 5.7 | 0.59 ± 5.51 | 2.01 | .02 | 1.44 ± 5.59 | 0.06 ± 4.63 | 3.57 | .03 | −0.44 ± 6.94 | −2.64 ± 6.22 | 2.19 | .02 | 1.95 ± 5.04 | 1.47 ± 4.11 | 0.41 | .00 |
| | −0.08 ± 4.46 | −0.8 ± 4.46 | 1.75 | .02 | −0.49 ± 4.59 | −0.54 ± 3.86 | 0.34 | .00 | −1.55 ± 6.06 | −2.2 ± 5.16 | 0.20 | .00 | −0.51 ± 3.43 | 0.19 ± 3.38 | 0.28 | .00 |
| | −2.2 ± 3.68 | −1.1 ± 3.2 | 0.95 | .01 | −1.83 ± 3.75 | −1.75 ± 3.16 | 0.11 | .00 | −3.23 ± 4.41 | −3.47 ± 3.21 | 0.80 | .01 | −2 ± 2.8 | −1.08 ± 2.11 | 1.10 | .01 |
| | 0.19 ± 0.04 | 0.2 ± 0.04 | 0.77 | .01 | 0.2 ± 0.03 | 0.21 ± 0.04 | 1.68 | .02 | 0.2 ± 0.03 | 0.2 ± 0.02 | 0.06 | .00 | 0.2 ± 0.03 | 0.2 ± 0.03 | 0.33 | .00 |
| | 0.62 ± 0.05 | 0.62 ± 0.07 | 0.05 | .00 | 0.61 ± 0.05 | 0.61 ± 0.06 | 0.71 | .01 | 0.56 ± 0.04 | 0.57 ± 0.04 | 2.22 | .02 | 0.63 ± 0.05 | 0.61 ± 0.05 | 3.94 | .04 |
| | 0.98 ± 0.06 | 0.99 ± 0.06 | 0.33 | .00 | 0.97 ± 0.06 | 0.98 ± 0.05 | 0.22 | .00 | 0.91 ± 0.07 | 0.95 ± 0.05 | 8.00** | .08 | 0.99 ± 0.05 | 0.98 ± 0.05 | 1.13 | .01 |
| | 1.34 ± 0.07 | 1.3 ± 0.09 | 5.06* | .05 | 1.34 ± 0.07 | 1.28 ± 0.09 | 9.67** | .09 | 1.27 ± 0.07 | 1.23 ± 0.08 | 2.58 | .03 | 1.35 ± 0.06 | 1.3 ± 0.07 | 13.91*** | .12 |
| | 1.67 ± 0.07 | 1.61 ± 0.1 | 10.99*** | .10 | 1.66 ± 0.07 | 1.59 ± 0.11 | 15.73*** | .14 | 1.6 ± 0.08 | 1.52 ± 0.1 | 14.48*** | .13 | 1.68 ± 0.06 | 1.61 ± 0.08 | 18.67*** | |
| | 2.0 ± 0.07 | 1.95 ± 0.11 | 5.46* | .05 | 2.0 ± 0.06 | 1.94 ± 0.12 | 7.88** | .07 | 1.94 ± 0.07 | 1.88 ± 0.1 | 7.76** | .07 | 2.01 ± 0.05 | 1.96 ± 0.1 | 9.47** | .09 |
| | 2.37 ± 0.06 | 2.31 ± 0.11 | 10.24** | .09 | 2.37 ± 0.06 | 2.29 ± 0.12 | 14.31*** | .13 | 2.32 ± 0.07 | 2.22 ± 0.12 | 17.25*** | | 2.38 ± 0.05 | 2.32 ± 0.09 | 15.02*** | .13 |
| | 2.72 ± 0.04 | 2.7 ± 0.05 | 5.91* | .06 | 2.72 ± 0.04 | 2.69 ± 0.06 | 11.33*** | .10 | 2.7 ± 0.05 | 2.64 ± 0.07 | 19.24*** | | 2.73 ± 0.03 | 2.7 ± 0.04 | 11.75*** | .11 |
| | 0.03 ± 0.01 | 0.03 ± 0.01 | 6.30* | .06 | 0.03 ± 0.01 | 0.03 ± 0.01 | 15.26*** | .13 | 0.03 ± 0.01 | 0.04 ± 0.01 | 15.01*** | .13 | 0.03 ± 0.01 | 0.03 ± 0.01 | 0.81 | .01 |
| | 0.56 ± 0.07 | 0.52 ± 0.06 | 8.06** | .08 | 0.56 ± 0.06 | 0.52 ± 0.05 | 9.81** | .09 | 0.6 ± 0.06 | 0.57 ± 0.06 | 8.54** | .08 | 0.54 ± 0.06 | 0.51 ± 0.05 | 8.18** | .08 |
| | 131.82 ± 57.54 | 90.43 ± 38.27 | 13.79*** | .12 | 128.75 ± 46.52 | 93.65 ± 38.37 | 12.83*** | .11 | 144.87 ± 36.71 | 111.44 ± 35.54 | 23.83*** | | 120.79 ± 42.4 | 84.84 ± 36.85 | 13.67*** | .12 |
| | 297.16 ± 40.91 | 271.51 ± 45.86 | 8.58** | .08 | 296.22 ± 40.37 | 269.71 ± 50.15 | 9.07** | .08 | 267.89 ± 40.63 | 266.53 ± 42.48 | 16.56*** | .14 | 305.99 ± 30.86 | 272.83 ± 41.98 | 20.20*** | |
* p < 0.05; ** p < 0.01; *** p < 0.001. In the ηp² column, bold marks features with large effect sizes. Feature names set in upright type are significant across all tasks.
Negative emotion: the different acoustic features between depressed and healthy people under different tasks
| Feature | Video watching: healthy | Video watching: depressed | Video watching: F | Video watching: ηp² | Question answering: healthy | Question answering: depressed | Question answering: F | Question answering: ηp² | Text reading: healthy | Text reading: depressed | Text reading: F | Text reading: ηp² | Picture describing: healthy | Picture describing: depressed | Picture describing: F | Picture describing: ηp² |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| loudness | 0.35 ± 0.14 | 0.16 ± 0.16 | 28.47*** | | 0.35 ± 0.15 | 0.16 ± 0.16 | 28.55*** | | 0.48 ± 0.21 | 0.23 ± 0.22 | 25.57*** | | 0.35 ± 0.14 | 0.17 ± 0.17 | 23.58*** | |
| | −0.29 ± 3.61 | 0.74 ± 3.81 | 2.03 | .02 | −0.37 ± 3.22 | 0.7 ± 3.91 | 2.26 | .02 | 1.23 ± 3.16 | 1.16 ± 2.83 | 0.00 | .00 | −0.23 ± 3.67 | 1.14 ± 3.22 | 4.38* | .04 |
| | 8.07 ± 3.17 | 9.07 ± 3.18 | 2.17 | .02 | 8.39 ± 2.86 | 8.88 ± 3.27 | 0.66 | .01 | 8.05 ± 3.53 | 10.88 ± 4.15 | 13.65*** | .12 | 8.22 ± 2.5 | 8.82 ± 2.63 | 1.26 | .01 |
| | 6.9 ± 4.83 | 3 ± 3.72 | 16.13*** | .14 | 7.42 ± 4.21 | 2.9 ± 3.47 | 29.55*** | | 7.49 ± 5.7 | 2.65 ± 4.19 | 18.44*** | | 6.68 ± 4.66 | 3.44 ± 3.56 | 11.25*** | .10 |
| | 6.39 ± 4.13 | 4.16 ± 4.05 | 7.16** | .07 | 6.0 ± 4.05 | 3.26 ± 4.43 | 9.97** | .09 | 0.76 ± 6.35 | −1.04 ± 6.44 | 0.62 | .01 | 5.68 ± 3.87 | 3.52 ± 4.59 | 5.48* | .05 |
| mfcc5 | 3.15 ± 5.4 | −4.4 ± 6.49 | 35.88*** | | 2.93 ± 5.57 | −4.37 ± 6.94 | 28.87*** | | −2.61 ± 6.2 | −11.26 ± 7.75 | 27.26*** | | 3.1 ± 4.95 | −3.51 ± 5.17 | 36.73*** | |
| | 3.85 ± 6.21 | 5.72 ± 6.88 | 0.54 | .01 | 4.33 ± 5.68 | 6.0 ± 6.51 | 0.47 | .00 | 0.72 ± 6.87 | 5.48 ± 8.15 | 7.73** | .07 | 4.2 ± 5.39 | 6.26 ± 6.73 | 1.05 | .01 |
| mfcc7 | 0.02 ± 5.5 | −7.46 ± 4.46 | 52.57*** | | −0.02 ± 4.93 | −7.51 ± 4.51 | 58.30*** | | −3.02 ± 5.37 | −11.55 ± 4.97 | 55.80*** | | −0.04 ± 5.08 | −6.95 ± 3.66 | 54.37*** | |
| | 1.86 ± 5.25 | 1.72 ± 4.71 | 0.05 | .00 | 1.22 ± 5.04 | 1.8 ± 4.62 | 0.30 | .00 | −2.48 ± 8.24 | −0.51 ± 6.01 | 3.09 | .03 | 2.26 ± 4.43 | 1.61 ± 3.88 | 0.70 | .01 |
| | 0.28 ± 4.86 | 2.56 ± 4.08 | 5.09* | .05 | 0.35 ± 5.33 | 1.96 ± 3.78 | 2.18 | .02 | −2.81 ± 6.25 | −0.01 ± 5 | 6.28* | .06 | 0.64 ± 4.78 | 2.42 ± 3.71 | 3.05 | .03 |
| | 1.62 ± 5.62 | 0.42 ± 5.03 | 1.55 | .02 | 2.21 ± 5.32 | 0.52 ± 4.39 | 4.73* | .05 | −1.1 ± 7.37 | −3.51 ± 6.39 | 1.86 | .02 | 1.68 ± 5.47 | 1.21 ± 4.97 | 0.29 | .00 |
| | −1.17 ± 4.27 | −1.29 ± 3.83 | 0.88 | .01 | −0.42 ± 4.25 | −0.35 ± 4 | 0.28 | .00 | −2.73 ± 5.8 | −2.63 ± 5.04 | 0.13 | .00 | −0.56 ± 3.93 | −0.14 ± 4 | 0.02 | .00 |
| | −2.03 ± 3.5 | −1.04 ± 3.41 | 1.12 | .01 | −1.4 ± 3.53 | −1.06 ± 2.83 | 0.09 | .00 | −3.31 ± 3.96 | −3.72 ± 3.47 | 0.75 | .01 | −1.98 ± 3.3 | −1.26 ± 2.44 | 0.35 | .00 |
| | 0.2 ± 0.03 | 0.2 ± 0.04 | 0.45 | .00 | 0.2 ± 0.03 | 0.21 ± 0.04 | 0.87 | .01 | 0.2 ± 0.03 | 0.2 ± 0.02 | 1.05 | .01 | 0.2 ± 0.04 | 0.2 ± 0.03 | 0.09 | .00 |
| | 0.63 ± 0.05 | 0.62 ± 0.06 | 0.47 | .00 | 0.63 ± 0.05 | 0.61 ± 0.07 | 1.16 | .01 | 0.57 ± 0.04 | 0.58 ± 0.04 | 5.51* | .05 | 0.63 ± 0.05 | 0.61 ± 0.06 | 1.83 | .02 |
| | 0.98 ± 0.06 | 0.99 ± 0.05 | 0.42 | .00 | 0.98 ± 0.06 | 0.98 ± 0.06 | 0.00 | .00 | 0.92 ± 0.07 | 0.95 ± 0.05 | 9.53** | .09 | 0.99 ± 0.06 | 0.98 ± 0.05 | 0.10 | .00 |
| | 1.34 ± 0.07 | 1.3 ± 0.09 | 7.05** | .07 | 1.34 ± 0.06 | 1.3 ± 0.09 | 8.82** | .08 | 1.28 ± 0.07 | 1.24 ± 0.08 | 2.34 | .02 | 1.35 ± 0.06 | 1.3 ± 0.08 | 10.17** | .09 |
| | 1.67 ± 0.07 | 1.61 ± 0.1 | 11.87*** | .11 | 1.68 ± 0.06 | 1.6 ± 0.11 | 15.23*** | .13 | 1.61 ± 0.08 | 1.53 ± 0.1 | 13.03*** | .12 | 1.68 ± 0.06 | 1.61 ± 0.09 | 15.65*** | .14 |
| | 2.0 ± 0.06 | 1.95 ± 0.11 | 5.91* | .06 | 2.0 ± 0.05 | 1.95 ± 0.12 | 7.66** | .07 | 1.94 ± 0.07 | 1.88 ± 0.11 | 6.16* | .06 | 2.0 ± 0.06 | 1.95 ± 0.1 | 7.73** | .07 |
| | 2.37 ± 0.06 | 2.3 ± 0.11 | 9.87** | .09 | 2.38 ± 0.05 | 2.31 ± 0.11 | 13.83*** | .12 | 2.32 ± 0.07 | 2.23 ± 0.13 | 16.43*** | .14 | 2.38 ± 0.06 | 2.31 ± 0.1 | 13.30*** | .12 |
| | 2.72 ± 0.04 | 2.7 ± 0.06 | 7.80** | .07 | 2.73 ± 0.03 | 2.7 ± 0.05 | 11.11*** | .10 | 2.7 ± 0.05 | 2.65 ± 0.07 | 15.37*** | .13 | 2.73 ± 0.04 | 2.7 ± 0.05 | 7.50** | .07 |
| | 0.03 ± 0.01 | 0.03 ± 0.01 | 7.68** | .07 | 0.03 ± 0.01 | 0.03 ± 0.01 | 9.07** | .08 | 0.03 ± 0.01 | 0.04 ± 0.01 | 14.70*** | .13 | 0.03 ± 0.01 | 0.03 ± 0.01 | 1.85 | .02 |
| | 0.55 ± 0.07 | 0.52 ± 0.05 | 5.67* | .05 | 0.54 ± 0.06 | 0.51 ± 0.05 | 7.34** | .07 | 0.6 ± 0.07 | 0.58 ± 0.07 | 9.10** | .08 | 0.55 ± 0.06 | 0.51 ± 0.05 | 8.00** | .07 |
| | 125.48 ± 51.38 | 92.61 ± 41.25 | 8.01** | .07 | 121.39 ± 45.68 | 87.43 ± 38.27 | 11.46*** | .10 | 147.96 ± 38.73 | 114.42 ± 37.68 | 21.22*** | | 122.53 ± 44.32 | 88.1 ± 37.42 | 12.32*** | .11 |
| | 298.82 ± 39.73 | 275.57 ± 47.45 | 6.47* | .06 | 302.75 ± 36.76 | 277.74 ± 49.99 | 7.53** | .07 | 271.56 ± 40.91 | 235.74 ± 42.61 | 14.26*** | .13 | 304.12 ± 35.79 | 272.66 ± 46.23 | 12.46*** | .11 |
* p < 0.05; ** p < 0.01; *** p < 0.001. In the ηp² column, bold marks features with large effect sizes. Feature names set in upright type are significant across all tasks.
Fig. 1 The number of significant acoustic features in each scenario (Task: VW, video watching; QA, question answering; TR, text reading; PD, picture describing. Emotion: pos, positive; neu, neutral; neg, negative)
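A per-scenario count of this kind can be tallied from the F columns of the feature tables by counting the cells that carry at least one asterisk (p < 0.05). The sketch below does so for one scenario, video watching under positive emotion, using the F cells from the positive-emotion table. Whether the figure used exactly this criterion is an assumption, and the variable and function names are illustrative.

```python
# F cells for video watching, positive emotion, in table row order.
vw_pos_f = [
    "34.07***", "2.92", "1.93", "9.31**", "6.57*",
    "27.60***", "1.10", "47.63***", "0.18", "4.51*",
    "0.09", "0.07", "1.10", "0.08", "0.36",
    "0.38", "4.70*", "8.55**", "3.24", "5.18*",
    "3.13", "1.77", "8.43**", "10.48**", "4.74*",
]


def is_significant(cell: str) -> bool:
    """A cell counts as significant if it ends with at least one asterisk."""
    return cell.strip().endswith("*")


n_significant = sum(is_significant(cell) for cell in vw_pos_f)
print(f"{n_significant} of {len(vw_pos_f)} features significant")
# 12 of 25 features significant
```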