| Literature DB >> 32899881 |
Shuji Shinohara1, Hiroyuki Toda2, Mitsuteru Nakamura1, Yasuhiro Omiya3, Masakazu Higuchi1, Takeshi Takano3, Taku Saito2, Masaaki Tanichi2, Shuken Boku4, Shunji Mitsuyoshi1, Mirai So5, Aihide Yoshino2, Shinichi Tokuno1.
Abstract
Recently, the relationship between emotional arousal and depression has been studied. Focusing on this relationship, we first developed an arousal level voice index (ALVI) to measure arousal levels using the Interactive Emotional Dyadic Motion Capture database. Then, we calculated ALVI from the voices of depressed patients from two hospitals (Ginza Taimei Clinic (H1) and National Defense Medical College hospital (H2)) and compared them with the severity of depression as measured by the Hamilton Rating Scale for Depression (HAM-D). Depending on the HAM-D score, the datasets were classified into a no depression (HAM-D < 8) and a depression group (HAM-D ≥ 8) for each hospital. A comparison of the mean ALVI between the groups was performed using the Wilcoxon rank-sum test and a significant difference at the level of 10% (p = 0.094) at H1 and 1% (p = 0.0038) at H2 was determined. The area under the curve (AUC) of the receiver operating characteristic was 0.66 when categorizing between the two groups for H1, and the AUC for H2 was 0.70. The relationship between arousal level and depression severity was indirectly suggested via the ALVI.Entities:
Keywords: Hamilton Rating Scale for Depression; Hurst exponent; arousal level; emotion; major depression severity; voice index; zero-crossing rate
Mesh:
Year: 2020 PMID: 32899881 PMCID: PMC7570922 DOI: 10.3390/s20185041
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Ten phrases used for recording.
| Phrase | Phrase in Japanese | Purpose (Meaning) |
|---|---|---|
| P1 | I-ro-ha-ni-ho-he-to | Non-emotional (no meaning, similar to “a-b-c”) |
| P2 | Honjitsu ha seiten nari | Non-emotional (It is fine today) |
| P5 | Mukashi aru tokoro ni | Non-emotional (Once upon a time, there lived) |
| P11 | Garapagosu shotou | Check pronunciation (Galápagos Islands) |
| P12 | Tsukarete guttari shiteimasu. | Emotional (I am tired/dead tired) |
| P13 | Totemo genki desu | Emotional (I am very cheerful) |
| P14 | Kinou ha yoku nemuremashita | Emotional (I was able to sleep well yesterday) |
| P15 | Shokuyoku ga arimasu | Emotional (I have an appetite) |
| P16 | Okorippoi desu | Emotional (I am irritable) |
| P17 | Kokoroga odayaka desu | Emotional (My heart is calm) |
Participants’ information.
| Hospital | Sex | Number of Subjects | Mean Age ± SD |
|---|---|---|---|
| H1 | Female | 55 | 31.6 ± 8.6 |
| Male | 33 | 32.5 ± 6.5 | |
| Total | 88 | 32.0 ± 7.9 | |
| H2 | Female | 44 | 62.0 ± 13.1 |
| Male | 46 | 48.8 ± 13.5 | |
| Total | 90 | 55.2 ± 14.8 |
Note: H1: Ginza Taimei Clinic; H2: National Defense Medical College; SD: Standard deviation.
Figure 1A scatter plot between Hurst exponent and zero-crossing rate calculated from the utterances in the Interactive Emotional Dyadic Motion Capture database. Each data point represents data for each utterance. The low arousal level data are shown in blue and the high arousal level data are shown in orange. (Note. ZCR: zero-crossing rate; HE: Hurst exponent; IEMOCAP: Interactive Emotional Dyadic Motion Capture).
Figure 2Receiver operating characteristic curve when arousal level voice index identifies utterance data of low arousal level and high arousal level in the Interactive Emotional Dyadic Motion Capture database. The horizontal and vertical axes represent 1-specificity (false positive rate) and sensitivity (positive rate), respectively. (Note. ALVI: arousal level voice index).
Mean scores on the Hamilton Rating Scale for Depression.
| Hospital | Group | Number of Subjects | Mean HAM-D Score ± SD |
|---|---|---|---|
| H1 | No depression | 10 | 4.8 ± 1.3 |
| Depression | 78 | 24.4 ± 8.5 | |
| Total | 88 | 22.2 ± 10.1 | |
| H2 | No depression | 65 | 2.2 ± 2.2 |
| Depression | 25 | 15.3 ± 7.2 | |
| Total | 90 | 5.8 ± 7.2 |
Note. HAM-D: Hamilton Rating Scale for Depression; SD: Standard deviation.
Figure 3A scatter diagram of the Hurst exponent and zero crossing-rate calculated from each utterance (n = 1780) of depressed patients. The data of the no depression group and the depression group are shown in orange and blue, respectively. (Note. HAM-D: Hamilton Rating Scale for Depression; HE: Hurst exponent; ZCR: zero crossing rate).
Figure 4(a) The mean arousal level voice index for depression and no depression groups per hospital. Error bars represent standard error. (b) The mean HAM-D score for depression and no depression groups per hospital. Error bars represent standard deviation. *** (p < 0.001), ** (p < 0.01), * (p < 0.1). (Note. HAM-D: Hamilton Rating Scale for Depression; H1: Ginza Taimei Clinic; H2: National Defense Medical College; ALVI: arousal level voice index).
Figure 5The mean of arousal level voice index of the no depression and depression groups for each phrase. (a) represents Ginza Taimei Clinic (H1) and (b) represents National Defense Medical College (H2). Error bars represent standard error. (Note. ALVI: arousal level voice index; HAM-D: Hamilton Rating Scale for Depression).
A summary of classification performance between the no depression and depression groups through arousal level voice index.
| Phrase | AUC | |||
|---|---|---|---|---|
| H1 | H2 | H1 | H2 | |
| P1 | 0.33 | 0.027 * | 0.60 | 0.65 |
| P2 | 0.30 | 0.0092 ** | 0.60 | 0.68 |
| P5 | 0.20 |
| 0.63 |
|
| P11 | 0.096 * | 0.29 | 0.66 | 0.57 |
| P12 |
| 0.016 * |
| 0.67 |
| P13 | 0.17 | 0.0047 ** | 0.63 | 0.69 |
| P14 | 0.096* | 0.0062 ** | 0.66 | 0.69 |
| P15 | 0.099* | 0.040 * | 0.66 | 0.64 |
| P16 | 0.19 | 0.022 * | 0.63 | 0.68 |
| P17 | 0.28 | 0.028 * | 0.61 | 0.65 |
Note. H1: Ginza Taimei Clinic; H2: National Defense Medical College; AUC: Area under the curve. The minimum p-value and maximum AUC for each hospital are shown in bold type. *** (p < 0.001), ** (p < 0.01), * (p < 0.1). a By Wilcoxon rank sum test.