| Literature DB >> 34357541 |
Mitchel Kappen1,2,3, Kristof Hoorelbeke4, Nilesh Madhu5, Kris Demuynck5, Marie-Anne Vanderhasselt6,7.
Abstract
Recently, the possibilities of detecting psychosocial stress from speech have been discussed. Yet, there are mixed effects and a current lack of clarity in relations and directions for parameters derived from stressed speech. The aim of the current study is - in a controlled psychosocial stress induction experiment - to apply network modeling to (1) look into the unique associations between specific speech parameters, comparing speech networks containing fundamental frequency (F0), jitter, mean voiced segment length, and Harmonics-to-Noise Ratio (HNR) pre- and post-stress induction, and (2) examine how changes pre- versus post-stress induction (i.e., change network) in each of the parameters are related to changes in self-reported negative affect. Results show that the network of speech parameters is similar after versus before the stress induction, with a central role of HNR, which shows that the complex interplay and unique associations between each of the used speech parameters is not impacted by psychosocial stress (aim 1). Moreover, we found a change network (consisting of pre-post stress difference values) with changes in jitter being positively related to changes in self-reported negative affect (aim 2). These findings illustrate - for the first time in a well-controlled but ecologically valid setting - the complex relations between different speech parameters in the context of psychosocial stress. Longitudinal and experimental studies are required to further investigate these relationships and to test whether the identified paths in the networks are indicative of causal relationships.Entities:
Keywords: Fundamental frequency; Harmonics-to-noise ratio; Jitter; Psychological stress; Speech; Stress; Sympathetic nervous system; Voice stress analysis; Voiced speech
Mesh:
Year: 2021 PMID: 34357541 PMCID: PMC9046336 DOI: 10.3758/s13428-021-01670-x
Source DB: PubMed Journal: Behav Res Methods ISSN: 1554-351X
Fig. 1a; left Negative affect pre- and post-stress induction. b; right EDA pre-, during, and post-stress induction
Fig. 2Unique associations between the voice parameters pre- (a; left) and post-stressor (b; right). Note. Edges in the models represent the unique associations between each of the nodes. Edge thickness reflects the strength of association, where strong associations are presented using thicker edges. Blue/Full edges represent positive associations, whereas red/dashed edges represent negative associations; the edge weights presented in the model can also be found in the edge weight matrix (Supplemental Tables 6, 7). Node predictability (R2) is visualized as a pie chart around each node and can also be found in Supplementary Table 1
Fig. 3Strength centrality
Node predictability for pre-stressor network (aim 1), post-stressor network (aim 1), and change network (aim 2)
| Node | R2 Pre-stressor network | R2 Post-stressor network | Change network |
|---|---|---|---|
| F0 | .78 | .72 | .16 |
| HNR | .85 | .83 | .37 |
| JIT | .41 | .42 | .35 |
| VO | .40 | .44 | .19 |
| F1/2 | .24 | .31 | .12 |
| NA | .02 |
Fig. 4Unique associations between change in negative affect and speech parameters. Note. Edges in the model represent the unique associations between each of the nodes. Edge thickness reflects the strength of association, where strong associations are presented using thicker edges. Blue/Full edges represent positive associations, whereas red/dashed edges represent negative associations; the edge weights presented in the model can also be found in the edge weight matrix (Supplemental table 8). Node predictability (R2) is visualized as a pie chart around each node and can also be found in Table 1