| Literature DB >> 23785347 |
Irena Yanushevskaya1, Christer Gobl, Ailbhe Ní Chasaide.
Abstract
In emotional speech research, it has been suggested that loudness, along with other prosodic features, may be an important cue in communicating high activation affects. In earlier studies, we found different voice quality stimuli to be consistently associated with certain affective states. In these stimuli, as in typical human productions, the different voice qualities entailed differences in loudness. To examine the extent to which the loudness differences among these voice qualities might influence the affective coloring they impart, two experiments were conducted with the synthesized stimuli, in which loudness was systematically manipulated. Experiment 1 used stimuli with distinct voice quality features including intrinsic loudness variations and stimuli where voice quality (modal voice) was kept constant, but loudness was modified to match the non-modal qualities. If loudness is the principal determinant in affect cueing for different voice qualities, there should be little or no difference in the responses to the two sets of stimuli. In Experiment 2, the stimuli included distinct voice quality features but all had equal loudness to test the hypothesis that equalizing the perceived loudness of different voice quality stimuli will have relatively little impact on affective ratings. The results suggest that loudness variation on its own is relatively ineffective whereas variation in voice quality is essential to the expression of affect. In Experiment 1, stimuli incorporating distinct voice quality features consistently obtained higher ratings than the modal voice stimuli with varied loudness. In Experiment 2, non-modal voice quality stimuli proved potent in affect cueing even with loudness differences equalized. Although loudness per se does not seem to be the major determinant of perceived affect, it can contribute positively to affect cueing: when combined with a tense or modal voice quality, increased loudness can enhance signaling of high activation states.Entities:
Keywords: affect; emotion; intensity; loudness; perception; voice quality
Year: 2013 PMID: 23785347 PMCID: PMC3684800 DOI: 10.3389/fpsyg.2013.00335
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
The scaling factors and the difference in dB between the modal stimulus and the stimuli selected as best loudness matches for the voice-quality-varying stimuli.
| Modal vs. whispery | −73 (1.2) | 0.43 |
| Modal vs. breathy | −4.0 (1.3) | 0.63 |
| Modal vs. lax-creaky | −2.8 (0.8) | 0.73 |
| Modal vs. harsh | +2.6 (1.5) | 1.35 |
| Modal vs. tense | +3.1 (1.2) | 1.43 |
Figure 1Affective ratings of the stimuli in Experiment 1. Voice-quality-varying stimuli are shown in red and the corresponding loudness-matched modal stimuli are shown in black. As a reference, the ratings obtained for the modal voice stimulus are also shown (white data points joined by a fine black line). Plotted are estimated marginal means and 95% confidence intervals. Shaded is the area of weak affect cueing.
Results of the two-way repeated measures ANOVA in Experiment 1 for the six subtests.
| A-I | Part 1 | |||
| Part 2 | ||||
| B-I | Part 1 | |||
| Part 2 | ||||
| I-F | Part 1 | |||
| Part 2 | ||||
| R-S | Part 1 | |||
| Part 2 | ||||
| S-H | Part 1 | |||
| Part 2 | ||||
| S-F | Part 1 | |||
| Part 2 | ||||
The abbreviations in the left column are as follows: A-I, apologetic-indignant; B-I, bored-interested; I-F, intimate–formal; R-S, relaxed-stressed; S-H, sad-happy; S-F, scared-fearless. Part 1: breathy, whispery, lax-creaky voice;part 2: harsh and tense voice.
p < 0.05;
p < 0.01;
p < 0.001.
Pairwise comparisons in Experiment 1 (using Bonferroni adjustment for multiple comparisons; the mean difference is significant at the 0.05 level).
| A-I | Whispery | 1.18 | 2.19 | 1.02 | |||
| Breathy | 0.79 | 2.28 | 1.49 | ||||
| Lax-creaky | 0.59 | 0.90 | 1.39 | 0.13 | 0.79 | ||
| Harsh | −0.49 | −1.67 | 1.18 | ||||
| Tense | −0.64 | −1.6 | 0.96 | ||||
| B-I | Whispery | 1.25 | 1.79 | 0.57 | |||
| Breathy | 1.09 | 1.56 | 0.47 | ||||
| Lax-creaky | 0.71 | 2.96 | 2.26 | ||||
| Harsh | 0.25 | −0.44 | 1 | −0.13 | 0.41 | 0.32 | |
| Tense | −0.62 | 0.03 | −0.93 | 0.11 | 0.31 | ||
| I-F | Whispery | 1.52 | 2.76 | 1.24 | |||
| Breathy | 1.34 | 2.65 | 1.31 | ||||
| Lax-creaky | 0.88 | 3.05 | 2.14 | ||||
| Harsh | −0.59 | 1 | −0.58 | 0.97 | 0.01 | ||
| Tense | −0.92 | 0.18 | −0.91 | 0.98 | 0.01 | ||
| R-S | Whispery | 1.32 | 2.06 | 0.74 | |||
| Breathy | 1.11 | 2.31 | 1.19 | ||||
| Lax-creaky | 0.75 | 3.41 | 2.93 | ||||
| Harsh | −0.48 | −1.5 | 1.03 | ||||
| Tense | −0.72 | −1.51 | 0.79 | ||||
| S-H | Whispery | 1.08 | 2.11 | 1.04 | |||
| Breathy | 0.78 | 1.95 | 1.17 | ||||
| Lax-creaky | 0.34 | 0.46 | 2.97 | 2.51 | |||
| Harsh | −0.42 | 1 | −0.39 | 0.93 | 0.03 | ||
| Tense | −0.46 | 0.97 | −0.65 | 0.35 | 0.19 | ||
| S-F | Whispery | 0.91 | 0.76 | 0.05 | 1.42 | 0.66 | |
| Breathy | 1 | 0.56 | 1 | 0.82 | 0.18 | 0.26 | |
| Lax-creaky | 1 | 0.4 | 1 | 0.04 | 0.94 | 0.04 | |
| Harsh | 0.55 | −0.38 | 1 | 0.031 | 0.28 | 0.41 | |
| Tense | 0.99 | −0.47 | 1 | −0.29 | 0.50 | 0.18 | |
Mod(L), loudness-matched modal stimuli; VQ, voice-quality-varying stimuli.
p < 0.05;
p < 0.01;
p < 0.001.
Intraclass correlation coefficients (.
| Apologetic-indignant | 0.73 | 0.98 | Substantial |
| Bored-interested | 0.74 | 0.98 | Substantial |
| Intimate–formal | 0.86 | 0.99 | Outstanding |
| Relaxed-stressed | 0.89 | 0.99 | Outstanding |
| Sad-happy | 0.79 | 0.98 | Substantial |
| Scared-fearless | 0.14 | 0.72 | Poor |
Results of the two-way repeated measures ANOVA in Experiment 2 for the six subtests.
| A-I | |||
| B-I | |||
| I-F | |||
| R-S | |||
| S-H | |||
| S-F | |||
The abbreviations in the left column are as follows: A-I, apologetic-indignant; B-I, bored-interested; I-F, intimate–formal; R-S, relaxed-stressed; S-H, sad-happy; S-F, scared-fearless.
*p < 0.05;
p < 0.01;
p < 0.001.
Pairwise comparisons in Experiment 2 (using Bonferroni adjustment for multiple comparisons; the mean difference is significant at the 0.05 level): (A) compared are the ratings for each voice quality, for the three loudness series (Q, M, and L) and (B) compared are the ratings for different voice quality within each series.
| Whispery | Q vs. M | 0.20 | 0.97 | 0.97 | 0.19 | |||
| Q vs. L | ||||||||
| M vs. L | 0.27 | 0.05 | 0.05 | 0.27 | ||||
| Breathy | Q vs. M | 0.43 | 1.000 | 1.000 | 1.000 | |||
| Q vs. L | 0.06 | 1.000 | 0.30 | 0.06 | ||||
| M vs. L | 0.09 | 0.40 | ||||||
| Lax-creaky | Q vs. M | 0.63 | 0.112 | 1.000 | 1.000 | 0.14 | 0.18 | |
| Q vs. L | 0.58 | 1.000 | 1.000 | 0.25 | 0.06 | |||
| M vs. L | 1.000 | 0.449 | 1.000 | 1.000 | 0.78 | 0.12 | ||
| Modal | Q vs. M | 0.06 | 0.65 | 0.65 | ||||
| Q vs. L | ||||||||
| M vs. L | 0.39 | |||||||
| Tense | Q vs. M | 0.22 | 0.11 | 0.24 | 0.24 | |||
| Q vs. L | ||||||||
| M vs. L | 0.43 | 0.37 | ||||||
| Series Q | Whispery | Breathy | 0.07 | 1.000 | 1.000 | |||
| Lax-creaky | 1.000 | 1.000 | 1.000 | |||||
| Modal | 1.000 | 0.31 | ||||||
| Tense | 0.23 | 1.000 | ||||||
| Breathy | Lax-creaky | 1.000 | 0.14 | 0.81 | ||||
| Modal | 0.28 | 1.000 | ||||||
| Tense | 0.09 | 1.000 | ||||||
| Lax-creaky | Modal | 1.000 | ||||||
| Tense | 1.000 | |||||||
| Modal | Tense | 0.08 | 1.000 | |||||
| Series M | Whispery | Breathy | 0.13 | 1.000 | 1.000 | 1.000 | 1.000 | |
| Lax-creaky | 0.77 | 1.000 | 0.81 | |||||
| Modal | 1.000 | 0.09 | ||||||
| Tense | 0.48 | 1.000 | ||||||
| Breathy | Lax-creaky | 1.000 | 0.132 | 0.64 | 0.76 | |||
| Modal | 0.29 | 1.000 | ||||||
| Tense | 0.14 | 1.000 | ||||||
| Lax-creaky | Modal | 1.000 | ||||||
| Tense | 1.000 | |||||||
| Modal | Tense | 0.24 | 1.000 | |||||
| Series L | Whispery | Breathy | 1.000 | 1.000 | 1.000 | 1.000 | ||
| Lax-creaky | 1.000 | 0.051 | 0.20 | |||||
| Modal | 1.000 | 0.89 | ||||||
| Tense | 1.000 | 1.000 | ||||||
| Breathy | Lax-creaky | 1.000 | 0.15 | 1.000 | ||||
| Modal | 1.000 | 1.000 | ||||||
| Tense | 1.000 | 1.000 | ||||||
| Lax-creaky | Modal | 1.000 | ||||||
| Tense | 1.000 | |||||||
| Modal | Tense | 1.000 | 0.63 | 0.62 | ||||
The abbreviations in the first row are as follows: A-I, apologetic-indignant; B-I, bored-interested; I-F, intimate–formal; R-S, relaxed-stressed; S-H, sad-happy; S-F, scared-fearless.
p < 0.05;
p < 0.01;
p < 0.001.
Intraclass correlation coefficients (.
| Apologetic-indignant | 0.78 | 0.98 | Substantial |
| Bored-interested | 0.55 | 0.95 | Moderate |
| Intimate–formal | 0.78 | 0.98 | Substantial |
| Relaxed-stressed | 0.77 | 0.98 | Substantial |
| Sad-happy | 0.54 | 0.95 | Moderate |
| Scared-fearless | 0.1 | 0.63 | Poor |
Figure 2Affective ratings of the loudness-equalized voice quality stimuli in Experiment 2. Q, “quieter” version; M, loudness level of the original modal stimulus; L, “louder” version. Plotted are estimated marginal means. Shaded is the area of weak affect cueing.
| Whispery voice_M | 2.31 |
| Breathy voice_M | 1.59 |
| Lax-creaky voice_M | 1.38 |
| Modal voice | 1 |
| Tense voice_M | 0.70 |