| Literature DB >> 23805115 |
Simon Rigoulot1, Eugen Wassiliwizky, Marc D Pell.
Abstract
Recent studies suggest that the time course for recognizing vocal expressions of basic emotion in speech varies significantly by emotion type, implying that listeners uncover acoustic evidence about emotions at different rates in speech (e.g., fear is recognized most quickly whereas happiness and disgust are recognized relatively slowly; Pell and Kotz, 2011). To investigate whether vocal emotion recognition is largely dictated by the amount of time listeners are exposed to speech or the position of critical emotional cues in the utterance, 40 English participants judged the meaning of emotionally-inflected pseudo-utterances presented in a gating paradigm, where utterances were gated as a function of their syllable structure in segments of increasing duration from the end of the utterance (i.e., gated syllable-by-syllable from the offset rather than the onset of the stimulus). Accuracy for detecting six target emotions in each gate condition and the mean identification point for each emotion in milliseconds were analyzed and compared to results from Pell and Kotz (2011). We again found significant emotion-specific differences in the time needed to accurately recognize emotions from speech prosody, and new evidence that utterance-final syllables tended to facilitate listeners' accuracy in many conditions when compared to utterance-initial syllables. The time needed to recognize fear, anger, sadness, and neutral from speech cues was not influenced by how utterances were gated, although happiness and disgust were recognized significantly faster when listeners heard the end of utterances first. Our data provide new clues about the relative time course for recognizing vocally-expressed emotions within the 400-1200 ms time window, while highlighting that emotion recognition from prosody can be shaped by the temporal properties of speech.Entities:
Keywords: acoustics; auditory gating; prosody; speech perception; vocal emotions
Year: 2013 PMID: 23805115 PMCID: PMC3690349 DOI: 10.3389/fpsyg.2013.00367
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Duration of the stimuli presented in the experiment in each gate duration condition as a function of emotion.
| Duration | Anger | 370 | 585 | 771 | 1004 | 1230 | 1581 | 1759 |
| Disgust | 481 | 748 | 984 | 1290 | 1555 | 1958 | 2153 | |
| Fear | 329 | 498 | 636 | 795 | 930 | 1151 | 1269 | |
| Sadness | 405 | 626 | 815 | 1071 | 1286 | 1645 | 1846 | |
| Happiness | 375 | 601 | 763 | 978 | 1164 | 1478 | 1648 | |
| Neutral | 354 | 540 | 703 | 896 | 1122 | 1401 | 1553 | |
Pseudo-utterances were always gated at syllable boundaries from the offset of the utterance in gates of increasing syllable duration.
Mean accuracy (% target recognition) of the 40 listeners who judged pseudo-utterances conveying each emotion according to the gate duration, when utterances were gated from the offset of the sentence.
| Accuracy | Anger | 51.9 (33.9) | 73.0 (25.7) | 79.9 (22.6) | 79.0 (26.9) | 80.3 (26.3) | 81.8 (22.9) | 85.8 (17.9) |
| Disgust | 27.5 (16.9) | 44.3 (17.2) | 59.3 (13.1) | 64.3 (15.7) | 71.0 (15.2) | 71.4 (15.8) | 74.5 (14.5) | |
| Fear | 77.5 (15.6) | 85.4 (16.0) | 91.9 (8.3) | 95.9 (3.7) | 96.3 (4.1) | 95.4 (4.7) | 94.6 (3.9) | |
| Sadness | 65.6 (22.8) | 83.9 (13.9) | 87.0 (11.3) | 90.9 (12.1) | 92.8 (7.5) | 95.1 (5.0) | 94.4 (6.4) | |
| Happiness | 30.6 (27.9) | 53.8 (34.4) | 66.1 (32.7) | 71.8 (32.0) | 77.6 (25.9) | 82.4 (24.5) | 89.1 (13.5) | |
| Neutral | 55.3 (12.8) | 68.5 (11.8) | 73.4 (12.5) | 83.9 (10.8) | 81.5 (11.2) | 85.4 (8.5) | 86.6 (8.2) | |
Standard deviations are shown in parentheses.
Figure 1Mean Hu-scores (unbiased accuracy) for each emotion as a function of the gate duration (number of syllables).
Figure 2Comparison of mean accuracy (Hu) scores for each emotion as a function of gate duration (number of syllables) and the direction of presentation (forward vs. backward). Data in the forward condition are taken from Pell and Kotz (2011).
Figure 3Comparison of mean confidence ratings for each emotion as a function of gate duration (number of syllables) and the direction of presentation (forward vs. backward). Data in the forward condition are taken from Pell and Kotz (2011).
Figure 4Comparison of mean identification points (in milliseconds) for each emotion as a function of direction of presentation (forward vs. backward). Data in the forward condition are taken from Pell and Kotz (2011).