| Literature DB >> 21847385 |
Rebecca Jürgens1, Kurt Hammerschmidt, Julia Fischer.
Abstract
Play-acted emotional expressions are a frequent aspect in our life, ranging from deception to theater, film, and radio drama, to emotion research. To date, however, it remained unclear whether play-acted emotions correspond to spontaneous emotion expressions. To test whether acting influences the vocal expression of emotion, we compared radio sequences of naturally occurring emotions to actors' portrayals. It was hypothesized that play-acted expressions were performed in a more stereotyped and aroused fashion. Our results demonstrate that speech segments extracted from play-acted and authentic expressions differ in their voice quality. Additionally, the play-acted speech tokens revealed a more variable F(0)-contour. Despite these differences, the results did not support the hypothesis that the variation was due to changes in arousal. This analysis revealed that differences in perception of play-acted and authentic emotional stimuli reported previously cannot simply be attributed to differences in arousal, but by slight and implicitly perceptible differences in encoding.Entities:
Keywords: acoustic analysis; acting; authenticity; emotions; vocal expressions
Year: 2011 PMID: 21847385 PMCID: PMC3148714 DOI: 10.3389/fpsyg.2011.00180
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Number of vowels and speakers per condition.
| Vowels | Speaker vowels | Speaker fragments | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Authentic | Play-acted | Authentic | Play-acted | Authentic | Play-acted | ||||||||||||||
| a, | e, | i | a, | e, | i | a, | e, | i | a, | e, | i | ||||||||
| Male | Fear | 16, | 16, | 8 | 16, | 14, | 15 | 7, | 7, | 5 | 4, | 4, | 3 | 8 | 5 | ||||
| Anger | 25, | 20, | 21 | 14, | 16, | 10 | 12, | 12, | 10 | 6, | 6, | 5 | 12 | 6 | |||||
| Joy | 19, | 24, | 16 | 17, | 25, | 12 | 10, | 10, | 10 | 5, | 6, | 6 | 10 | 6 | |||||
| Sadness | 24, | 18, | 13 | 15, | 18, | 12 | 9, | 9, | 7 | 4, | 5, | 5 | 10 | 5 | |||||
| Female | Fear | 20, | 19, | 9 | 13, | 17, | 7 | 9, | 9, | 5 | 5, | 5, | 4 | 9 | 5 | ||||
| Anger | 21, | 20, | 16 | 14, | 18, | 6 | 10, | 10, | 9 | 6, | 7, | 4 | 10 | 6 | |||||
| Joy | 23, | 27, | 14 | 22, | 18, | 6 | 8, | 8, | 8 | 4, | 5, | 4 | 10 | 5 | |||||
| Sadness | 18, | 17, | 13 | 11, | 14, | 3 | 9, | 9, | 9 | 4, | 5, | 3 | 10 | 5 | |||||
aGiven are the vowels with a tonality in more than 10% of all time segments and with correctly calculated first and second formants that were used for the averaging over the speakers. The total number of vowels is 770.
bThe different vowels of one condition were spoken by the same speakers, while the speakers are independent across emotion, gender, and source condition.
cSpeakers are not totally independent of each other, as some contribute more than one stimulus to the set.
Influence of the different conditions on the factors for the vowels “a,” “e,” and “i.”
| Factors | Explained variance (%) | SOURCE | GENDER | ||||
|---|---|---|---|---|---|---|---|
| a | e | a | e | I | |||
| F1 | Peak frequency (PF), first quartile of distribution of frequency amplitudes (dfa 1) | 19.90 | 0.047 ↓ | – | – | – | 0.027 ↑ |
| F2 | Frequency range, second quartile of DFA (dfa 2), third quartile of DFA (dfa3) | 14.20 | – | – | – | – | – |
| F3 | Trend and modulation of the PF | 7.80 | – | – | – | – | – |
| F4 | Fundamental frequency | 6.70 | – | – | 0.000 ↑ | 0.000 ↑ | 0.000 ↑ |
| F5 | Percentage of tonal segments | 4.50 | – | 0.010 ↓ | – | – | – |
| F6 | Bandwidth of the first formant (BWF1) | 3.80 | 0.000 ↑ | – | – | – | – |
| F7 | Amplitude ratio between first harmonic and F0 (amprat2), and between third and second harmonic (amprat3) | 3.40 | 0.001 ↓ | 0.004 ↓ | – | 0.002 ↓ | 0.002 ↓ |
| F8 | Harmonic-to-noise-ratio (HNR) | 3.40 | – | – | – | 0.006 ↑ | 0.011 ↑ |
| F9 | Jitter | 3.00 | – | – | – | – | – |
| F10 | Location of maximum frequency amplitudes | 2.90 | – | – | – | – | – |
| F11 | Shimmer | 2.50 | – | 0.042 ↑ | – | – | – |
| F12 | Location of the minimum PF, location of maximum | 2.40 | – | – | – | – | – |
| correlation coefficient of successive time segments | |||||||
| F13 | Correlation coefficient of successive time segments | 2.20 | – | – | – | – | – |
Given are the interpretations of the factors, their explained variance, and the p-values of the LMMs. Upward directed arrows indicate an increased value from authentic to play-acted (SOURCE) or from male to female (GENDER), downward directed arrows indicate a decreased value.
Figure 1Differences in the factor loadings between authentic and play-acted speech tokens. Given are the median, the lower, and the upper quartile. Whiskers represent the values within the 1.5 interquartile range. The figures (A) to (E) represent the 5 factors that varied in the SOURCE condition. The asterisks mark the significant levels of the differences found by the LMMs (*<0.05, **<0.01, ***<0.001).
Figure 2Differences in the energy distribution of the lower frequency bands between authentic and play-acted utterances. The authentic stimulus possesses high amplitude ratios between the second harmonic and the F0 (amprat 2 = 3) and between third and first harmonic (amprat 3 = 2.71), while the play-acted stimulus is characterized by lower values (amprat 2 = 0.28, amprat 3 = 0.3). The differences in the F0 positions are due to individual differences. Given are the FFT spectrograms of one authentic and one play-acted female spoken “a” with a sampling rate of 2.7 kHz (Avisoft-SASLabPro).
Figure 3F. The pitch-contour was generated using Praat.
Abbreviations and descriptions of the acoustic parameters with high factor loadings.
| Parameter | Factor | Description |
|---|---|---|
| pf mean, max, min (Hz) | F1 | Mean, maximum, and minimum of the frequencies with the highest amplitude across time segments (PF) |
| diff mean, max, min (Hz) | F1 | Mean, maximum, and minimum differences between F0 and PF |
| dfa1 mean, max, min (Hz) | F1 | Mean, maximum, and minimum frequency, frequency at which the amplitude distribution reaches the first quartile across all time segments (distribution of frequency amplitudes = dfa) |
| dfa2 min (Hz) | F1 | Minimum frequency at which the amplitude distribution reaches the second quartile |
| fp1 mean (HZ) | F1 | Mean frequency of the first global frequency peak |
| pf total max, min (Hz) | F1 | Frequency of the total maximum and the total minimum amplitude |
| f2 mean (Hz) | F2 | Mean frequency of the second global frequency peak |
| Range mean, max, min (Hz) | F2 | Difference between highest and lowest frequency within a segment, mean across time segments, maximum, minimum |
| dfa2 mean, max (Hz) | F2 | Mean and maximum frequency at which the amplitude distribution reaches the second quartile |
| dfa3 mean, max, min (Hz) | F2 | Mean, maximum, and minimum frequency at which the amplitude distribution reaches the second quartile |
| pf jump | F3 | Maximum differences between successive PFs |
| pf trend mean, max | F3 | Mean and maximum deviation between pf and linear trend |
| F0 mean, max, min (Hz) | F4 | Mean, maximum, and minimum fundamental frequency across tonal time segments |
| Tonality (%) | F5 | Percentage of tonal time segments |
| BWF1 (Hz) | F6 | Bandwidth of the first formant |
| amprat2 | F7 | Amplitude ratio between second harmonic and F0 |
| amprat3 | F7 | Amplitude ratio between third and first harmonic |
| HNR mean, max | F8 | Differences between highest and lowest frequency within a segment, mean across all time segments, maximum (1 = no noise) |
| Jitter mean, max | F9 | Mean and maximum cycle-to-cycle variations in the F0, across all time segments |
| dfa1 max location | F10 | Relative position of the maximum value of the first dfa1 (0 = beginning of the call and 1 = end of the call) [(1/duration) × location] |
| dfa2 max location | F10 | Relative position of the maximum value of the second dfa1 [(1/duration) × location] |
| Shimmer mean, max | F11 | Mean and maximum cycle-to-cycle variations in the amplitude, across all time segments |
| cs max location | F12 | Relative position of the maximum value of the correlation coefficient of successive time segments [(1/duration) × location] |
| pf minimum location | F12 | Relative position of the minimum value of the peak frequency [(1/duration) × location] |
| cs mean | F13 | Mean correlation coefficient of successive time segments |
Detailed descriptions were partly taken from Hammerschmidt and Jürgens (.