| Literature DB >> 31898261 |
Roza G Kamiloğlu1, Agneta H Fischer2, Disa A Sauter2.
Abstract
Researchers examining nonverbal communication of emotions are becoming increasingly interested in differentiations between different positive emotional states like interest, relief, and pride. But despite the importance of the voice in communicating emotion in general and positive emotion in particular, there is to date no systematic review of what characterizes vocal expressions of different positive emotions. Furthermore, integration and synthesis of current findings are lacking. In this review, we comprehensively review studies (N = 108) investigating acoustic features relating to specific positive emotions in speech prosody and nonverbal vocalizations. We find that happy voices are generally loud with considerable variability in loudness, have high and variable pitch, and are high in the first two formant frequencies. When specific positive emotions are directly compared with each other, pitch mean, loudness mean, and speech rate differ across positive emotions, with patterns mapping onto clusters of emotions, so-called emotion families. For instance, pitch is higher for epistemological emotions (amusement, interest, relief), moderate for savouring emotions (contentment and pleasure), and lower for a prosocial emotion (admiration). Some, but not all, of the differences in acoustic patterns also map on to differences in arousal levels. We end by pointing to limitations in extant work and making concrete proposals for future research on positive emotions in the voice.Entities:
Keywords: Acoustic features; Nonverbal vocalizations; Positive emotions; Speech prosody; Vocal expression
Mesh:
Year: 2020 PMID: 31898261 PMCID: PMC7093353 DOI: 10.3758/s13423-019-01701-x
Source DB: PubMed Journal: Psychon Bull Rev ISSN: 1069-9384
Overview of research on acoustic parameters of positive emotions in the voice
| Study No | Authors and year | Type | Method | Emotion Categories | Speakers | Acoustic measures |
|---|---|---|---|---|---|---|
| 1 | Abelin & Allwood ( | SP | A | 1 male—nonprof. actor | ||
| 2 | Al-Watban ( | SP | A | 8 male—prof. actors | ||
| 3 | Anikin & Lima ( | NV | A, SN | 10 to 22 mixed male and female—taken from seven published corpora | ||
| 4 | Anikin & Persson ( | NV | SN | 25 to 48 mixed male female or child—taken from YouTube videos | Energy, | |
| 5 | Aubergé et al. ( | SP | A, SN | “Some” prof. actors, “some” nonprof. actors | ||
| 6 | Aubergé & Cathiard ( | NV | A, SN | 3 prof. actors & 1 nonprof. actor | ||
| 7 | Audibert et al. ( | SP | ST | 1 male prof. actor | ||
| 8 | Baldwin ( | SP | A | 6 male 6 female (3 prof. actors, 3 nonprof. actors for each gender) | Int., SR | |
| 9 | Banse & Scherer ( | SP | A | 6 male 6 female—prof. actors | ||
| 10 | Bänziger & Scherer ( | SP | A | 4 male 5 female-prof. actors | ||
| 11 | Banziger et al. ( | NV, SP | A | 4 male 5 female & 5 male 5 female—prof. actor—taken from two corpora | ||
| 12 | Baroni et al. ( | SP | A | 3 singers and 3 prof. actors | ||
| 13 | Baroni & Finarelli ( | SP | A | 3 singers and 3 prof. actors | Int, SR | |
| 14 | Barrett & Paus ( | SP | I | 63 speakers—nonprof. actors | ||
| 15 | Belin et al. ( | NV | A | 5 male 5 female—prof. actors and nonprof. actors | ||
| 16 | Belyk & Brown ( | SP | A | Motivational | 10 male 22 female—nonprof. actors | |
| 17 | Braun & Katerbow ( | SP | A | 3 female 3 male—prof. actors | ||
| 18 | Breitenstein et al. ( | SP | A | 1 female—prof. actor | ||
| 19 | Burkhardt & Sendlmeier ( | SP | ST | 5 male 5 female | ||
| 20 | Cahn ( | SP | ST | - | ||
| 21 | Carlson et al. ( | SP | ST | 2 speakers | ||
| 22 | Chronaki et al. ( | NV | A | Taken from Maurege corpus, actors | Int | |
| 23 | Corbeil et al., ( | SP, NV | I | 1 female | ||
| 24 | Costanzo et al. ( | SP | A | 12 male 11 female—nonprof. actors | ||
| 25 | Cowie & Douglas-Cowie ( | SP | A | 40 volunteers, nonprof. actors | ||
| 26 | Dai et al. ( | SP | A | 3 male 5 female—prof. actors | Various features | |
| 27 | Davitz ( | SP | A | 4 male 4 female speakers | ||
| 28 | Davitz ( | SP | A | 38 female, 23 male nonprof. actors | ||
| 29 | Erickson et al. ( | SP | A | 1 male 1 female—nonprof. actors | ||
| 30 | Fónagy ( | SP | A | 1 female—prof. actors | ||
| 31 | Friend & Farrar ( | SP | A | 1 female | ||
| 32 | Gårding & Abramson ( | SP | A | 5 speakers | ||
| 33 | Gérard & Clement ( | SP | A | 12 (6 children 3 adult)—nonprof. actors | SR, | |
| 34 | Gobl & Chasaide ( | SP | ST | - | Int, Jitter, Formants, Spectr, Glottal Waveform | |
| 35 | Goudbeek & Scherer ( | SP | A | 5 male 5 female—prof. actors | ||
| 36 | Hammerschimidt & Jürgens ( | SP | A | 11 male 12 female—prof. actors | ||
| 37 | Higuchi et al. ( | SP | A, SN | 1 male—prof. actor | ||
| 38 | Hirose et al. ( | SP | A | 2 semi-prof. actors | ||
| 39 | House ( | SP | I | Taken from Gârding 1986 | ||
| 40 | Huttar ( | SP | SN | 1 male—nonprof. actors | ||
| 41 | Iida et al. ( | SP | ST | Male and female—nonprof. actors | ||
| 42 | Iliou & Anagnostopoulos ( | SP | A | 5 male 5 female—prof. actors | ||
| 43 | Iriondo et al. ( | SP | ST | 4 male 4 female actors | ||
| 44 | Jiang et al. ( | SP | A | 2 male—lay actors | ||
| 45 | Jiang & Pell ( | SP, NV | A | 3 male 3 female—lay experience in acting or public speaking | ||
| 46 | Jo et al. ( | SP | ST | 1 speaker | SR, | |
| 47 | Johnstone & Scherer ( | SP | I | 36 male—nonprof. actors | ||
| 48 | Juslin et al. ( | SP | A, SN | Samples from 23 sources | 88 features from GeMAPs | |
| 49 | Juslin & Laukka ( | SP | A | 4 male 4 female—7 prof. actors, 1 semiprof. actor | ||
| 50 | Jürgens et al. ( | SP | A, SN | 21 male 21 female—31 prof. actors, 10 drama students, 1 prof. singer | ||
| 51 | Jürgens et al. ( | SP | A, SN | 21 male 21 female—30 prof. actors, 11 acting students & 19 male 12 female—nonprof. actors | ||
| 52 | Kaiser ( | SP | A | 2 male 2 female subjects | SR, | |
| 53 | Kao & Lee ( | SP | A | 2 male 2 female—drama students | ||
| 54 | Kienast & Sendlmeier ( | SP | A | 3 male 3 female actors | Spectr, formants, Art. | |
| 55 | Laukka et al. ( | SP | A | 4 male 4 female—7 prof. actors, 1 semi-prof actor | ||
| 56 | Laukka et al. ( | SP | A | 10 male 10 female from each of 5 cultures—prof. actors | Parameters included in the GeMAPs | |
| 57 | Laukkanen et al. ( | SP | A | 1 male 2 female—prof. actors | ||
| 58 | Laukkanen et al. ( | SP | A | 2 male 1 female | Glottal waveform, formants | |
| 59 | Leinonen et al. ( | SP | A | 8 male 8 female—nonprof. actors | SR, | |
| 60 | Levitt ( | SP | A | 25 male 25 female—nonprof. actors | ||
| 61 | Lieberman & Michaels ( | SP | ST | 6 male | ||
| 62 | Lima et al. ( | NV | A | 2 male 2 female—nonprof. actors | ||
| 63 | Lima et al. ( | NV | A | 4 male 4 female speakers from 2 different sources | ||
| 64 | Liscombe et al. ( | SP | A | 2 male 2 female—prof. actors | ||
| 65 | Luengo & Navas ( | SP | A | 1 female—prof. | ||
| 66 | Moriyama & Ozawa ( | SP | A | 1 male actor | SR, | |
| 67 | Mozziconacci ( | SP | A | 2 male 2 female | SR, | |
| 68 | Nagasaki & Komatsu ( | SP | A | 1 male | ||
| 69 | Paeschke et al. ( | SP | A | 7 speakers | ||
| 70 | Paeschke & Sendlmeier ( | SP | A | 5 male 4 female—prof. actors | ||
| 71 | Pajupuu et al. ( | SP | A | 1 female | ||
| 72 | Patel et al. ( | SP | A | 5 male 5 female—prof. actors | CQ, H1-H2, Leq, Shimmer, HNR, Jitter, Pulse Amp, | |
| 73 | Pell ( | SP | A | 5 male 5 female—nonprof. actors | SR, | |
| 74 | Pell et al. ( | SP | A | 2 male 2 female from each of 4 languages—nonprof. actors | SR, | |
| 75 | Pell et al. ( | NV, SP | I | 4 male 6 female & 5 female 5 male speakers | ||
| 76 | Pereira & Watson ( | SP | A | 1 male 1 female actors | ||
| 77 | Petrushin ( | SP | A | 30 nonprof. actors | ||
| 78 | Pollerman & Archinard ( | SP | A | 30 male 6 female—nonprof. actors | ||
| 79 | Rao et al. ( | SP | A | 5 male 5 female—prof. actors | SR, | |
| 80 | Sauter et al. ( | NV | A | 2 male 2 female—nonprof. actors | ||
| 81 | Scherer ( | SP | ST | - | SR, | |
| 82 | Scherer ( | SP | A, I | 83 male nonprof. actors | energy, | |
| 83 | Scherer et al. ( | SP | A | 2 male 2 female—prof. actors | ||
| 84 | Scherer et al. ( | SP, NV | A | 5 male 5 female—prof. actors | Tempo, SIL, Energy prop., Hammarberg Index, Spectral flatness, HNR, Jitter, Shimmer | |
| 85 | Scherer & Oshinsky ( | SP | ST | - | SR, | |
| 86 | Seppänen et al. ( | SP | A | 8 male 6 female—prof. actors | ||
| 87 | Skinner ( | SP | I | 9 male 10 female—prof. actors | ||
| 88 | Sobin & Alpert ( | SP | A | 31 female | SR, pauses, | |
| 89 | Soderstrom et al. ( | NV | A | 1 male & 2 male 2 female speakers | SR, Amplitude | |
| 90 | Stibbard ( | SP | SN | Samples taken from EISP data | Various features | |
| 91 | Szameitat et al. ( | NV | A | 3 male 5 female -prof. actors | Many | |
| 92 | Sztahó et al. ( | SP | SN | - | ||
| 93 | Tanaka & Campbell ( | NV | SN | 5 male 5 female volunteers, nonprof. actors | ||
| 94 | Thompson & Balkwill ( | SP | A | Volunteers, nonprof. actors | ||
| 95 | Tischer ( | SP | A | 2 male 2 female—prof. actors | SR, Pauses, | |
| 96 | Toivanen et al. ( | SP | A | 5 male 4 female, prof. actors | Jitter, shimmer, and SR | |
| 97 | Trainor et al. ( | SP | A | 23 female—nonprof. actors | SR, | |
| 98 | Trouvain & Barry ( | SP | SN | 3 male speakers | ||
| 99 | van Bezooijen ( | SP | A | 4 male 4 female—nonprof. actors | SR, | |
| 100 | Viscovich et al. ( | SP | A | 10 male, 9 female—nonprof. actors | ||
| 101 | Waaramaa et al. ( | SP | A | 5 male 4 female—prof. actors | ||
| 102 | Wallbott & Scherer ( | SP | A | 3 male 3 female—prof. actors | SR, | |
| 103 | Wang et al. ( | SP | A | Male female | ||
| 104 | Whiteside ( | SP | A | 1 male 1 female—nonprof. actor and prof. actor | ||
| 105 | Whiteside ( | SP | A | 1 male 1 female—nonprof. actor and prof. actor | ||
| 106 | Yildirim et al. ( | SP | A | 1 female—prof. actor | ||
| 107 | Yuan et al. ( | SP | A | 9 speakers | ||
| 108 | Zhang ( | SP | SN | 16 male 37 female speakers | SR, |
Note. Positive emotion categories as used by the authors are marked in boldface. SP = speech prosody; NV = nonverbal vocalizations; A =acted; SN = spontaneous; I = Induced; ST = synthesized; fo = fundamental frequency; Int = voice intensity; SR = speech rate; Dur = duration; nonprof = nonprofessional actors; prof = professional actors
Fig. 1Different positive emotion terms used in research on acoustic features of positive emotions in the voice. Emotion categories are only linked if the material used for elicitation of two emotion categories was the same, or if the authors explicitly stated that the two categories were the same. For instance, if two studies used the same materials, but labelled them with different terms (e.g., happiness vs amusement), then a connection line was added between those terms. Similarly, if two emotion terms were explicitly treated as equivalent, such as with a slash mark (e.g., achievement/triumph), a parenthesis (e.g., elation [joy]), or used interchangeably in an article, then a connection line was created between the two emotion terms. Larger circles reflect terms used more often in connection with others. Thicker connections reflect more frequent connections
Fig. 2The source-filter framework of vocal production. Left: Spectrogram of a vocalization of the vowel /a/ illustrating fo (fundamental frequency), and the first four formant frequenciesF1, F2, F3, F4. Right: Schema of the approximate locations of the vocal organs involved in the source and filter. Oscillation of the vocal folds in the larynx produces a source sound which determines the fundamental frequency (fo) of the vocalization. Then the sound is filtered through the vocal tract, which determines the formant frequencies (F1–F4)
Common acoustic parameters and their definitions
| Acoustic Parameter | Perceptual Correlate | Definition |
|---|---|---|
| Pitch | Lowest periodic cycle of the acoustic signal | |
| Voice quality | Concentration of acoustic energy around first and second formants | |
| Intensity and amplitude | Loudness | Measures of energy in the acoustic signal |
| Speech rate | Velocity of speech | Number of complete utterances or elements produced per time unit |
| Jitter | Pitch irregularity | Frequency instability of |
| Shimmer | Loudness irregularity | Amplitude instability of |
| Spectral Energy | Timbre | Relative energy in different frequency bands |
| Glottal waveform | Voice quality | The time of airflow between the vocal folds and the time glottis is closed for each vibrational cycle |
| HNR (harmonics to noise ratio) | Voice quality | Mean ratio of quasi-periodic to non-periodic signals across time segments |
Note. Though parameter names may differ from those in the original studies, they correspond to the definitions given
Fig. 3Acoustic features used at least in two separate publications. a Frequently used acoustic parameters involved in comparisons of individual positive emotions in comparison to neutral vocalizations. b Frequently used acoustic parameters involved in comparisons of acoustic features across several positive emotions. The size of each circle refers to the frequency of use of that type of acoustic feature; the thickness of a connection line between two acoustic features represents the frequency of inclusion of these features together in the same study. The larger the size of the circle, the more frequently a given feature has been studied; the thicker the connection line, the more frequently two acoustic features have been studied together
Changes in acoustic parameters of positive emotions compared with neutral vocalizations
| Category | Effect direction | Happiness | Joy | Elation | Interest | Tenderness | Pleasure | Positive surprise | Pride | Relief | Lust | Satisfaction |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| F0 mean | > | 2a,2b,7,14,44a,44b,49a,49b,53,56,69,74a,74b,74c,74d,79a,79b,100a,100b,105b*,104b* | 7,17a,17b,17c,17d,17e,17f,71,99,101a,101b | 104b,105b | 56 | 74a,74b,74c,74d | 56 | 56 | ||||
| < | 104a*,105a* | 104a,105a | 104a,105a | 101a,101b | 56 | |||||||
| F0 variability | > | 2a,2b,14, | 99,101b | 104a,104b,105a,105b | 99,104a,104b,105a,105b | 101b | ||||||
| < | 79b,104b | 101a | 101a | |||||||||
| F0 range | > | 7, | 5,7,17a, 17b,17d, 17e,17f,19,71,99,101b | 99 | 101b | 74a,74b, 74c,74d | 5,7 | |||||
| < | 14,47 | 17c,101a | 101a | |||||||||
| F1 mean | > | 49a,49b, 104a,104b | 101b | 104a,104b | 104a,104b | 101a | ||||||
| < | 101a | 101b | ||||||||||
| F2 mean | > | 104a,104b | 101a,101b | 104a | 104a | 101a101b | ||||||
| < | 104b | 104b | ||||||||||
| Voice intensity mean | > | 2a,2b,8,38,39,49a,49b,53,56,79a,79b,104a, 104b,105a,105b | 104a,104b,105a,105b | 56,104a,105a,105b | 56 | 56 | ||||||
| < | 44a,44b | 71 | 56 | |||||||||
| Voice intensity variability | > | 2,14, | 104b,105a,105b | 105a,105b | ||||||||
| < | 44b,104b | 104a | 104a,104b | |||||||||
| Voice intensity range | > | 14,44b | ||||||||||
| < | 71 | |||||||||||
| Speech rate | > | 14,38,49b,74b,104b | 71,19 | 104b | 99,104b | |||||||
| < | 2a,2b, | 99,101a,101b | 104a | 101a, 101b | ||||||||
| Jitter | > | 47,49a,49b,105a | 105a,105b | 105a,105b | ||||||||
| < | 105b | |||||||||||
| Shimmer | > | 105a | 105a,105b | 105a,105b | ||||||||
| < | 105b | |||||||||||
| HNR | > | 56 | 56 | 56 | 56 | |||||||
| < | 56 |
Note. The numbers correspond to the codes given in Table 1. ‘>’ indicates that parameter value is higher/faster in the positive emotion than neutral condition; ‘<’ indicates that parameter value is lower/slower in the positive emotion than neutral condition. Studies using nonverbal vocalizations are marked in boldface, others used speech prosody. * = used same male or female actor in the same project. 2a = Arabic; 2b = English; 17a = German female; 17b = Japanese female; 17c = English female; 17d = German male; 17e = Japanese male; 17f = English male; 18a = German; 18b = English; 44a = Hindi; 44b = English; 49a = high intensity; 49b = low intensity; 74a = English; 74b = German; 74c = Hindi; 74d = Arabic; 79a = female; 79b = male; 100a = female; 100b = male; 101a = female; 101b = male; 104a = female; 104b = male; 105a = female; 105b = male
Changes in acoustic parameters of positive emotions compared with each other
| Emotions compared | Voice int. ( | Voice int. variability | Speech rate | Jitter | Shimmer | HNR | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Jo | af | >:28 | >:28 | >:28 | ||||||||
| am | >:35 | >:35 | >:35 | <:35 | ||||||||
| ch | >:28 | >:28 | >:28 | |||||||||
| in | >:35 | >:35, 99 | >:99 | >:35 | >:35 | >:35, 99 | ||||||
| <:99 | <:35 | |||||||||||
| pl | >:35 | >:35 | >:35 | >:35 | >:35 | |||||||
| <:35 | ||||||||||||
| pr | >:35 | >:35 | >:35 | <:35 | <:35 | >:35 | ||||||
| <:35 | ||||||||||||
| re | >:35*, 72* | >:35 | >:35 | >: 72 | ||||||||
| <:35 | <:35 | <:72 | <:35,72 | |||||||||
| sa | >:28 | >:28 | >:28 | |||||||||
| <:7 | <:7 | |||||||||||
| sc | ||||||||||||
| <: | <: | |||||||||||
| ti | ||||||||||||
| <: | <:91a | |||||||||||
| te | >:101a,101b | >:101a | >:101a | >:101a | >:101a,101b | |||||||
| <:101b | <:101b | <:101b | <:101a,101b | |||||||||
| Ha | el | >:104a,105a, 105b | >:81,104a, 105b | >:104b | >:104a,105a | >:81,104a, 105a | >:104b | |||||
| <:9,104b | <:9,104b,105a | <:104a,104b | <:104a | <:9,104b,105b | <:104b,105b | <:9,104a | <:105a,105b | <:105a, 105b | ||||
| in | >:56,104a,105a, 105b | >:9,81,104a, 105a | >:104a | >:104b | >:56,104a, 105a,105b | >:81,104a, 105a,105b | >:9,104b | |||||
| <:9,104b | <:104b,105b | <:104 | <:104b | <:104a | <:8,104b | <:104b | <104a | <:105a,105b | <:105a, 105b | <:56 | ||
| lu | >:56 | >:56 | >:56 | |||||||||
| pl | ||||||||||||
| pr | >:56 | >:9 | >:56 | <:9 | >:56 | |||||||
| <:9 | <:9 | |||||||||||
| re | >:56 | >:56 | >:56 | |||||||||
| su | <:74a,74b,74c, 74d | <:74a,74b, 74c,74d | >:74b | |||||||||
| <:74a,74c,74d | ||||||||||||
| In | am | <:35 | <:35 | <:35 | <:35 | <:35 | <:35 | |||||
| el | >:104b,105a | >:81,104b, 105b | <:104a,104b | <:104a,104b | >:104b | <:81,104a, 104b,105a, 105b | >:104a,104b | >:105b | ||||
| <:9,104a,105b | <:9,104a, 105a | <:9,104a, 105a,105b | <:9,105a,105b | <:105a,105b | <:105a | |||||||
| lu | >:56 | >:56 | >:56 | |||||||||
| pl | >:35 | >:35 | >:35 | >:35 | <:35 | |||||||
| pr | >:9,56 | >:9,35 | >:9,56 | >:35 | ||||||||
| <:35 | <:35 | <:9,35 | <:35 | >:56 | ||||||||
| re | >:35,56 | >:56 | >:56 | |||||||||
| <:35 | <:35 | <:35 | <:35 | <:35 | ||||||||
| Pl | ac | >: | >: | >: | ||||||||
| <: | <: | <: | <: | <: | <: | <: | ||||||
| am | >: | >: | >: | >:3b, | >:35 | >: | ||||||
| <:35, | <:3a,3b,35, | <: | <:35 | <:35, | <:3a,35, | |||||||
| co | >: | <: | <: | >: | >: | |||||||
| pr | >:35 | >:35 | >:35 | >: | ||||||||
| <:35 | <:35 | <:35 | <:35 | |||||||||
| re | >: | >: | >: | >: | <:35, | >:35 | >: | |||||
| <:35, | <:35, | <: | <:35 | <:35, | ||||||||
| Re | ac | >: | >: | >: | >: | |||||||
| <: | <: | <: | <: | <: | <: | |||||||
| am | >: | >: | >: | >:35, | >:35, | |||||||
| <:35, | <:35, | <: | <:35, | <: | ||||||||
| co | >: | <: | <: | >: | <: | >: | ||||||
| lu | >:56 | >:56 | >:56 | |||||||||
| pr | >:35 | >:35 | >:35 | >:56 | ||||||||
| <:35,56 | <:35,56 | <:35 | ||||||||||
| Am | ac | >: | >: | >: | >: | |||||||
| <: | <: | <: | <: | <: | <: | |||||||
| co | >: | >: | ||||||||||
| <: | <: | <: | <: | |||||||||
| el | ||||||||||||
| <:9 | <:9 | <:9 | <:9 | |||||||||
| pr | >:35 | >:35 | >:35 | >:35 | >:35 | |||||||
| <:35 | <:35 | |||||||||||
| Pr | af | >:28,36b | >:36a,36b | >:28,36b | >:36b | >:28,36a,36b | >:36b | |||||
| <:36a | <:36a | <:36a | ||||||||||
| lu | >:56 | >:56 | >:56 | |||||||||
| Sa | ch | |||||||||||
| <:28 | <:28 | <:28 | ||||||||||
| jo su | >:36a; <:36b | >:36a, 36b | ||||||||||
| <:36a,36b | <:36a,36b | <:36a,36b | <:36a, 36b | |||||||||
| Af | ch | <:28 | <:28 | <:28 | ||||||||
| jo su | >:36a,36b | >:36a, 36b | ||||||||||
| <:36a,36b | <:36a,36b | <:36a,36b | <:36a,36b | |||||||||
| Co | ac | >: | >: | >: | ||||||||
| <: | <: | |||||||||||
| ad | >:59 | >:59 | >:59 | |||||||||
Note. The numbers correspond to the codes given in Table 1. ‘>’ indicates that parameter value is higher/faster in the positive emotion listed on the left than the right; ‘<’ indicates that parameter value is lower/slower in the positive emotion listed on the left than the right. Studies using nonverbal vocalizations are marked in boldface, others used speech prosody. * = used partially same stimuli with another study; ac = achievement; ad = admiration; af = affection; am = amusement; ch = cheerfulness; co. = content; el = elation; ha = happiness; in = interest; jo su = joyful surprise; lu = lust; pl = pleasure; re = relief; ti = tickling; sc = schadenfreude; sa = satisfaction; su = positive surprise; HNR = Harmonics to Noise Ratio; 3a = spontaneous; 3b = acted; 36a = female; 36b = male; 74a = English; 74b = German; 74c = Hindi; 74d = Arabic; 91a = female; 91b = male; 101a = female; 101b = male; 104a = female; 104b = male; 105a = female; 105b = male
Listeners’ recognition rates (%)
| Study No | Happiness | Joy | Elation | Interest | Amusement | Pleasure | Positive surprise | Pride | Relief | Lust | Achievement | Others |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2 | 73(Arabic), 71(English) | |||||||||||
| 9 | 52 | 38 | 75 | 43 | ||||||||
| 60 | 59 | |||||||||||
| 17 | 40*(male) 55*(female) | |||||||||||
| 19 | 61.9** | 81** | ||||||||||
| 36 | 43<(accuracy)<87 | 43<(af), (en), (sa)<87 | ||||||||||
| 39 | 87 | |||||||||||
| 44 | 66(English), 70(Hindi) | |||||||||||
| 49 | 51 | |||||||||||
| 53 | 92.3** | |||||||||||
| 56 | 56 | 43 | 18 | 35 | 48 | |||||||
| 59 | 34(ad), 72(as), 26(co) | |||||||||||
| 95.9 | 85.9 | 86.3 | 77.7 | |||||||||
| 91 | 86.5 | 89 | 80 | |||||||||
| 69 | (accuracy)>80 | |||||||||||
| 71 | 75.4 | |||||||||||
| 74 | 79.6(English) 59.6(German) 67.1(Hindi) 59.9(Arabic) | 71.5(English) 68.8(German) 57.9(Hindi) 50.4(Arabic) | ||||||||||
| 79.5 | 65 | 86 | 77 | 46(co) | ||||||||
| 44 | 37(sc), 45(ti) | |||||||||||
| 99 | 72 | 63 | ||||||||||
| Mean NV | 60 | 44 | 88.8 | 74.1 | 87.1 | 78.2 | 46(co), 37(sc), 45(ti) | |||||
| Mean SP | >67.6 | 64.7 | 38 | 60.3 | 62.1 | 30.5 | 35 | 48 | 34(ad), 72(as), 26(co) | |||
| Mean Total | >67.1 | 61.2 | 38 | 60.3 | 88.8 | 74.1 | 62.1 | 30.5 | 74.1 | 48 | 78.2 | 34(ad), 72(as), 36(co), 37(sc), 45(ti) |
Note. The numbers correspond to the codes given in Table 1. Studies using nonverbal vocalizations are marked in bold, others used speech prosody. * = Based on stimuli produced by one speaker, full confusion matrix is not reported. ** = After best acoustic feature selection. ad = admiration; af = affection; as = astonishment; co. = content; en = enjoyment; sa = satisfaction; sc = schadenfreude; ti = tickling
Fig. 4* = Only for speech prosody. Emotion families of positive emotions based on pitch, loudness, and speech rate. The larger the circle, the higher the related acoustic feature
Fig. 5Comparison of different ways of eliciting emotional vocalizations in terms of experimental control and ecological validity