| Literature DB >> 34229494 |
Andrey Anikin1,2, Katarzyna Pisanski2,3, Mathilde Massenet2, David Reby2.
Abstract
A lion's roar, a dog's bark, an angry yell in a pub brawl: what do these vocalizations have in common? They all sound harsh due to nonlinear vocal phenomena (NLP)-deviations from regular voice production, hypothesized to lower perceived voice pitch and thereby exaggerate the apparent body size of the vocalizer. To test this yet uncorroborated hypothesis, we synthesized human nonverbal vocalizations, such as roars, groans and screams, with and without NLP (amplitude modulation, subharmonics and chaos). We then measured their effects on nearly 700 listeners' perceptions of three psychoacoustic (pitch, timbre, roughness) and three ecological (body size, formidability, aggression) characteristics. In an explicit rating task, all NLP lowered perceived voice pitch, increased voice darkness and roughness, and caused vocalizers to sound larger, more formidable and more aggressive. Key results were replicated in an implicit associations test, suggesting that the 'harsh is large' bias will arise in ecologically relevant confrontational contexts that involve a rapid, and largely implicit, evaluation of the opponent's size. In sum, nonlinearities in human vocalizations can flexibly communicate both formidability and intention to attack, suggesting they are not a mere byproduct of loud vocalizing, but rather an informative acoustic signal well suited for intimidating potential opponents.Entities:
Keywords: acoustic communication; body size; nonlinear vocal phenomena; pitch; roughness; voice
Mesh:
Year: 2021 PMID: 34229494 PMCID: PMC8261225 DOI: 10.1098/rspb.2021.0872
Source DB: PubMed Journal: Proc Biol Sci ISSN: 0962-8452 Impact factor: 5.349
Figure 1Spectrograms of a prototype vocalization (female cry, file ‘cry_F_475’) resynthesized with different nonlinear vocal phenomena (NLP). Amplitude modulation is created by multiplying the signal by a relatively low-frequency waveform (M ± s.d. = 90 ± 20 Hz); subharmonics are synthesized as a new, harmonically related voiced component at 1/2, 1/3 or 1/4 of the original fundamental frequency; chaos is emulated by adding strong jitter—rapid random pitch changes (audio and code for creating the stimul on http://cogsci.se/publications.html).
Figure 2The effect of manipulating nonlinear phenomena (NLP) on listeners' ratings of vocalizations on three psychoacoustic scales (top) and three ecological scales (bottom). Solid circles show the median of posterior distribution of the difference between each condition and the no-NLP condition, with 95% CI. Violin plots show the distribution of fitted values for each prototype vocalization (N = 82). The dotted line in each panel marks null effect (no difference between sounds with and without NLP).
Figure 3(a) The effect of manipulating nonlinear phenomena (NLP) on perceived body size in the two-alternative forced choice task: fitted values from ordinal logistic regression. Markers indicate the probability of perceiving the first vocalization (e.g. in the top example, a vocalization with chaos) as larger than the second vocalization (e.g. a vocalization without any NLP, ‘none’) minus the probability of perceiving it as smaller (median of posterior distribution and 95% CI), ignoring ties. If the point lies to the right of the dotted line, it means that the first listed NLP condition was associated with larger size; 100% = all listeners always chose the first vocalization. The greyed-out markers have CIs that fail to clear the region of practical equivalence corresponding to the effect size for catch pairs of perfectly identical sounds. (b) Implicit associations tests showing the increase in errors (odds ratio, left panel) and response times (ms, right panel) in blocks with incongruent versus congruent combinations, where a congruent combination is defined as pairing a sound with NLP with the image of a large person, and a sound without NLP with the image of a small person. The solid points show population-level effects (medians of posterior distribution and 95% CI, greyed out if the CI fails to clear the null value), and the violin plots show the distribution of fitted values across participants.