| Literature DB >> 32257325 |
Katarzyna Pisanski1, Jordan Raine2, David Reby1,2.
Abstract
Fundamental frequency (F0, perceived as voice pitch) predicts sex and age, hormonal status, mating success and a range of social traits, and thus functions as an important biosocial marker in modal speech. Yet, the role of F0 in human nonverbal vocalizations remains unclear, and given considerable variability in F0 across call types, it is not known whether F0 cues to vocalizer attributes are shared across speech and nonverbal vocalizations. Here, using a corpus of vocal sounds from 51 men and women, we examined whether individual differences in F0 are retained across neutral speech, valenced speech and nonverbal vocalizations (screams, roars and pain cries). Acoustic analyses revealed substantial variability in F0 across vocal types, with mean F0 increasing as much as 10-fold in screams compared to speech in the same individual. Despite these extreme pitch differences, sexual dimorphism was preserved within call types and, critically, inter-individual differences in F0 correlated across vocal types (r = 0.36-0.80) with stronger relationships between vocal types of the same valence (e.g. 38% of the variance in roar F0 was predicted by aggressive speech F0). Our results indicate that biologically and socially relevant indexical cues in the human voice are preserved in simulated valenced speech and vocalizations, including vocalizations characterized by extreme F0 modulation, suggesting that voice pitch may function as a reliable individual and biosocial marker across disparate communication contexts.Entities:
Keywords: communication; emotion; fundamental frequency; nonverbal vocalization; speech; vocal
Year: 2020 PMID: 32257325 PMCID: PMC7062086 DOI: 10.1098/rsos.191642
Source DB: PubMed Journal: R Soc Open Sci ISSN: 2054-5703 Impact factor: 2.963
Figure 1.Waveforms and spectrograms representing each of eight vocal types produced by a single representative individual (female, aged 22), demonstrating the high degree of intra-individual variability in F0 across vocal types.
Figure 2.Violin plots representing the full distribution in mean F0 for each sex and each vocal type. Mean F0 values are given in both Hertz and ERBs above the violin plot for each sex and each vocal type. Significant sex differences were observed for each vocal type (***p < 0.001 following Šidák correction). Plots were produced in R ggplot 2 package.
Correlation matrix of F0 (ERB), pairwise comparisons across all vocal types. Results for men's voices are given in the top right portion of the table (significant relationships are highlighted in blue), and results for women's voices are given in the bottom left portion of the table (significant relationships are highlighted in beige). See electronic supplementary material, table S3 for exact p-values and electronic supplementary material, table S4 for analogous comparisons on Hz scale. See electronic supplementary material, figure S1 for corresponding scatterplot matrices.
| statistic | modal speech | aggress speech | fear speech | aggress roar | fear scream | pain mild | pain mod | pain intense | |
|---|---|---|---|---|---|---|---|---|---|
| modal speech | −0.06 | 0.36* | 0.16 | 0.15 | 0.80** | 0.31 | 0.39* | ||
| 95% CI | −0.41, 0.30 | −0.07, 0.68 | −0.24, 0.50 | −0.29, 0.53 | 0.63, 0.89 | −0.12, 0.65 | 0.01, 0.69 | ||
| aggress speech | 0.09 | 0.55** | 0.43* | 0.26 | 0.11 | 0.08 | 0.15 | ||
| 95% CI | −0.33, 0.47 | 0.15, 0.77 | 0.04, 0.75 | −0.13, 0.53 | −0.30, 0.48 | −0.28, 0.45 | −0.22, 0.48 | ||
| fear speech | 0.04 | 0.46* | 0.41* | 0.54** | 0.50** | 0.12 | 0.26 | ||
| 95% CI | −0.47, 0.51 | 0.09, 0.75 | 0.03, 0.68 | 0.16, 0.77 | 0.10, 0.75 | −0.23, 0.45 | −0.12, 0.58 | ||
| aggress roar | 0.23 | 0.62** | 0.32 | 0.26 | 0.08 | 0.22 | 0.53** | ||
| 95% CI | −0.19, 0.58 | 0.30, 0.83 | −0.06, 0.64 | −0.14, 0.58 | −0.29, 0.45 | −0.18, 0.56 | 0.23, 0.74 | ||
| fear scream | −0.08 | 0.30 | 0.45* | 0.30 | 0.26 | 0.28 | 0.25 | ||
| 95% CI | −0.51, 0.38 | −0.14, 0.71 | 0.10, 0.71 | −0.13, 0.63 | −0.18, 0.61 | −0.14, 0.65 | −0.13, 0.57 | ||
| pain mild | 0.15 | −0.17 | −0.13 | −0.01 | 0.55** | 0.39* | 0.28 | ||
| 95% CI | −0.28, 0.48 | −0.59, 0.31 | −0.56, 0.29 | −0.44, 0.47 | 0.20, 0.80 | −0.03, 0.67 | −0.12, 0.62 | ||
| pain mod | −0.07 | −0.12 | 0.07 | −0.17 | 0.25 | 0.43* | 0.30 | ||
| 95% CI | −0.49, 0.36 | −0.51, 0.30 | −0.40, 0.49 | −0.60, 0.30 | −0.13, 0.54 | −0.01, 0.74 | −0.11, 0.67 | ||
| pain intense | −0.06 | 0.18 | 0.12 | 0.03 | 0.64** | 0.44* | 0.38× | ||
| 95% CI | −0.45, 0.35 | −0.25, 0.66 | −0.36, 0.57 | −0.42, 0.51 | 0.34, 0.81 | −0.03, 0.80 | −0.05, 0.70 |
Notes: Spearman's rho (rs) correlation coefficients are followed by lower and upper 95% confidence intervals (bootstrapping, 1000 samples). Highlighted cells show significant relationships where **p < 0.01, *p < 0.05. A single relationship (marked ‘×’) did not pass Benjamini–Hochberg correction (unadjusted p = 0.04).