| Literature DB >> 27910886 |
Pawel Fedurek1,2,3, Klaus Zuberbühler1,2,4, Christoph D Dahl1.
Abstract
Birdsong is a prime example of acoustically sophisticated vocal behaviour, but its complexity has evolved mainly through sexual selection to attract mates and repel sexual rivals. In contrast, non-human primate calls often mediate complex social interactions, but are generally regarded as acoustically simple. Here, we examine arguably the most complex call in great ape vocal communication, the chimpanzee (Pan troglodytes schweinfurthii) 'pant hoot'. This signal consists of four acoustically distinct phases: introduction, build-up, climax and let-down. We applied state-of-the-art Support Vector Machines (SVM) methodology to pant hoots produced by wild male chimpanzees of Budongo Forest, Uganda. We found that caller identity was apparent in all four phases, but most strongly in the low-amplitude introduction and high-amplitude climax phases. Age was mainly correlated with the low-amplitude introduction and build-up phases, dominance rank (i.e. social status) with the high-amplitude climax phase, and context (reflecting activity of the caller) with the low-amplitude let-down phase. We conclude that the complex acoustic structure of chimpanzee pant hoots is linked to a range of socially relevant information in the different phases of the call, reflecting the complex nature of chimpanzee social lives.Entities:
Mesh:
Year: 2016 PMID: 27910886 PMCID: PMC5133612 DOI: 10.1038/srep38226
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Four phases of pant-hoot and feature extraction.
(A) An exemplar call sequence of a pant-hoot. The four phases of a pant-hoot exemplar are color-coded below the spectrogram: red: Introduction; green: Build-up; blue: Climax; magenta: Let-down. The x-axis shows the time in seconds; the y-axis shows the Frequency [kHz]. (B) Length of phases. The lengths of the four phases are shown on the y-axis of the boxplot. The color-coding is in correspondence with A. The boxes indicate the upper and lower quartiles and the median (red solid line). The whiskers indicate the highest and lowest values of 95% of the data samples. Outliers are indicated by red x-markers. (C) Number of recorded events. The number of recordings for each phase are presented in percentage and number count. Colors of pie chart patches correspond with the colors in (A,B). (D) Events across individuals. The number of recordings for each phase is shown across individuals. The y-axis shows the number count. Colors and markers as in (B). (E) Social status and age. Social status and age are shown across individuals. The y-axis has double scaling properties: Elo-ratings for social status and years for age. Classes as used for the classifier are indicated by labels like ‘C1’, ‘C2’, etc., or ‘Class 1’, etc. (F) Mel-frequency cepstral coefficients (MFCCs). MFCCs are shown as feature vectors (columns) for each call segment (rows). The x-axis has been arbitrarily limited to 200 samples for better illustration. (G) Delta features. Delta features are shown in the same way as the MFCCs in C.
Figure 2Classification performance.
(A–D) The raw values of classification performance for each feature length and phase (dots in color-code; red: Introduction; green: Build-up; blue: Climax; magenta: Let-down). The x-axis is the feature/window length; the y-axis is the classification performance as percentage correct classification. (E–H) The number of occurrences in the range of 75 to 95 percentage correct classification. The x-axis shows the number of occurrences. (I) Prominence. The effect sizes are illustrated as filled circles with radii corresponding to the effect sizes. (J) Modality specificity. The relative contribution of each modality (caller’s attributions) in a particular phase is shown as filled circles with respective radii. (K) Phase specificity. The relative contribution of each phase to a particular modality (caller’s attributions) is shown as filled circles with respective radii. (L) Window length. Classification performances as in (A–D) were binned into five classes to illustrate the relative importance of window lengths for classification. The x-axis shows the phases at the global level and the time of the window bins on the local level. The y-axis shows the modality at the global level and the average performance scores at the local level. (M) Time dependency. The five points in the time course of a call sequence are highlighted with downward pointing arrowheads. The x-axis illustrates the time in seconds aligned to the onset of each individual phase. The y-axis shows the modalities. (N) Information flow. We illustrate the progression of information over time. The x-axis shows the time in seconds of a whole pant hoot. The y-axis is restricted to the spectrogram example and represents the frequency (kHz).