Roberta Salmi1, Kurt Hammerschmidt2, Diane M Doran-Sheehy3. 1. Department of Anthropology, University of Georgia, Athens, Georgia, United States of America; Interdepartmental Doctoral Program in Anthropological Sciences, Stony Brook University, Stony Brook, New York, United States of America. 2. Cognitive Ethology Laboratory, German Primate Center, Göttingen, Germany. 3. Department of Anthropology, Stony Brook University, Stony Brook, New York, United States of America.
Abstract
Individually distinct vocalizations play an important role in animal communication, allowing call recipients to respond differentially based on caller identity. However, which of the many calls in a species' repertoire should have more acoustic variability and be more recognizable is less apparent. One proposed hypothesis is that calls used over long distances should be more distinct because visual cues are not available to identify the caller. An alternative hypothesis proposes that close calls should be more recognizable because of their importance in social interactions. To examine which hypothesis garners more support, the acoustic variation and individual distinctiveness of eight call types of six wild western gorilla (Gorilla gorilla) females were investigated. Acoustic recordings of gorilla calls were collected at the Mondika Research Center (Republic of Congo). Acoustic variability was high in all gorilla calls. Similar high inter-individual variation and potential for identity coding (PIC) was found for all call types. Discriminant function analyses confirmed that all call types were individually distinct (although for call types with lowest sample size - hum, grumble and scream - this result cannot be generalized), suggesting that neither the distance at which communication occurs nor the call social function alone can explain the evolution of identity signaling in western gorilla communication.
Individually distinct vocalizations play an important role in animal communication, allowing call recipients to respond differentially based on caller identity. However, which of the many calls in a species' repertoire should have more acoustic variability and be more recognizable is less apparent. One proposed hypothesis is that calls used over long distances should be more distinct because visual cues are not available to identify the caller. An alternative hypothesis proposes that close calls should be more recognizable because of their importance in social interactions. To examine which hypothesis garners more support, the acoustic variation and individual distinctiveness of eight call types of six wild western gorilla (Gorilla gorilla) females were investigated. Acoustic recordings of gorilla calls were collected at the Mondika Research Center (Republic of Congo). Acoustic variability was high in all gorilla calls. Similar high inter-individual variation and potential for identity coding (PIC) was found for all call types. Discriminant function analyses confirmed that all call types were individually distinct (although for call types with lowest sample size - hum, grumble and scream - this result cannot be generalized), suggesting that neither the distance at which communication occurs nor the call social function alone can explain the evolution of identity signaling in westerngorilla communication.
Advances in the study of animal vocal communication have shown that many species display higher inter- and intra-individual acoustic variability in their vocal signals than previously thought (e.g., birds: [1]–[3]; bats: [4]–[6]; ungulates: [7], [8]; carnivores: [9]–[11]; primates: [12]–[16]). Although it is less clear whether these subtle acoustic differences are always meaningful for the animals, we know that vocal signals have the potential to carry different kinds of information, which include: (a) diverse external events (i.e. predators, food) and behavioral contexts experienced by the callers (birds: [3]; mammals: [17]; primates: [18]); (b) caller internal state (birds: [19]; elephants: [20]; tree shrew: [21]; hyenas: [9]; marmots: [22]; primates: [23], [24]); and (c) caller identity (birds: [25]; koalas: [26]; pandas: [27]; deer: [8]; hyraxes: [28]; hyenas: [9]; primates: [29]–[31]).Individual differences in vocal signals could reflect both physical and social characteristics. Physical characteristics are those related to morphological differences including age, sex, body size and vocal tract morphology (goats: [32]; hyenas: [9], [33]; deer: [7]; primates: [34]–[37]), whereas social characteristics are those reflecting specific social categories including rank, kinship, and even group membership (birds: [38], [39]; seals: [40]; dolphins: [41], [42]; bats: [5], [43]; primates: [44]–[47]).Individually distinct vocalizations play an important role in animal communication, since they may reduce the uncertainty of the external world experienced by listeners, allowing them to respond differentially based on caller identity, age, sex, social rank, and group membership. However, there are many questions not yet fully answered regarding animal identity signaling [48]–[50]. First, is inter- and intra- acoustic variability similar across all calls? Second, are all calls in a species' vocal repertoire individually distinct or has selection for individual uniqueness been stronger on some calls than on others? In the latter case, which of the many calls in a species' repertoire should be more individually recognizable? Third, what acoustic characteristics are responsible for generating distinctive voice cues, and are those characteristics the same across call types? Although systematic study of individual distinctiveness and acoustic variability across the species' complete vocal repertoire is needed to understand the adaptive functions they play in the animal's communication systems, few attempts have been made thus far (skuas: [51]; primates: [12], [52], [53]). Instead, the majority of studies have focused on the analysis of only one vocalization at a time (e.g., [14], [15], [54]) or at maximum two (e.g., chimpanzees: [55], [56]; baboons: [50], [57]; putty-nosed monkeys: [58]).Four non-mutually exclusive hypotheses, based on the distance at which calls are given [59], the social [60], [61] or spatial context [62] in which they are produced, and the direct effects to the nervous system they can elicit in receivers [63], [64], have been proposed to explain the evolution of individually distinct vocal signals and their variation across a species' vocal repertoire. Among non-human primates studies have yielded differing results, supporting two of the hypotheses proposed: the distance communication hypothesis and the social context hypothesis. Under the distance communication hypothesis, calls given or exchanged over long distance are expected to be more distinct among individuals than those given at close distance, since no other cues could be simultaneously used to enable listeners to recognize the identity of the caller [59]. Support for this hypothesis comes from studies conducted in chimpanzees and mouse lemurs, two primate species in which individuals forage commonly alone or in a fission-fusion system. In both species long-distance calls, which are used when individuals are not in visual contact, were more individually distinct than calls exchanged over close distance [12], [55]. In contrast, under the social context hypothesis, calls used in intragroup social interactions at close distance are expected to be more distinct than louder calls directed to a more generalized audience (e.g. the entire group) [60], although some researchers have suggested that threat calls, despite being exchanged at close distances, might be more stereotypic than other close calls ([61] but see [34], [65]). Support for this hypothesis has been found in primate species that are more spatially cohesive. For instance, among red-capped mangabeys' and female Campbell's monkeys' vocalizations, although all calls are individually distinct, those emitted during social/affiliative interactions showed higher acoustic variation and were more individually distinct than either long or alarm calls [52], [53].Here we test which of these two hypotheses better explains the pattern occurring in the wild westerngorilla's (Gorilla gorilla) vocal repertoire. We explore whether the acoustic variability differs across female gorilla call types. If gorilla calls are acoustically distinct among individuals, we also determine whether some call types are more distinct than others. We then explore which vocal characteristics are responsible for generating distinctive voice cues, and whether they are the same across call types.Western gorillas are an appropriate species for this study since: (a) they inhabit the dense tropical rainforest where vocal signals represent an important communicative channel; (b) they live in cohesive, polygynous groups, and, in contrast with chimpanzees, individuals often interact with each other (R. Salmi pers. obs.), and many of their calls are used for regulating these important social interactions [66], [67]; and finally (c) since group members can be separated by over 700m, all age classes need to communicate with dispersed group members using loud broadcast calls (R. Salmi in revision).The species' vocal repertoire has been described recently and includes 17 call types [68]. Since the male body size is more than double than that of the female (170 vs. 71 kg, [69]) and we have data only from one male, to investigate both acoustic variability and individual distinctiveness we examined only calls among adult females. We considered eight call types (Figure 1), which represent the entire vocal repertoire of female western gorillas, except for the play chuckle, a rare call that is difficult to record in the wild [68]. The call types include: soft threat grunts given during within-group aggression; loud screams given during escalated within-group aggression; soft copulation grunts emitted during sexual behavior; single grunts, double grunts, and grumbles, frequent soft calls given during different contexts; hums, given exclusively during foraging at intermediate distance (hereafter referred to as “food call”); and hoot series, long-distance calls given when individuals are separated ([68]; R. Salmi in revision). We measured the same acoustic parameters in all calls and determined both acoustic variability and individual distinctiveness. If the distance at which communication occurs has shaped the acoustic properties of western gorilla vocalizations, we would expect loud calls, such as long-distance calls, to have both higher acoustic variability and individual distinctiveness, and calls exchanged at close distance to show lower values. However, if the social context better explains the variation across call types, we would expect to find calls exchanged at close distance to display higher levels of acoustic variability and individual distinctiveness; food calls (for their indiscriminate target and soft nature) to exhibit intermediate values; and long-distance calls to display lower ones (see Table 1 for a schematic explanation of our predictions).
Figure 1
Spectrograms of female gorilla call types.
Call types include single grunt (SG), double grunt (DG), grumble (GR), threat grunt (TG), copulation grunt (CG), hum (HM), hoot series (HT), and scream (SC). The call type is equivalent to the call segment, except for TG, CG HT, in which case the call segment is indicated by a bracket.
Table 1
Hypotheses and predictions for acoustic variability and individual distinctiveness across western gorilla call types.
Call
Distance Hypothesis
Social Hypothesis
Predictions
Result
Hoot series
High
Low
High
Hums
Low
Intermediate
High*
Single grunt
Low
High
High
Double grunt
Low
High
High
Grumble
Low
High
High*
Copulation grunt
Low
High
High
Threat grunt
Low
High
High
Scream
Low
High
High*
The last column summarizes the results.
* Not generalizable since p-values generated by cross-validated permutated DFAs did not reach significance.
Spectrograms of female gorilla call types.
Call types include single grunt (SG), double grunt (DG), grumble (GR), threat grunt (TG), copulation grunt (CG), hum (HM), hoot series (HT), and scream (SC). The call type is equivalent to the call segment, except for TG, CG HT, in which case the call segment is indicated by a bracket.The last column summarizes the results.* Not generalizable since p-values generated by cross-validated permutated DFAs did not reach significance.
Methods
Ethic statement
This study, based on non-invasive research, followed standard guidelines to prevent disease transmission between observers and animals, and was in compliance with the legal requirements of the country in which it was conducted and the Stony Brook University Animal Care and Use Committee (IACUC numbers: 20071444 and 20091444, 20101444). Permission to conduct research at the site was received from the Ministere de la Économie de Forêt of Congo Republic.
Subjects and study area
Study subjects included six adult female wild western gorillas from a single group at the Mondika Research Center (02° 21′ 859˝N; 016° 16′ 465˝E), Republic of Congo (for study site description, see [70]). Three of the six females were present during the entire 16-month study period (May-June 2007, April 2009-May 2010), whereas three others were present for shorter periods, due to death and immigration during the study. (Table S1 in File S1).
Data collection
Data were collected during 2–4 hour focal follows (n = 331) during a total of 749 focal hours. These include an average of 80±6 follows and 174±15 focal hours for four females, and an additional 12 follows and 54 focal hours for a fifth female (Table S1 in File S1). During focal follows, we continuously collected digital recordings of all focal vocalizations, using a portable Marantz recorder (PMD671) and a Sennheiser MKH 416 short shotgun microphone (equivalent noise 13dB, minimal impedance 25Ω, sensitivity in free field, no load (1kHz) 25mV/Pa ± 1dB), handled with a shock mount and protected by a foam windshield (MZW415ANT). Recordings were taken at a sampling frequency of 48 kHz at a distance of 5 meters or less from the focal subject. An additional 62 rare vocalizations were also recorded opportunistically from a sixth female from a comparable distance.
Data analysis
Using the Avisoft SASLab Pro 5 software (R. Specht, Berlin, Germany), we generated spectrograms (1024-pt FFT length containing 256 samples; overlap 93%) after converting the sampling frequency of the calls depending upon call frequency range (22,050 Hz, 11,025 Hz, or 4000 Hz). Gorilla vocalizations are typically composed of one or more call segments, each of which may be given once or more than once (Figure 1). We use the call segment, rather than the entire vocalization, as the unit for acoustic analysis, following Bezerra et al. [71]. We analyzed all call segments within a sequence for all call types that consisted of repeated call segments of a similar type (i.e., threat grunts, copulation grunts and hoot series), because the segments within a sequence showed high variability. Only independent sequences (separated by at least 1 hr) were considered. Of the 2,360 calls recorded, we selected 681 for acoustic analysis based on the acoustic quality of the recording (i.e. background-noise ratio). We analyzed all eight female call types that make up the female vocal repertoire including the single, double, copulation and threat grunts, grumbles, hums, screams and hoot series. These calls varied in amplitude (soft and loud calls), typical exchange distance (short, medium and long range call distance), whether they were directed to broad or specific audiences, and the context in which they occurred (Table 2) [68]. Average call rates varied between both call types and individuals [68], and therefore the sample sizes available for analysis also vary across individuals and call types (see n in Table 3 for complete description of sample size of each female for each call type). Individuals were excluded from the analysis of a particular call type when sample size was less than four calls. To reduce potential bias and for comparative purposes we used the same number of subjects (typically 3) for the analysis of most call types, varying their identity as needed. All recordings used in this analysis were collected during the 2009–2010-study period, except for those recorded in 2007 for female 6, who died in 2008 (Table S1 in File S1).
Table 2
Description of female western gorilla call types, including amplitude (soft or loud), typical distance at which they are exchanged, whether they are given to a broad or specific audience and the contexts in which they are given.
Call type
Soft/Loud
Distance
Directionality
Context
Single grunt1
soft
close
broad
maintenance activities (forage, rest, travel)
Double grunt1
soft
close
broad
maintenance activities (forage, rest, travel)
Grumble1
soft
close
broad
maintenance activities (forage, rest, travel)
Copulation grunt1
soft
close
specific/?
sexual intercourse
Threat grunt1
soft
close
specific
within-group aggression
Scream1
loud
close
specific/?
within-group escalated aggression
Hum1
soft
intermediate?
broad
Forage
Hoot series1,2
loud
large
broad
when individuals are separated
: Salmi et al. 2013; 2: R. Salmi in revision.
Table 3
Individual distinctiveness in eight western gorilla call types.
Call type
ID
n
DFA
P1
C-DFA
P2
Chance
F
Wilk's λ
df
χ2
p-value
M
Significant parameters
Single grunt
52
75.0
0.001
75.0
0.004
1
0.36
2
49.57
<0.001
1
fp1mean
F1
16
62.5
62.5
30.0
F2
25
84.0
84.0
30.0
F4
11
72.7
72.7
30.0
Double grunt
38
86.8
0.004
84.2
0.006
2
0.33
4
38.69
<0.001
2
pfmin dfa1maloc
F1
26
84.6
84.6
30.0
F4
8
87.5
87.5
30.0
F6
4
100
75.0
30.0
Threat grunt
149
79.2
0.001
74.5
0.001
2
0.26
18
189.59
<0.001
5
dfa2mean diffreq pfmean ampratio duration
F1
62
82.3
80.6
30.0
F3
58
79.3
70.7
30.0
F5
29
72.4
69.0
30.0
Cop. grunt
59
86.4
0.001
83.1
0.001
2
0.21
4
88.04
<0.001
2
diffreq, pfmean
F1
18
77.8
77.8
30.0
F4
8
87.5
75.0
30.0
F5
33
90.9
87.9
30.0
Grumble
24
87.5
0.036
79.2
0.701*
2
0.13
6
40.28
<0.001
2
tonality fp1amean
F1
6
66.7
66.7
30.0
F2
12
91.7
75.0
30.0
F3
6
100
100
30.0
Hum
23
78.3
0.026
78.3
0.132*
2
0.23
4
28.30
<0.001
2
noise pfmax
F1
14
78.6
78.6
30.0
F4
4
50
50.0
30.0
F6
5
100
100
30.0
Hoot series
324
78.1
0.001
77.2
0.007
1
0.67
7
126.95
<0.001
2
dfa1maloc diffreq
F1
209
77.0
76.1
50.0
F2
115
79.1
79.1
50.0
Scream
12
100
0.04
83.3
0.255*
1
0.12
3
18.37
<0.001
1
fp1max
F1
7
100
85.7
50.0
F3
5
100
80.0
50.0
Summary of eight discriminant function analyses, including the identity of the females (F1-6), sample size (n) per female, the % of calls assigned correctly to individual based on discriminant functional analysis (DFA) or cross-validated DFA (C-DFA), relative p-values calculated from pDFAs interactions (P1 and P2), number of Functions (F), Wilk's λ, Chi square (χ2) and p-values, number of linear mixed models (M) and significant parameters after LMM and Hochberg correction.
* In cases with low number of calls a meaningful cross-validation was not possible.
: Salmi et al. 2013; 2: R. Salmi in revision.Summary of eight discriminant function analyses, including the identity of the females (F1-6), sample size (n) per female, the % of calls assigned correctly to individual based on discriminant functional analysis (DFA) or cross-validated DFA (C-DFA), relative p-values calculated from pDFAs interactions (P1 and P2), number of Functions (F), Wilk's λ, Chi square (χ2) and p-values, number of linear mixed models (M) and significant parameters after LMM and Hochberg correction.* In cases with low number of calls a meaningful cross-validation was not possible.We used the custom software program LMA 2011 [72] to calculate twenty acoustic parameters related to time, frequency, energy, relative amplitude characteristics, and tonality (Table 4) for each call segment. Because we used call segments, as the unit of analysis we could not measure the total call duration, the number of call segments in the call or the silent intervals between them, in contrast to some previous studies (e.g., [52]). We excluded measurements of frequency modulation because it was not a noticeable property in gorilla calls. We also excluded parameters that are greatly influenced by external factors such as start and end call frequency measurements [73].
Table 4
Acoustic parameters list. Number, type, name and description of parameters used in the analysis.
Parameters
Description
duration
Duration from beginning to the end of the call [ms]
dfa1mean
Mean value of the frequency at which the first quartile of global energy is reached across all time segments [Hz]
dfa1maloc
Location of the maximum frequency at which the first quartile of global energy is reached across all time segments
dfa2mean
Mean value of the frequency at which the second quartile of global energy is reached across all time segments [Hz]
dfa2maloc
Location of the maximum frequency at which the second quartile of global energy is reached across all time segments
df1max
Maximum value of the first frequency in the call which contains more energy than a particular thresholds in all time segments (DF) [Hz]
df1min
Minimum value of the first frequency in the call which contains more energy than a particular thresholds in all time segments (DF) [Hz]
df1mean
Mean value of the first frequency in the call which contains more energy than a particular thresholds in all time segments (DF) [Hz]
diffmean
Minimum difference between first and second dominant frequency bands (DF) [Hz]
diffreq
Mean number of dominant frequency bands (DF)
ampratio1
Amplitude ratio between first and second dominant frequency bands
fp1max
Max frequency first peak (global frequency peak) [Hz]
fp1mean
Mean frequency first peak [Hz]
fp1amean
Mean amplitude first peak [rel. amplitude]
ranmean
Mean frequency range [Hz]
pfmax
Maximum value in all time segments of the frequency with the highest energy [Hz]
pfmin
Minimum value in all time segments of the frequency with the highest energy [Hz]
pfmean
Mean value across all time segments of the frequency with the highest energy [Hz]
noise
Percentage of noisy time segments [%]
tonality
Percentage of tonal time segments [%]
Statistical analysis
Acoustic variability
We used the twenty call parameters as single acoustic characteristics. For each of them, across all call types, we calculated the inter-individual variation means (MEANinter calculated over the whole set of calls of a given type) and standard deviation (SDinter calculated over the whole set of calls of a given type), and the intra-individual means (MEANintra calculated by averaging individual means) and standard deviations (SDintra calculated by averaging the SD of every individual's set of calls and totaling it with the SD calculated over the individual means values). We then computed the coefficients of variation, both inter-individually (CVinter = 100x SDinter/MEANinter) and intra-individually (CVintra = mean of individual CV values; with for each individual, CV = 100x SD/MEAN) for each parameter. We test whether the inter- and intra-individual coefficients of variation differed among call types using the Friedman test. Separately, we computed for each call type inter- and intra-individually mean coefficients of variation (CVmean) averaged over all parameters studied (CVinter-mean and CVintra-mean).
Individual distinctiveness
We assessed whether call types were individually distinct using two methods. First, for comparative purposes, we considered individual distinctiveness measuring the potential of identity coding (PIC) following Robisson et al. [1], since this method was also used in some previous studies of primate communication [52], [53]. PIC is calculated as CVinter/CVintra. We consider PIC value of greater than 1 as signifying that the parameter is distinct among individuals and can thus be used for individual recognition, as in previous studies (e.g., [1], [52], [53], [56]). We tested whether calls differed in their PIC values using the Friedman test. For each call we also calculated the overall potential for individual identity coding (overall PIC = CVinter-mean/CVintra-mean) to assess whether some calls display higher values than others.The second method we used to test for individual distinctiveness in call types was a multilevel analysis of variance, following a two-step procedure [72]. We performed discriminant function analysis (DFA) [74] of the acoustic parameters using the stepwise method and then testing for internal validation of the classification with a cross-validation procedure using the leave-one-out method (U-method) [75]. The comparison of DFA and cross-validated DFA indicates whether the profiles are stable [76], especially in cases in which individual call samples are small (<10 calls). Plots of the distribution of discriminant scores were generated for those cases in which more than one discriminant function was used to distinguish among callers and/or more than two subjects were included in the analysis. We then used linear mixed modeling (LMM) [79] to test whether distinctiveness of call parameters (those which correlated >0.45 with the functions generated by DFA) was significant after controlling for unequal samples and after adjusting for multiple analyses using Hochberg's correction [80]. Additionally, in order to obtain the statistical significance (equivalent to p-value) of the mean effect size [56], [77], we ran permutated DFAs (fitting and validation) using a function written by Roger Mundry in R (version 3.0.1; R Core Team 2013), which is based on the function lda of the R package MASS [78]. This procedure, including 100 random selections and 1000 iterations, allows associating p-values to the observed correct classification rates of both the normal and cross-validated DFAs, given enough cases per individuals. To determine whether some call types were more individually distinct than others, we compared the cross-validated percentages of correct assignment (arcsine transformed) of each female caller for the eight call types and ran a LMM, considering call types as fixed factor and caller identity and number of calls/segments as random factors. All statistical tests have been executed using SPSS 20.0, if not otherwise indicated.
Results
Acoustic variability across female calls
The overall mean coefficients of inter- and intra-individual variation were greater than 40% for most call types (Figure 2; see Table S2 in File S1 for coefficients' values). Intra-individual variation differed significantly across call types (Friedman test on CVintra: χ2 = 15.57, df = 7, n = 20, p = 0.03; Table S2 in File S1), with CVintra ranging from 33% to 56% in food calls (HM) and threat grunts (TG) respectively (Figure 2). Inter-individual acoustic variation did not differ among calls (Friedman test on CVinter values: χ2 = 9.42; df = 7, n = 20, p = 0.22; Table S4 in File S1).
Figure 2
Inter- and intra-individual mean coefficient of variation (CV) for eight gorilla call types.
Call types include: hoot series (HT), hum (HM), grumble (GR), double grunt (DG), single grunt (SG), copulation grunt (CG), threat grunt (TG), and scream (SC).
Inter- and intra-individual mean coefficient of variation (CV) for eight gorilla call types.
Call types include: hoot series (HT), hum (HM), grumble (GR), double grunt (DG), single grunt (SG), copulation grunt (CG), threat grunt (TG), and scream (SC).
Individual distinctiveness across female calls
The overall potential for identity coding (PIC) values for all call types was >1, and thus all call types displayed individual vocal cues (Figure 3) There was relatively little variation in PIC values across call types (range = 1.07–1.23, n = 8 call types) and PIC values did not vary significantly among call types (Friedman test: χ2 = 7.73, df = 7, n = 20, p = 0.36; Table S3 in File S1). There is no indication that PIC values tended to vary based on call function since the three call types with equally low PIC values (hoot series, single grunts and threat grunts; Figure 3) differed considerably in amplitude (soft versus loud calls), distance at which they were exchanged (close or long range) and in whether they were given in varied or highly specific contexts (Table 2).
Figure 3
Overall Potential for Identity Coding (PIC) for each of eight call types.
The overall PIC (PIC) for each call type is calculated as mean coefficient of inter-individual variation divided by the mean coefficient of intra-individual variation, where each coefficient of variation is averaged over all parameters. Call types include: hoot series (HT), hum (HM), grumble (GR), double grunt (DG), single grunt (SG), copulation grunt (CG), threat grunt (TG), and scream (SC).
Overall Potential for Identity Coding (PIC) for each of eight call types.
The overall PIC (PIC) for each call type is calculated as mean coefficient of inter-individual variation divided by the mean coefficient of intra-individual variation, where each coefficient of variation is averaged over all parameters. Call types include: hoot series (HT), hum (HM), grumble (GR), double grunt (DG), single grunt (SG), copulation grunt (CG), threat grunt (TG), and scream (SC).Discriminant function analyses also distinguished among female callers for all eight call types; classification accuracy ranged from 75% in single grunts to 100% in screams (Table 3; cross-validated DFA = 75 to 84%). The percentage of calls assigned correctly to each subject ranged from 50 to 100% (cross-validated DFA 50 to 100%), which was, in each case, significantly greater than would be expected by chance (i.e. 33% when n = 3 females and 50% when n = 2 females). Plots were generated by the discriminant function analyses for five call types (Figure 4), and in each case females showed a degree of separation, with some gradation in acoustic properties. In each case the DFA and cross-validated DFA produced similar results, indicating that the derived profiles were stable (Table 3). Individual variation in call type can be observed in the spectrograms of the same call type given by different individuals (Figure 5). Using permutated discriminant function analyses (pDFAs) we obtained significant discrimination levels for each DFA (p-values (P1) ranging from 0.001 to 0.036; Table 3). The discrimination level for cross-validated p-DFAs (P2) was significant for all call types except those with very low numbers of calls (hum: 18 calls; grumble: 18 calls; scream: 10 calls) for which a meaningful permutated cross-validated DFA was not possible. Therefore the results for these three call types cannot be generalized. No call type, however, resulted to be more individually distinct than any other, i.e., cross-validated classification assignments did not differ significantly among the eight call types (LMM controlling for unequal sample size, caller identity and number of individual calls/segments per call type: F(7, 14) = 0.24; p = 0.97).
Figure 4
Plots of the two first discriminant functions for five female western gorilla call types based on the DFA of three females.
Each individual is indicated with a different symbol.
Figure 5
Spectrograms of three call types given by different female gorillas (F1, F2, F4, F6) showing the acoustic variation.
Plots of the two first discriminant functions for five female western gorilla call types based on the DFA of three females.
Each individual is indicated with a different symbol.
Important acoustic parameters
From PIC analysis
Based on the PIC analysis, most of the 20 acoustic parameters could be considered useful in distinguishing among callers since they had PIC values >1 (Table S3 in File S1). Five parameters displayed values above 1 for all call types (Table S3: mean frequency range (ranmean), the mean number of dominant frequency bands (diffreq), the mean and the minimum values of the frequency with the highest energy (pfmean and pfmin) and the percentage of noisy time segments (noise). However, the parameters with highest PIC values were not consistent across call types. For example, the minimum difference between first and second dominant frequency bands (diffmean) was very important (PIC>2.0) in distinguishing among individuals in copulation calls, but not in other calls (Table S3 in File S1).
From DFA
The discriminant function analysis identified fewer (an average of 2 and range of 1–5) acoustic parameters that were significant in distinguishing among individual callers (Table 3). The set of acoustic parameters that significantly distinguished among callers was different for each call type. For example single grunts were distinguished by the mean frequency of first peak (df1mean), hoots were distinguished by the location of the maximum frequency at which the first quartile of global energy is reached (dfa1maloc) and the mean number of dominant frequency bands (diffreq), and hums were distinguished by the amount of noise in the call and the maximum value of the frequency with the highest energy (pfmax). All parameters used by each DFA to discriminate among callers were still significant after controlling for unequal sample size (LMM) and correcting for multiple analyses (Hochberg's correction) (Table 3; for calls' characteristics see Table S4 in File S1).No single parameter was significant in all call types (Table 3). However, one parameter, the mean number of dominant frequency bands (diffreq), was used more frequently to distinguish among female callers than any other (i.e. in three of the eight call types, including hoot series, threat grunts and copulation calls). Two other parameters, the location of the global energy in the first quartile of the call (dfa1maloc) and the mean frequency with the highest energy (pfmean) distinguished callers in two call types (i.e., dfa1maloc: double grunt and hoot series; pfmean: threat and copulation grunt; Table 3). No other parameter was significant in more than one call.
Discussion
We found that all eight call types characterizing the female western gorilla carry important individual cues that could potentially be used by call recipients to distinguish among different callers. This was true regardless of the call amplitude (soft vs. loud) distance at which calls were exchanged (calls used over close and long range), the emotional state of the caller (i.e., calls given when agitated (i.e. cough grunt) versus undisturbed (i.e. grunts)), and the context or lack thereof of calls (i.e. in context-specific calls such as copulation grunts and hoot series versus those given in many contexts such as single and double grunts). Although the PIC analyses, the discriminant function analyses and the normal permuted discriminant function analyses showed that all call types used by western gorilla females were individually distinct, the permutated cross-validated DFAs failed to confirm it for three call types (the hum, the grumble and the scream). Not surprisingly, these calls were those for which we had very low number of calls per individual, which might have strongly lowered the power for a cross-validated method. We find no support in western gorillas for either of the two hypotheses proposed to explain the adaptive function of individual distinctiveness, namely that there is a greater selective pressure for calls to be more individually distinct if they are exchanged over long distances (i.e., the distance communication hypothesis [59]), or used at close range for social communication (i.e., the social communication hypothesis, [60], [61]), in contrast to previous studies in other primate species [12], [52], [53].This discrepancy may stem, in part, from the more graded nature of apes' vocal systems relative to those of other primates [81]. Graded calls typically exhibit more variability in their acoustic structure than do the more discrete or stereotypic calls typically studied in other primate species ([59], [61], but see [82], [83]). Therefore it was not surprising that the average level of intra-individual variation in each call type was generally higher in western gorillas (33–56%) compared to that of monkey species (red-capped monkeys, Cercocebus torquatus: 11–31%, [52]; Campbell monkeys, Cercopithecus campbelli: 20–50% [53]). Inter-individual variation was instead intermediate between those of the other two species (red-capped monkeys, Cercocebus torquatus: 20–50% [52]; western gorillas, Gorilla gorilla: 40 to 60% (this study); Campbell monkeys, Cercopithecus campbelli: 45–227% [53]), in which alarm calls were consistently less distinct than other calls. It is possible that the absence of predator alarm calls in western gorillas contributes to the relative similarity of individual distinctiveness since these calls are typically more stereotypic and less variable across individuals than other calls [52], [53].Although individual acoustic variability and distinctiveness and the potential ability to distinguish among callers may have evolved in many taxa (e.g., birds: [1]–[3]; bats: [4]–[6]; ungulates: [7], [8]; carnivores: [9]–[11]; primates: [12]–[16]), this capacity may be particularly beneficial to primates, because in the majority of primate genera individuals live long lives (i.e., they have slow life histories, [84]), in the company of other group members, with whom they develop long-term relationships. The ability to recognize all types of vocalizations of others would be beneficial because it could potentially allow an individual to monitor the interactions of others while engaged in other maintenance activities (e.g., [85]–[89]), avoid the unseen approaches of higher-ranking individuals, and allow the coordination of group activity and spatial proximity (e.g., [90], [91]) when group members are spatially separated [92], [93]. In western gorillas it might be especially important for silverback males to be able to distinguish among the calls of different female callers, since males are actively involved in protecting group members from predation, harassment, and infanticide [94], [95], serve as peacekeeper in female conflicts [95], [96] and play an important role in group travel and coordination [97]. Therefore, since all female gorilla calls can serve some sort of social function, independently of the distance at which they are exchanged, our results seem to support the social-vocal coevolution of communicative abilities [98]. Additionally, although western gorilla groups are relatively small [99]–[101], individuals are frequently visually separated from each other (R. Salmi in revision), and thus individually distinct calls may be particularly important in keeping individuals in contact with each other, as the “vocal grooming hypothesis” proposed previously [61], [102].It has also been suggested that individual distinctiveness might simply be the result of chance, emerging from idiosyncrasies and differences in the development of the vocal tract of each individual [50], [103]. However, this explanation has not gained much traction, because it appears too simplistic and unable to account for the level of individual differences found in vocal signals of primates [104], [105].To date, it continues to remain unclear the exact role that individual acoustic variability plays in animal communication. An important step toward a better understanding of the evolution and the adaptive function of individual vocal cues in animal communication is testing whether these differences are meaningful and used by listeners to distinguish callers. That calls are individually distinct, in fact, does not mean that individuals can recognize the calls of different individuals, and playback studies are needed to this end (e.g., [31], [46], [106]–[108]).One question raised by this study is whether there is in fact any biological relevance to finding that one call type is more individually distinct than another, if all call types are distinct. In addition, evidence in primate literature suggests that acoustic recognition might not follow the same pattern of call distinctiveness. For example, Kojima et al. [109] demonstrated that although chimpanzee long-distance calls (pant hoots) were more individually distinct than close calls (pant grunts) [55], a female chimpanzee easily distinguished between different callers for both calls, and recognized caller identities even when calls of more than one individual were played simultaneously. This suggests that acoustic recognition of caller identity is dependent upon some individual distinctiveness, but does not necessarily improve with the relative degree of distinctiveness.Different acoustic parameters (or set of parameters) were responsible for generating individually distinctive voice cues in the different call types. For example, the significant acoustic parameters for distinguishing among individual single grunts did not overlap with those important for double grunts (Table 4). We did not find that the parameter “noise” was significant in distinguishing individuals in many call types, or that “noisy” calls (such as grunts) were less individually distinct than harmonic calls (such as hoot series), as was suggested previously [12]. There was, however, one parameter, the mean number of dominant frequency bands (i.e., diffreq) that contributed to distinguishing three call types (Table 4). This parameter may be important because might reflect differences in individual vocal tract configuration during the vocalization. It has been suggested that this parameter could potentially overestimate individual variation in calls as a result of sampling bias when calls are recorded at different distances to the microphone [72]. However, that is unlikely in this study since western gorillas are terrestrial, most female calls are of low amplitude and therefore calls used in this analysis were all recorded at close sampling distances, typically less than 5 meters.An interesting question that emerges from the analysis of call individual distinctiveness across an entire species' repertoire is whether the same individual signatures are present across call types. Although our study was not designed to address this specific question, the fact that different sets of parameters were important in distinguishing among callers in each call type does not exclude this possibility. In fact, vocal tract, age, body size, and other anatomical (or social) individual differences are likely to affect similarly most utterances (or at least those with similar acoustic structure). For example, from our results, several acoustic characteristics (e.g., dfa1 mean, df1max, fp1mean, pfmean) of a specific female (F1) have consistently lower values than those of another female (F4) across most call types (Table S4 in File S1). However, since subjects' identity was not maintained constant across call types, future studies with a proper design should test this possibility in more details.An additional factor that could contribute to differences between our results and previous work is our use of the call segment rather than entire call as the unit of acoustic analysis. The use of call segments for analysis can reduce the effect of emotional state on call acoustic variability, since higher arousal are generally associated with an increase in the length of the entire call and the number of repetitions of call segments within each call, and a decrease in duration of silent time between call segments (e.g., [22], [24]). However its use is likely to have underestimated the degree of individual variation and distinctiveness relative to previous studies, particularly in calls of long duration, such as hoot series, threat and copulation grunts, because it fails to account for differences in the temporal parameters of the entire calls, which have been found to differ significantly among individuals in other species (e.g., [53]). Nonetheless, our method did not biased our results towards any specific hypothesis, since most of the gorilla call types are singularly uttered, and those that are not include call types that are exchanged at close and long distances.The greatest limitation of the current study is the small sample size, both in the number of individuals and calls per subject. Western gorillas have been notoriously difficult to habituate [110] and as a result few groups are available for study. Additionally, their small average group size (one male, four females and their offspring; [99]–[101]) further limit the availability of subjects within groups. Furthermore, there is a pronounced rank effect on female calling rates which further limits sample sizes of low-ranking females ([68], R. Salmi in revision). Small sample size restricted our ability to control for factors that might contribute to individual distinctiveness, such as, age and arousal level of the caller (which may affect call frequency characteristics; e.g., [111], [112]) or whether the same acoustic parameters would also remain significant in distinguishing callers, if other individuals were sampled. If sample size was larger we might expect some minor differences between different groups, since it has been shown that groups exhibit some subtle acoustic differences [43], [44]. However, overall we would expect similar findings due to the limited structural variation within a vocal type, although we would also expect a greater number of parameters to be useful in distinguishing individuals in larger groups, as shown in previous studies (e.g., [113], [114]). However, in spite of these limitations our study showed high levels of acoustic variation across the entire vocal repertoire of female western gorillas, with all call types being individually distinct. Just as individual voices in humans provide the ability to distinguish and recognize the identity of speakers (reviewed in [115]), even before the development of speech perception [116], [117], it is likely that the same may be true in western gorillas, as well as other non-human primates. We suggest that the study of caller recognition is critical to elucidate the adaptive function of individual vocal cues, and playback experiments can contribute to this goal by revealing whether listeners use these acoustic differences to recognize caller identity and investigating what parameters are important for this end.Table S1. Summary of female data, including age, time spent in the group, number of follows, total sampling hours and number of calls. Table S2. Coefficient of intra- (a) and inter-variation (b) of each acoustic parameter and overall values for each call type. Table S3. PIC values of 20 acoustic parameters [CVinter/CVintra] and overall values [CVmean-inter/CVmean-intra] for each call type. Table S4. Acoustic characteristics (mean ± SD) of call types [scream (SC), threat grunt (TG), copulation grunt (CG), single grunt (SG), double grunt (DG), grumble (GR), hum (HM), and hoot series (HT)]. In bold are indicated those parameters used by the DFAs to discriminate among females.(DOCX)Click here for additional data file.
Authors: Nicolas Mathevon; Aaron Koralek; Mary Weldele; Stephen E Glickman; Frédéric E Theunissen Journal: BMC Ecol Date: 2010-03-30 Impact factor: 2.964
Authors: Stéphanie Mercier; Eloïse C Déaux; Erica van de Waal; Axelle E J Bono; Klaus Zuberbühler Journal: PLoS One Date: 2019-05-01 Impact factor: 3.240