| Literature DB >> 36034316 |
Alexandra B Bosshard1,2, Maël Leroux1,2, Nicholas A Lester1,2, Balthasar Bickel1,2, Sabine Stoll1,2, Simon W Townsend1,2,3.
Abstract
Abstract: Emerging data in a range of non-human animal species have highlighted a latent ability to combine certain pre-existing calls together into larger structures. Currently, however, the quantification of context-specific call combinations has received less attention. This is problematic because animal calls can co-occur with one another simply through chance alone. One common approach applied in language sciences to identify recurrent word combinations is collocation analysis. Through comparing the co-occurrence of two words with how each word combines with other words within a corpus, collocation analysis can highlight above chance, two-word combinations. Here, we demonstrate how this approach can also be applied to non-human animal signal sequences by implementing it on artificially generated data sets of call combinations. We argue collocation analysis represents a promising tool for identifying non-random, communicatively relevant call combinations and, more generally, signal sequences, in animals. Significance statement: Assessing the propensity for animals to combine calls provides important comparative insights into the complexity of animal vocal systems and the selective pressures such systems have been exposed to. Currently, however, the objective quantification of context-specific call combinations has received less attention. Here we introduce an approach commonly applied in corpus linguistics, namely collocation analysis, and show how this method can be put to use for identifying call combinations more systematically. Through implementing the same objective method, so-called call-ocations, we hope researchers will be able to make more meaningful comparisons regarding animal signal sequencing abilities both within and across systems. Supplementary Information: The online version contains supplementary material available at 10.1007/s00265-022-03224-3.Entities:
Keywords: Call combinations; Collocation analysis; Comparative approach; Non-random structure; Syntax
Year: 2022 PMID: 36034316 PMCID: PMC9395491 DOI: 10.1007/s00265-022-03224-3
Source DB: PubMed Journal: Behav Ecol Sociobiol ISSN: 0340-5443 Impact factor: 2.944
Distribution of identified bigrams occurring in the four artificial vocal repertoires split along the two variables i) size of data set and ii) extent of recombination. The last two rows show the number of combinations that are described in the data set (combinations) and, therefore, how many calls a data set comprises (data set size). See appendix I for detailed distribution. The values are larger for the recombinational data sets as in these data sets all call types recombine with other call types outside of the three call combinations of interest (for details see appendix I)
| 4 | 4 | 40 | 40 | |
| 5 | 5 | 50 | 50 | |
| 7 | 7 | 70 | 70 | |
| 16 | 49 | 160 | 490 | |
| 32 calls | 98 calls | 320 calls | 980 calls |
Example of a Multiple Distinctive Collocation Analysis output for the bigrams of one of the four artificial vocal data sets (large-recombinations). Columns and rows show the first and second unit within a call combination, respectively. Values are pbins and can be translated to p estimation values (abs(pbin) > 3: P < 0.001, > 2: P < 0.01, > 1.3: P < 0.05). Relevant combinations are coloured in green
Multiple Distinctive Collocation values for the bigrams in the four artificial vocal data sets. SE (small-exclusive), SR (small-recombinations), LE (large-exclusive), and LR (large-recombinations). Values are pbins and can be translated to p estimation values (abs(pbin) > 3: P < 0.001, > 2: P < 0.01, > 1.3: P < 0.05)
| 2.4 | 1.2 | 24.1 | 7.9 | |
| 2.5 | 1.4 | 25.3 | 10.3 | |
| 2.5 | 1.9 | 25.1 | 14.5 | |
Example of a Mutual Information Collocation Analysis output for the bigrams of one of the four artificial vocal data sets (large-recombinations). Columns and rows show the first and second unit within a call combination respectively. Values are pbins and can be translated to p estimation values (abs(pbin) > 3: P < 0.001, > 2: P < 0.01, > 1.3: P < 0.05). Relevant combinations are coloured in green
Mutual Information Collocation values for the bigrams in the four artificial vocal data sets. SE (small-exclusive), SR (small-recombinations), LE (large-exclusive), LR (large-recombinations). Values are pbins and can be translated to p estimation values (abs(pbin) > 3: P < 0.001, > 2: P < 0.01, > 1.3: P < 0.05). As here, MICA does not control for specific ordering of calls in a structure, the bigrams Howl-Peep and Peep-Howl are considered to be the same combination, rendering only one entry, namely Peep-Howl, that incorporates both bigrams
| 3 | 2.3 | 3 | 2.3 | |
| 1.4 | 1.3 | 1.4 | 1.3 | |
Comparison of Multiple Distinctive Collocation Analysis and Mutual Information Collocation Analysis. Arrows represent if the collocation values get lower or higher due to a characteristic of the data set (small data set) or a call combination (recombination, low frequency, linearisation). The checkmark indicates that MDCA identifies ordering patterns, while MICA does not. NA designates that the size of the data set has no effect on MICA
| ↓ | ↓ | ↓ | |||
| NA | ↓ | ↑ | X | ||