| Literature DB >> 25077021 |
Georg Fritsch1, Alexander Bruckner1.
Abstract
Software-aided identification facilitates the handling of large sets of bat call recordings, which is particularly useful in extensive acoustic surveys with several collaborators. Species lists are generated by "objective" automated classification. Subsequent validation consists of removing any species not believed to be present. So far, very little is known about the identification bias introduced by individual validation of operators with varying degrees of experience. Effects on the quality of the resulting data may be considerable, especially for bat species that are difficult to identify acoustically. Using the batcorder system as an example, we compared validation results from 21 volunteer operators with 1-26 years of experience of working on bats. All of them validated identical recordings of bats from eastern Austria. The final outcomes were individual validated lists of plausible species. A questionnaire was used to enquire about individual experience and validation procedures. In the course of species validation, the operators reduced the software's estimate of species richness. The most experienced operators accepted the smallest percentage of species from the software's output and validated conservatively with low interoperator variability. Operators with intermediate experience accepted the largest percentage, with larger variability. Sixty-six percent of the operators, mainly with intermediate and low levels of experience, reintroduced species to their validated lists which had been identified by the automated classification, but were finally excluded from the unvalidated lists. These were, in many cases, rare and infrequently recorded species. The average dissimilarity of the validated species lists dropped with increasing numbers of recordings, tending toward a level of ˜20%. Our results suggest that the operators succeeded in removing false positives and that they detected species that had been wrongly excluded during automated classification. Thus, manual validation of the software's unvalidated output is indispensable for reasonable results. However, although application seems easy, software-aided bat call identification requires an advanced level of operator experience. Identification bias during validation is a major issue, particularly in studies with more than one participant. Measures should be taken to standardize the validation process and harmonize the results of different operators.Entities:
Keywords: Acoustic identification; automated classification; batcorder; identification bias; observer experience
Year: 2014 PMID: 25077021 PMCID: PMC4113294 DOI: 10.1002/ece3.1122
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1Principal component analysis of three working experience traits in a study on operator bias in bat call identification. Experience traits: (i) number of years working on bats; (ii) experience in analyzing bat calls, regardless of method used; and (iii) experience with the batcorder system before this study. Operators are labeled O1–O21. The gradient of experience ranges along PC1 from least experienced (left) to most experienced (right).
Figure 2Questionnaire ratings of supporting resources used in the individual validation procedure of 21 operators in a study on operator bias in bat call analysis. Ratings were from r1 (light blue, not important) to r5 (dark blue, very important), “na”: no answer given. Three sets of questions dealt with: supplements provided for site characterization (9a–h), functions of the identification software bcAdmin and batIdent (10a–j), and other resources used by the operators (11a–g).
Figure 3Relationship between the number of recorded call sequences per study site and (1) the number of unvalidated species from bat call identification software (diamonds), (2) the number of validated species produced by 21 operators (circles), in a study on operator bias in bat call identification. Each vertical strip represents a site (S1 to S12). Note the logarithmic scale of the abscissa. On two occasions, the same number of call sequences were recorded in two sites (S6/S10, S2/S5; 11 and 21 sequences).
Figure 4Variability in validation of species lists by 21 operators in a study on operator bias in bat call identification. Each vertical strip represents an operator (O1 to O21), each circle one of 12 sites. Operators were ranked along the abscissa from left (least experienced) to right (most experienced), using principal component analysis scores for a synthetic experience variable (Fig. 1).
Identification of species in a study on operator bias in bat call analysis. Numbers of sites (of a maximum of 12) in which each species was identified by automated identification software (batIdent: species unvalidated), and numbers of sites in which identification of each species was accepted by 21 operators (O1–O21: species validated). Parenthesized bold font: Number of times an operator listed a species which was not included in batIdent's overall classifications for whole call sequences, but which had been identified from single calls (“additional species”). Operators are ranked by experience (scores on PC1, Fig. 1) from least (left) to most experienced (right)
| batIdent | O21 | O17 | O5 | O2 | O3 | O19 | O16 | O18 | O14 | O8 | O6 | O13 | O12 | O20 | O11 | O10 | O7 | O15 | O9 | O1 | O4 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ppip | 10 | 9 | 8 | 9 | 9 | 10 | 10 | 9 | 10 | 9 | 10 | 10 | 10 | 10 | 11 ( | 10 | 10 | 10 | 10 | 10 | 10 | 9 |
| Bbar | 5 | 3 | 4 | 2 | 4 | 4 | 4 | 3 | 4 | 5 | 4 | 4 | 4 | 5 | 4 | 4 | 4 | 4 | 3 | 4 | 4 | 3 |
| Nnoc | 9 | 6 | 7 | 4 | 6 | 8 | 8 | 6 | 8 | 3 | 8 | 8 | 9 | 9 | 7 | 7 | 9 ( | 5 | 7 | 8 | 7 | 7 |
| Ppyg | 3 | 2 | 2 | 2 | 3 | 3 | 3 | 2 | 3 | 3 | 3 | 3 | 3 | 3 | 4 ( | 3 | 2 | 2 | 3 | 3 | 1 | 2 |
| Mdau | 9 | 1 | 1 | 6 | 6 | 9 ( | 6 | 7 | 9 | 8 | 8 | 6 | 9 | 9 | 6 | 8 | 8 | 5 | 4 | 2 | 4 | 9 |
| Mbart | 10 | 3 | 3 | 1 | 9 | 10 | 5 | 8 | 10 | 3 | 6 | 9 | 10 | 9 | 7 | 8 | 11 ( | 6 | 8 | 2 | 8 | |
| Pmid | 6 | 3 ( | 2 | 5 ( | 1 | 7 ( | 4 ( | 8 ( | 6 ( | 7 ( | 5 ( | 6 ( | 8 ( | 6 ( | 5 ( | 7 ( | 4 ( | 7 ( | ||||
| Mnat | 3 | 2 | 3 | 2 | 2 | 4 ( | 1 | 3 | 3 | 1 | 1 | 3 ( | 1 | 2 | 2 | |||||||
| Hsav | 3 | 2 ( | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 ( | 1 | 1 | ||||||||||
| Eser | 3 | 3 ( | 5 ( | 5 ( | 2 ( | 4 ( | 3 ( | 2 | 2 | 3 | 3 | 6 ( | 2 | 2 | 2 | |||||||
| Mbec | 7 | 1 | 1 | 3 | 4 | 3 | 2 | 5 | 1 | 1 | 2 | 4 | 1 | 3 | 3 | 1 | ||||||
| Enil | 2 | 1 | 2 ( | 1 | 4 ( | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 2 ( | 1 | 1 | 1 | ||||
| Mema | 4 | 1 | 3 | 6 ( | 4 ( | 2 | 3 | 2 | 4 | 2 | 3 | 3 | 1 | 2 | 7 ( | 1 | 2 | |||||
| Mmyo | 2 | 1 | 1 | 1 | 1 | 1 | 3 ( | 1 | 2 ( | 1 | 2 ( | 3 ( | 3 ( | 2 ( | 1 | |||||||
| Malc | 3 | 1 | 2 | 1 | 1 | 2 | 1 | |||||||||||||||
| Plec | 0 | ( | ( | ( | ( | ( | ( | |||||||||||||||
| Pkuh | 5 | 4 | 5 | 1 | 2 | 5 | ||||||||||||||||
| Pnat | 3 | 1 | 3 ( | 2 ( | 3 | 5 ( | 5 ( | 1 | ||||||||||||||
| Nlei | 2 | 1 | 1 | 2 | ( | ( | ||||||||||||||||
| Vmur | 0 | ( | ( | |||||||||||||||||||
| Mdas | 1 | 1 |
Ppip, Pipistrellus pipistrellus; Bbar, Barbastella barbastellus; Nnoc, Nyctalus noctula; Ppyg, P. pygmaeus; Mdau, M. daubentonii; Mbart, M. mystacinus/brandtii; Pmid, P. kuhlii/nathusii; Mnat, M. nattereri; Hsav, Hypsugo savii; Eser, E. serotinus; Mbec, M. bechsteinii; Enil, Eptesicus nilssonii; Mema, M. emarginatus; Mmyo, Myotis myotis; Malc, M. alcathoe; Plec, Plecotus spp.; Pkuh, P. kuhlii; Pnat, P. nathusii; Nlei, N. leisleri; Vmur, Vespertilio murinus; and Mdas, M. dasycneme.
Figure 5Nonmetric multidimensional scaling (NMDS, stress 0.19) of species lists for 12 study sites produced by 21 operators in a study on operator bias in bat call analysis. Each dot represents a validated species list produced by an operator. Not all dots are visible, due to manifold overplotting.
PERMDISP analysis as a measure of the average variance of assemblages (% dissimilarity) for each site in a study on operator biases in bat call identification. Medians are calculated from the operators' individual PERMDISP values. The sites (S1–S12) are ranked by ascending number of call sequences (Fig 3). “batIdent” gives the unvalidated number of species from automated classification
| Sites | S6 | S10 | S2 | S5 | S4 | S8 | S1 | S3 | S9 | S7 | S11 | S12 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Dissimilarity (median, %) | 1.4 | 29.0 | 30.2 | 27.0 | 19.7 | 22.4 | 21.7 | 21.1 | 16.0 | 18.0 | 19.0 | 18.1 |
| batIdent (number spec.) | 2 | 7 | 6 | 5 | 5 | 9 | 10 | 7 | 10 | 7 | 14 | 12 |
| Call sequences | 11 | 11 | 21 | 21 | 27 | 62 | 85 | 93 | 164 | 197 | 1338 | 1429 |
Figure 6Percent dissimilarity (PERMDISP values) of the validated species lists for 12 sites produced by 21 operators (circles) in relation to the number of call sequences recorded in a study on operator bias in bat call identification. Each vertical strip represents a site (S1 to S12). Note the logarithmic scale of the abscissa. On two occasions, the same number of call sequences were recorded in two sites (S6/S10, S2/S5; 11 and 21 sequences).