| Literature DB >> 35413068 |
Morgan A Ziegenhorn1, Kaitlin E Frasier1, John A Hildebrand1, Erin M Oleson2, Robin W Baird3, Sean M Wiggins1, Simone Baumann-Pickering1.
Abstract
Passive acoustic monitoring (PAM) has proven a powerful tool for the study of marine mammals, allowing for documentation of biologically relevant factors such as movement patterns or animal behaviors while remaining largely non-invasive and cost effective. From 2008-2019, a set of PAM recordings covering the frequency band of most toothed whale (odontocete) echolocation clicks were collected at sites off the islands of Hawai'i, Kaua'i, and Pearl and Hermes Reef. However, due to the size of this dataset and the complexity of species-level acoustic classification, multi-year, multi-species analyses had not yet been completed. This study shows how a machine learning toolkit can effectively mitigate this problem by detecting and classifying echolocation clicks using a combination of unsupervised clustering methods and human-mediated analyses. Using these methods, it was possible to distill ten unique echolocation click 'types' attributable to regional odontocetes at the genus or species level. In one case, auxiliary sightings and recordings were used to attribute a new click type to the rough-toothed dolphin, Steno bredanensis. Types defined by clustering were then used as input classes in a neural-network based classifier, which was trained, tested, and evaluated on 5-minute binned data segments. Network precision was variable, with lower precision occurring most notably for false killer whales, Pseudorca crassidens, across all sites (35-76%). However, accuracy and recall were high (>96% and >75%, respectively) in all cases except for one type of short-finned pilot whale, Globicephala macrorhynchus, call class at Kaua'i and Pearl and Hermes Reef (recall >66%). These results emphasize the utility of machine learning in analysis of large PAM datasets. The classifier and timeseries developed here will facilitate further analyses of spatiotemporal patterns of included toothed whales. Broader application of these methods may improve the efficiency of global multi-species PAM data processing for echolocation clicks, which is needed as these datasets continue to grow.Entities:
Mesh:
Year: 2022 PMID: 35413068 PMCID: PMC9004765 DOI: 10.1371/journal.pone.0266424
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Map of recording locations.
Map showing the latitude-longitude locations of the Kona, Kauaʻi, and PHR sites. Location and depth of each site was averaged among deployments from that site. Basemap image is the intellectual property of Esri and is used herein with permission. Copyright © 2022 Esri and its licensors. All rights reserved.
Quantitative click type descriptions.
| Neural Network Class, Number of Clicks | Spectral Peak 1 (kHz) | Spectral Peak 2 (kHz) | Spectral Peak 3 (kHz) | Modal ICI (milliseconds) | |||
|---|---|---|---|---|---|---|---|
| Peak frequency | -3 dB bandwidth | Peak frequency | -3 dB bandwidth | Peak frequency | -3 dB bandwidth | ||
| False killer whale, n = 4000 | 16.5 (13.0–20) | 6.5 (1.0–12.0) | -- | -- | -- | -- | 28.4 (+/- 28.0) |
| Low-frequency 1, n = 4000 | 22.0 (20–25) | 5.5 (1.5–19.5) | -- | -- | -- | -- | 169 (+/- 132) |
| Short-finned pilot whale 1, n = 7000 | 13.0 (12.0–13.5) | 1.5 (1.0–6.5) | 28.0 (26.0–31.0) | 5 (2.0–10.0) | -- | -- | 184 (+/- 66.9) |
| Short-finned pilot whale 2, n = 40000 | 13.0 (12.5–14.0) | 1.5 (1.0–2.0) | 18.5 (16.5–20.5) | 3 (1.5–9.5) | 48.5 (36.0–40.5) | 3.0 (2.0–8.5) | 206 (+/- 56.0) |
| Tt/Pe, n = 20000 | 12.5 (11.5–13.5) | 1.5 (1.0–3.0) | 32.5 (30.0–35.5) | 5.5 (2.5–12.0) | -- | -- | 109 (+/- 109) |
| Blainville’s beaked whale, n = 25000 | 24.0 (23.0–25.5) | 2.5 (1.5–4.5) | 36.0 (32.0–41.5) | 9.0 (4.5–15.0) | -- | -- | 319 (+/- 109) |
| Cuvier’s beaked whale, n = 2500 | 17.0 (16.0–18.5) | 2.5 (2.0–3.5) | 24.0 (22.0–25.5) | 4.0 (2.0–9.0) | 40.0 (37.0–44.0) | 6.5 (3.0–12.5) | 433 (+/- 59.0) |
| Stenellid 1, n = 100000 | 18.5 (16.5–20.5) | 4.25 (3.0–9.75) | 50.0 (45.0–54.0) | 9.0 (3.5–18.5) | -- | -- | 48.5 (+/- 43.5) |
| Stenellid 2, n = 90000 | 25.0 (22.5–27.0) | 4.5 (3.0–6.5) | 39.5 (35.0–44.5) | 8.5 (4.0–18.0) | -- | -- | 53.5 (+/- 0.0401) |
| 93.5 (87.5–99.5) | 10.0 (5.0–17.5) | -- | -- | -- | -- | 90.3 (+/- 41.8) | |
Parameters of click types (i.e. neural network classes) including median location of spectral peaks for all evaluated clicks as well as peak value for ICI distributions of evaluated acoustic encounters of each class. Species are organized based on overall frequency content of clicks (lowest to highest) and then by number of spectral peaks. Number of clicks evaluated is given along with class name. For spectral peaks, values in parentheses give the 10th and 90th percentiles of the data. For ICI, standard deviation from the peak (i.e. modal) value is given instead. For the false killer whale class, the ICI distribution was bimodal; in this case, two ICI values are given instead of one. The bottlenose dolphin/melon-headed whale type is abbreviated as Tt/Pe.
Validation types.
| Echolocation Click Type | Validation Type [References] |
|---|---|
| False killer whale | previous acoustic [ |
| Low-frequency 1 | previous acoustic (limited) [ |
| Short-finned pilot whale 1 | previous acoustic [ |
| Short-finned pilot whale 2 | previous acoustic [ |
| Tt/Pe | previous acoustic [ |
| Blainville’s beaked whale | previous acoustic [ |
| Cuvier’s beaked whale | previous acoustic [ |
| Stenellid 1 | previous acoustic [ |
| Stenellid 2 | previous acoustic [ |
| previous acoustic [ |
Validation sources (with references) for each echolocation click type. Validation types are previous acoustic, spatial distribution (including abundance information), temporal behavior, and auxiliary sighting/acoustic data. The Tt/Pe abbreviation corresponds to the bottlenose dolphin/melon-headed whale type.
* Validation of this type as likely rough-toothed dolphin is included in the main manuscript as this represented a novel type description. Further detail regarding all other types is available in S1 File.
Fig 2Echolocation click types.
Plots A-H depicting data from representative clicks from each of 10 final click types: (A) False killer whale, (B) Low-frequency type 1 (LF1), (C1) Short-finned pilot whale 1, (C2) Short-finned pilot whale 2, (D) Bottlenose dolphin/ melon-headed whale, (E) Blainville’s beaked whale, (F) Cuvier’s beaked whale, (G1) Stenellid 1, (G2) Stenellid 2, and (H) Kogia spp. Panels 1–4 (left to right) depict the following: (1) mean spectra, shown along with 10th and 90th percentile values, (2) modal inter-click interval distribution, (3) concatenated click spectra of all clicks included, and (4) click waveform envelope for all clicks. Click waveform envelope has been sorted by peak amplitude (highest to the left), and concatenated clicks have been sorted correspondingly. Types are ordered by peak frequency.
Fig 3Towed-array S. bredanensis encounters.
Figure depicting (a) mean spectra, (b) concatenated click spectra, and (c) an example long-term spectral average of a towed-array acoustic encounter of verified rough-toothed dolphins. Panel (a) includes the mean type spectra of the LF1 click type for comparison. Delineations in panel (b) (white lines) separate clicks coming from encounters 1–4. Panel c shows a long-term spectral average of raw data from an example encounter coming from the towed array dataset.
Fig 4Additional towed-array examples.
Long-term spectral average of two different hours of towed-array data, displaying the persistence of a notch in sensitivity at ~ 50 kHz regardless of species present. Panel (a) displays sound data from anthropogenic sources, while (b) displays both anthropogenic noise and a delphinid encounter (starting at ~ 0.5 hours).
Fig 5Long-term spectral average of an LF1 encounter.
Long-term spectral average of data from 11/13/2015 including clicks labelled as type LF1. A sighting of rough-toothed dolphins occurred about 1 hour after this encounter, approximately 5 km from the location of the HARP.
Neural network results.
| Neural Network Class | Kona | Kauaʻi | PHR | All Sites | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Accuracy (%) | Recall (%) | Precision (%) | nBins | Accuracy (%) | Recall (%) | Precision (%) | nBins | Accuracy (%) | Recall (%) | Precision (%) | nBins | Accuracy (%) | Recall (%) | Precision (%) | nBins, nClicks | |
| False killer whale | 96.2 | 88.0 |
| 324 | 98.1 | 93.4 | 75.9 | 259 | 96.6 | 96.5 |
| 86 | 96.9 | 91.2 |
| 669, 7938 |
| Rough-toothed dolphin | 98.4 | 83.1 | 83.8 | 468 | 97.3 | 96.7 | 98.9 | 4602 | 98.1 | 94.1 | 94.7 | 801 | 98 | 95.3 | 97.1 | 5871, 7293 |
| Short-finned pilot whale 1 | 98.1 | 89.6 | 89.0 | 988 | 97.8 | 92.2 | 79.1 | 230 | 98.4 | 80.6 |
| 72 | 98.1 | 89.5 | 84.9 | 1290, 13271 |
| Short-finned pilot whale 2 | 98.2 | 96.3 | 81.1 | 463 | 98.7 | 66.7 | 37.5 | 36 | 99.2 | 72.7 | 25.8 | 11 | 98.5 | 93.7 |
| 510, 95262 |
| Tt/Pe | 98.5 | 75.6 |
| 213 | 98.4 | 94.8 | 80.5 | 192 | 98.2 | 82.7 | 91.0 | 307 | 98.4 | 83.8 | 82.9 | 712, 3291 |
| Blainville’s beaked whale | 99.4 | 96.0 | 90.8 | 227 | 99.8 | 98.9 | 98.9 | 91 | 99.5 | 98.9 | 99.8 | 2008 | 99.5 | 98.6 | 98.9 | 2326, 11713 |
| Cuvier’s beaked whale | 99.7 | 83.3 | 45.5 | 6 | 99.8 | N/A | 0 | 0 | 99.7 | 97.3 | 100 | 546 | 99.7 | 97.1 | 98.5 | 552, 271561 |
| Stenellid sp. 1 | 98.4 | 97.8 | 97.9 | 5840 | 98.5 | 98.4 | 95.6 | 1219 | 98.4 | 94.5 | 90.3 | 364 | 98.4 | 97.8 | 97.1 | 7423, 9802 |
| Stenellid sp. 2 | 99.5 | 99.7 | 93.9 | 729 | 99.6 | 97.8 | 94.7 | 183 | 100 | 100 | 96.0 | 24 | 99.7 | 99.4 | 94.1 | 936, 824 |
| 99.7 | 99.6 | 93.6 | 250 | 99.6 | 100 | 91.2 | 52 | 99.4 | 100 | 52.0 | 13 | 99.6 | 99.7 | 90.2 | 315, 81444 | |
Results describing the accuracy, recall, and precision on novel, manually-labelled data for each class at each site, as well as for all sites combined. Values less than 75% with >50 bins have been bolded. NBins gives the number of manually labelled positive bins for each type. For full-site results, NClicks gives the average number of clicks in each NBin.
*Manual evaluation of all Kauaʻi data found no clicks of Cuvier’s beaked whale, so recall and precision could not be evaluated.
Classifier confusion Matrix- Kauaʻi.
| False killer whale | Rough-toothed dolphin | Short-finned pilot whale 1 | Short-finned pilot whale 2 | Tt/Pe | Blainville’s beaked whale | Cuvier’s beaked whale | Stenellid 1 | Stenellid 2 | Noise | ||
| False killer whale | 242 | 14 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
| Rough-toothed dolphin | 20 | 4450 | 38 | 7 | 29 | 0 | 1 | 44 | 9 | 0 | 4 |
| Short-finned pilot whale 1 | 5 | 9 | 212 | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 |
| Short-finned pilot whale 2 | 2 | 7 | 1 | 24 | 0 | 0 | 0 | 0 | 1 | 0 | 1 |
| Tt/Pe | 0 | 3 | 5 | 0 | 182 | 0 | 0 | 2 | 0 | 0 | 0 |
| Blainville’s beaked whale | 0 | 0 | 0 | 1 | 0 | 90 | 0 | 0 | 0 | 0 | 0 |
| Cuvier’s beaked whale | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Stenellid 1 | 0 | 4 | 1 | 3 | 8 | 0 | 0 | 1199 | 0 | 0 | 4 |
| Stenellid 2 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 179 | 0 | 0 |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 52 | 0 | |
| Noise | 50 | 12 | 10 | 28 | 2 | 1 | 0 | 9 | 0 | 5 | 1088 |
Confusion matrix showing the number of 5 minute bins (from novel Kauaʻi data not used in training/testing/validation) labelled as each class. Diagonal cells across classes show the number of correctly labelled positive bins for that class. Cell(i,j) is the number of bins of i that were labelled j by the network.
Classifier confusion Matrix- Kona.
| False killer whale | Rough-toothed dolphin | Short-finned pilot whale 1 | Short-finned pilot whale 2 | Tt/Pe | Blainville’s beaked whale | Cuvier’s beaked whale | Stenellid 1 | Stenellid 2 | Noise | ||
| False killer whale | 285 | 0 | 25 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 5 |
| Rough-toothed dolphin | 1 | 389 | 19 | 14 | 2 | 1 | 0 | 33 | 3 | 0 | 6 |
| Short-finned pilot whale 1 | 45 | 23 | 885 | 5 | 6 | 2 | 0 | 14 | 2 | 0 | 6 |
| Short-finned pilot whale 2 | 6 | 3 | 5 | 446 | 0 | 0 | 0 | 2 | 0 | 0 | 1 |
| Tt/Pe | 0 | 14 | 16 | 11 | 161 | 1 | 0 | 3 | 7 | 0 | 0 |
| Blainville’s beaked whale | 0 | 0 | 0 | 1 | 1 | 218 | 1 | 3 | 0 | 1 | 2 |
| Cuvier’s beaked whale | 0 | 0 | 0 | 0 | 0 | 0 | 5 | 0 | 0 | 0 | 1 |
| Stenellid 1 | 1 | 24 | 19 | 17 | 35 | 3 | 5 | 5713 | 16 | 1 | 6 |
| Stenellid 2 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 727 | 0 | 0 |
| 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 249 | 0 | |
| Noise | 250 | 10 | 25 | 47 | 8 | 15 | 0 | 67 | 19 | 15 | 3662 |
Confusion matrix showing the number of 5 minute bins (from novel data not used in training/testing/validation) labelled as each class. Diagonal cells across classes show the number of correctly labelled positive bins for that class. Cell(i,j) is the number of bins of i that were labelled j by the network.
Classifier confusion Matrix- Pearl and Hermes Reef.
| False killer whale | Rough-toothed dolphin | Short-finned pilot whale 1 | Short-finned pilot whale 2 | Tt/Pe | Blainville’s beaked whale | Cuvier’s beaked whale | Stenellid 1 | Stenellid 2 | Noise | ||
| False killer whale | 83 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Rough-toothed dolphin | 11 | 754 | 21 | 1 | 4 | 0 | 0 | 7 | 0 | 2 | 1 |
| Short-finned pilot whale 1 | 11 | 0 | 58 | 0 | 2 | 0 | 0 | 1 | 0 | 0 | 0 |
| Short-finned pilot whale 2 | 1 | 1 | 0 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| Tt/Pe | 0 | 33 | 0 | 0 | 254 | 0 | 0 | 20 | 0 | 0 | 0 |
| Blainville’s beaked whale | 0 | 0 | 8 | 7 | 2 | 1986 | 0 | 1 | 0 | 2 | 2 |
| Cuvier’s beaked whale | 0 | 0 | 4 | 7 | 1 | 1 | 531 | 0 | 1 | 0 | 1 |
| Stenellid 1 | 0 | 4 | 2 | 0 | 14 | 0 | 0 | 344 | 0 | 0 | 0 |
| Stenellid 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 | 0 | 0 |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 | 0 | |
| Noise | 132 | 3 | 4 | 8 | 1 | 3 | 0 | 8 | 0 | 8 | 1187 |
Confusion matrix showing the number of 5 minute bins (from novel PHR data not used in training/testing/validation) labelled as each class. Diagonal cells across classes show the number of correctly labelled positive bins for that class. Cell(i,j) is the number of bins of i that were labelled j by the network.