| Literature DB >> 34873193 |
Christian Bergler1, Alexander Gebhard2, Jared R Towers3,4, Leonid Butyrev2, Gary J Sutton3,4, Tasli J H Shaw3,4, Andreas Maier2, Elmar Nöth2.
Abstract
Biometric identification techniques such as photo-identification require an array of unique natural markings to identify individuals. From 1975 to present, Bigg's killer whales have been photo-identified along the west coast of North America, resulting in one of the largest and longest-running cetacean photo-identification datasets. However, data maintenance and analysis are extremely time and resource consuming. This study transfers the procedure of killer whale image identification into a fully automated, multi-stage, deep learning framework, entitled FIN-PRINT. It is composed of multiple sequentially ordered sub-components. FIN-PRINT is trained and evaluated on a dataset collected over an 8-year period (2011-2018) in the coastal waters off western North America, including 121,000 human-annotated identification images of Bigg's killer whales. At first, object detection is performed to identify unique killer whale markings, resulting in 94.4% recall, 94.1% precision, and 93.4% mean-average-precision (mAP). Second, all previously identified natural killer whale markings are extracted. The third step introduces a data enhancement mechanism by filtering between valid and invalid markings from previous processing levels, achieving 92.8% recall, 97.5%, precision, and 95.2% accuracy. The fourth and final step involves multi-class individual recognition. When evaluated on the network test set, it achieved an accuracy of 92.5% with 97.2% top-3 unweighted accuracy (TUA) for the 100 most commonly photo-identified killer whales. Additionally, the method achieved an accuracy of 84.5% and a TUA of 92.9% when applied to the entire 2018 image collection of the 100 most common killer whales. The source code of FIN-PRINT can be adapted to other species and will be publicly available.Entities:
Year: 2021 PMID: 34873193 PMCID: PMC8648837 DOI: 10.1038/s41598-021-02506-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1FIN-PRINT workflow including: (1) dorsal fin/saddle patch detection, (2) extraction of the detected killer whale markings, (3) valid versus invalid dorsal fin/saddle patch binary classification, and (4) multi-class killer whale individual identification.
Figure 2Bigg’s killer whale image long-tailed data distribution (2011–2018), summing up a total of 121,095 identification images, with 86,789 containing single labels, as well as 34,306 photos including multiple labels, resulting in 367 identified individuals (average number of images per individual 456, standard deviation 442). The two colored graphs visualize the number of identification images per whale in descending order w.r.t. all images, including single and multiple labels (purple curve) and those only containing a single label (green curve). Furthermore, an exemplary data point is visualized for both curves, presenting the number of identification images in relation to a selected number of whales, here for the top-100, clearly describing the exponential decline. Moreover, the number of animals at which the total amount of identification images is < 10 were marked for both curves. In total, 367 individuals were encountered across 2011–2018. Among them, 128 and 125 were found at least once in each year when considering all images and only those with single labels, respectively.
Figure 3Examples of image content which either lead to completely unusable/invalid data samples, or which make a robust and correct detection/classification much more difficult.
Human-Annotated Detection Dataset (HADD), including human-labeled dorsal fin/saddle-patch bounding boxes, as well as Extended-Annotated Detection Dataset (EADD) containing human- and machine-labeled dorsal fin/saddle-patch bounding boxes.
| Dataset | Split | ||||||
|---|---|---|---|---|---|---|---|
| Training | Validation | Test | |||||
| Samples | Samples | Samples | |||||
| Photos | % | Photos | % | Photos | % | ||
| HADDa | 2286 | 1686 | 73.8 | 300 | 13.1 | 300 | 13.1 |
| EADDb | 7511 | 5257 | 70.0 | 1127 | 15.0 | 1127 | 15.0 |
aHADD Human-Annotated Detection Dataset.
bEADD Extended-Annotated Detection Dataset.
Valid/Invalid Killer Whale Identification Dataset 2011–2017 (VIKWID11-17), a human-annotated dataset consisting of valid and invalid identification images (dorsal fin + saddle-patch), utilized to train, validate, and test VVI-DETECT, after applying the interval rule of 5 s with respect to the validation and test set.
| Dataset | Split | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Training | Validation | Test | |||||||||||
| Samples | Samples | Samples | |||||||||||
| Valid | Invalid | % | Valid | Invalid | % | Valid | Invalid | % | |||||
| VIKWID11-17a | 1590 | 700 | 509 | 1209 | 76.0 | 126 | 89 | 215 | 13.5 | 83 | 83 | 166 | 10.5 |
aVIKWID11-17 Valid/Invalid Killer Whale Identification Dataset 2011–2017.
Killer Whale Individual Dataset 2011–2017 (KWID11-17), including machine-annotated data of valid images (dorsal fin saddle-patch) for the 100 most commonly photographed individuals satisfying the data constraints (one label per image exactly one bounding box prediction), in combination with machine-annotated invalid data utilizing VVI-DETECT after applying the interval rule of 5 s.
| Dataset | Split | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Training | Validation | Test | |||||||||||
| Samples | Samples | Samples | |||||||||||
| Valid | Invalid | % | Valid | Invalid | % | Valid | Invalid | % | |||||
| KWID11-17a | 39,464 | 27,238 | 2227 | 29,465 | 74.7 | 4940 | 395 | 5,335 | 13.5 | 4279 | 385 | 4664 | 11.8 |
| KWIDE11-17b | 65,713 | 48,200 | 2226 | 50,426 | 76.7 | 7729 | 392 | 8121 | 12.4 | 6811 | 355 | 7166 | 10.9 |
Killer Whale Individual Dataset Extended 2011–2017 (KWIDE11-17) extends the KWID11-17 data archive with images of the 100 most common individuals represented in images containing more than one label and classified via the first version of FIN-IDENTIFY, trained on KWID11-17. Notice that the distribution of the invalid photos differs slightly between KWID11-17 and KWIDE11-17 due to the different data splits and subsequent effect of the 5 s interval rule. Furthermore, additional statistics regarding the number of identification images (100 classes) are reported for both datasets.
aKWID11-17 Killer Whale Individual Dataset 2011–2017—Statistics on the number of identification images (100 most common classes):
mean = 364.57, stdv = 162.91, min = 135 (T073B), max = 916 (T019B)
training stats (only valid images): mean = 272.38, stdv = 125.79, min = 107 (T073B), max = 695 (T019B)
validation stats (only valid images): mean = 49.40, stdv = 20.81, min = 16 (T073B), max = 120 (T019B)
testing stats (only valid images): mean = 42.79, stdv = 18.60, min = 8 (T121A), max = 101 (T019B).
bKWIDE11-17 Killer Whale Individual Dataset Extended 2011–2017—Statistics on the number of identification images (100 most common classes): mean = 627.40, stdv = 245.06, min = 172 (T073B), max = 1442 (T019B)
training stats (only valid images): mean = 482.00, stdv = 192.32, min = 139 (T073B), max = 1122 (T019B)
validation stats (only valid images): mean = 77.29, stdv = 28.74, min = 17 (T073B), max = 174 (T019B)
testing stats (only valid images): mean = 68.11, stdv = 26.98, min = 10 (T121A), max = 146 (T019B).
Detection results while training two versions of FIN-DETECT with respect to HADD and EADD.
| Metric | Dataset | |||
|---|---|---|---|---|
| HADDa | EADDb | |||
| Validation [%] | Test [%] | Validation [%] | Test [%] | |
| Recall | 95.0 | 89.0 | 95.1 | 94.4 |
| Precision | 80.0 | 85.0 | 94.2 | 94.1 |
| F1-Score | 86.9 | 87.0 | 94.7 | 94.2 |
| mAPc | 92.0 | 82.0 | 94.2 | 93.4 |
aHADD Human-Annotated Detection Dataset.
bEADD Extended-Annotated Detection Dataset.
cmAP mean Average Precision.
Figure 4Dorsal fin/saddle patch detection and extraction results based on randomly chosen identification images from various years (2011–2017), applying FIN-DETECT, trained on the machine-extended EADD data archive (see Table 1), and FIN-EXTRACT.
Detection results of VVI-DETECT to filter between valid versus invalid identification images (data enhancement), while training VVI-DETECT on VIKWID11-17.
| Metric | Dataset | |
|---|---|---|
| VIKWID11-17a | ||
| Validation [%] | Test [%] | |
| Recall | 97.8 | 92.8 |
| Precision | 92.6 | 97.5 |
| FPR | 5.6 | 2.4 |
| F1-Score | 95.1 | 95.1 |
| Accuracy | 95.8 | 95.2 |
aVIKWID11-17 Valid/Invalid Killer Whale Identification Dataset 2011–2017.
Figure 5Detected (FIN-DETECT) and extracted (FIN-EXTRACT) unseen identification images from 2018, which were successfully categorized and filtered as invalid identification images by VVI-DETECT, trained on VIKWID11-17, reported in Table 2.
Individual killer whale classification results (101-classes), while training two versions of FIN-IDENTIFY, using the initial KWID11-17 or KWIDE11-17 datasets.
| Metric | Dataset | |||
|---|---|---|---|---|
| KWID11-17a | KIWIDE11-17b | |||
| Validation [%] | Test [%] | Validation [%] | Test [%] | |
| Accuracy | 85.8 | 86.7 | 91.1 | 92.5 |
| TWAc | 89.0 | 89.9 | 93.4 | 94.6 |
| TUAd | 93.2 | 94.3 | 96.3 | 97.2 |
aKWID11-17 Killer Whale Individual Dataset 2011–2017.
bKWIDE11-17 Killer Whale Individual Dataset Extended 2011–2017.
cTWA Top-3 Weighted Accuracy.
dTUA Top-3 Unweighted Accuracy.
Figure 6Examples of detected (FIN-DETECT), extracted (FIN-EXTRACT), and pre-filtered (VVI-DETECT) unseen and valid killer whale identification images from 2018 which were successfully classified by FIN-IDENTIFY, trained on KWIDE11-17.