| Literature DB >> 35739889 |
Yujie Lei1,2,3,4, Ying Xiang3,4, Yuhui Zhu3, Yan Guan3,4, Yu Zhang3, Xiao Yang3, Xiaoli Yao3, Tingxuan Li3, Meng Xie5, Jiong Mu3,4, Qingyong Ni1,2.
Abstract
The slow loris (Genus Nycticebus) is a group of small, nocturnal and venomous primates with a distinctive locomotion mode. The detection of slow loris plays an important role in the subsequent individual identification and behavioral recognition and thus contributes to formulating targeted conservation strategies, particularly in reintroduction and post-release monitoring. However, fewer studies have been conducted on efficient and accurate detection methods of this endangered taxa. The traditional methods to detect the slow loris involve long-term observation or watching surveillance video repeatedly, which would involve manpower and be time consuming. Because humans cannot maintain a high degree of attention for a long time, they are also prone to making missed detections or false detections. Due to these observational challenges, using computer vision to detect slow loris presence and activity is desirable. This article establishes a novel target detection dataset based on monitoring videos of captive Bengal slow loris (N. bengalensis) from the wildlife rescue centers in Xishuangbanna and Pu'er, Yunnan, China. The dataset is used to test two improvement schemes based on the YOLOv5 network: (1) YOLOv5-CBAM + TC, the attention mechanism and deconvolution are introduced; (2) YOLOv5-SD, the small object detection layer is added. The results demonstrate that the YOLOv5-CBAM + TC effectively improves the detection effect. At the cost of increasing the model size by 0.6 MB, the precision rate, the recall rate and the mean average precision (mAP) are increased by 2.9%, 3.7% and 3.5%, respectively. The YOLOv5-CBAM + TC model can be used as an effective method to detect individual slow loris in a captive environment, which helps to realize slow loris face and posture recognition based on computer vision.Entities:
Keywords: Nycticebus; animal protection; behavior recognition; computer vision; object detection
Year: 2022 PMID: 35739889 PMCID: PMC9219483 DOI: 10.3390/ani12121553
Source DB: PubMed Journal: Animals (Basel) ISSN: 2076-2615 Impact factor: 3.231
Information about the wildlife rescue centers and the captive Bengal slow lorises.
| Captive Site | Xishuangbanna | Puer |
|---|---|---|
| Coordinate | 22.39276° N, 100.89636° E | 22.62198° N, 101.08916° E |
| Altitude (m) | 1060 | 1600 |
| Annual mean temperature (°C) | 17.5 | 17.5 |
| No. of individuals | 9 | 9 |
| No. of enclosures | 1 | 1 |
| Enclosure size (L × W × H) (m) | 5.7 × 4.2 × 3.5 | 3.5 × 2.1 × 2.0 |
| No. of cameras | 3 | 2 |
Figure 1Partial images in the dataset.
Figure 2Example diagram of slow loris states.
Figure 3YOLOv5 structure diagram.
Figure 4CBAM structure diagram.
Figure 5The network structure of YOLOv5-CBAM + TC.
Parameter definitions.
| Confusion Matrix | Predicted Results | ||
|---|---|---|---|
| Positive | Negative | ||
| Expected Results | positive | TP 1 | FN 2 |
| negative | FP 3 | TN 4 | |
1 True positive (TP): The prediction result is positive, and the prediction is correct. 2 False negative (FN): The prediction result is negative, but the prediction is incorrect. 3 False positive (FP): The prediction result is positive, but the prediction is incorrect. 4 True negative (TN): The prediction result is negative, and the prediction is correct.
Figure 6Experimental results of the three models: (a) precision; (b) recall; (c) mAP@0.5; (d) mAP@0.5:0.95.
Experimental information of the three models.
| Model | Category | Precision | Recall | mAP@0.5 | mAP@0.5:0.95 | Model Size |
|---|---|---|---|---|---|---|
| YOLOv5 | All | 0.936 | 0.916 | 0.934 | 0.609 | 14.4 MB |
| Single | 0.955 | 0.926 | 0.956 | 0.554 | ||
| Socializing | 0.923 | 0.906 | 0.913 | 0.665 | ||
| YOLOv5-SD | All | 0.931 | 0.943 | 0.95 | 0.572 | 16.4 MB |
| Single | 0.912 | 0.924 | 0.955 | 0.523 | ||
| Socializing | 0.951 | 0.963 | 0.944 | 0.622 | ||
| YOLOv5-CBAM + TC | All | 0.965 | 0.953 | 0.969 | 0.642 | 15.0 MB |
| Single | 0.956 | 0.943 | 0.964 | 0.568 | ||
| Socializing | 0.974 | 0.963 | 0.973 | 0.716 |
Figure 7Comparison of YOLOv5-CBAM + TC and YOLOv5 prediction results: (a) true labels; (b) YOLOv5; (c) YOLOv5-CBAM+TC.
Comparison between mainstream detection algorithms.
| Category | YOLOv5-CBAM + TC | SSD | CenterNet | Faster-RCNN |
|---|---|---|---|---|
| mAP | 0.969 | 0.911 | 0.889 | 0.939 |
Figure 8Detection effect of YOLOv5-CBAM + TC.