| Literature DB >> 35454293 |
Xiaolang Chen1, Tianlong Yang1, Kaizhan Mai1, Caixing Liu1, Juntao Xiong1, Yingjie Kuang1, Yuefang Gao1.
Abstract
In precision dairy farming, computer vision-based approaches have been widely employed to monitor the cattle conditions (e.g., the physical, physiology, health and welfare). To this end, the accurate and effective identification of individual cow is a prerequisite. In this paper, a deep learning re-identification network model, Global and Part Network (GPN), is proposed to identify individual cow face. The GPN model, with ResNet50 as backbone network to generate a pooling of feature maps, builds three branch modules (Middle branch, Global branch and Part branch) to learn more discriminative and robust feature representation from the maps. Specifically, the Middle branch and the Global branch separately extract the global features of middle dimension and high dimension from the maps, and the Part branch extracts the local features in the unified block, all of which are integrated to act as the feature representation for cow face re-identification. By performing such strategies, the GPN model not only extracts the discriminative global and local features, but also learns the subtle differences among different cow faces. To further improve the performance of the proposed framework, a Global and Part Network with Spatial Transform (GPN-ST) model is also developed to incorporate an attention mechanism module in the Part branch. Additionally, to test the efficiency of the proposed approach, a large-scale cow face dataset is constructed, which contains 130,000 images with 3000 cows under different conditions (e.g., occlusion, change of viewpoints and illumination, blur, and background clutters). The results of various contrast experiments show that the GPN outperforms the representative re-identification methods, and the improved GPN-ST model has a higher accuracy rate (up by 2.8% and 2.2% respectively) in Rank-1 and mAP, compared with the GPN model. In conclusion, using the Global and Part feature deep network with attention mechanism can effectively ameliorate the efficiency of cow face re-identification.Entities:
Keywords: GPN model; GPN-ST model; cow face re-identification; deep learning; precision dairy farming
Year: 2022 PMID: 35454293 PMCID: PMC9028456 DOI: 10.3390/ani12081047
Source DB: PubMed Journal: Animals (Basel) ISSN: 2076-2615 Impact factor: 2.752
Figure 1Generation process of cow face dataset.
Figure 2Data distribution of cow face dataset.
Figure 3Comparisons of intra-class gap and inter-class gap in dataset. Three cow face images of the same cow are in a row.
Figure 4The architecture of the proposed GPN model which cooperates with three branch modules, capturing both the global feature and the local detail to enhance the feature representation discriminability.
Figure 5Architecture of Classifier.
Figure 6Architecture of the proposed GPN-ST model. The GPN-ST model is improved in the Part branch used to extract the local regions via four STN modules.
Recognition results of different models in Gallery 1.
| Model | Rank-1 | Rank-5 | Rank-10 | mAP |
|---|---|---|---|---|
| GPN-ST (Ours) | 90.0% | 95.9% | 97.4% | 91.3% |
| GPN (Ours) | 87.2% | 95.6% | 97.4% | 89.1% |
| ResNet50 | 80.9% | 92.2% | 94.7% | 83.3% |
| MiddleNet | 82.8% | 93.8% | 95.6% | 85.5% |
| PCB | 85.3% | 94.4% | 96.2% | 87.3% |
| MGN | 86.9% | 95.5% | 97.4% | 90.8% |
Rank-1 accuracy of different models in different gallery sets (%).
| Model | Gallery 1 | Gallery 2 | Gallery 3 | Gallery 4 | Gallery 5 | Gallery 10 | Gallery 15 | Gallery 20 |
|---|---|---|---|---|---|---|---|---|
| GPN-ST (Ours) | 90.0% | 94.9% | 95.9% | 96.5% | 96.6% | 97.4% | 97.7% | 98.0% |
| GPN (Ours) | 87.2% | 92.6% | 94.7% | 95.6% | 95.7% | 97.1% | 97.4% | 97.7% |
| ResNet50 | 80.9% | 89.5% | 92.0% | 92.9% | 93.7% | 95.7% | 96.5% | 96.8% |
| MiddleNet | 82.8% | 90.2% | 92.8% | 93.9% | 94.3% | 96.1% | 96.7% | 97.0% |
| PCB | 85.3% | 91.8% | 94.0% | 94.8% | 95.4% | 96.5% | 97.4% | 97.5% |
| MGN | 86.9% | 92.1% | 94.2% | 95.1% | 95.5% | 96.7% | 97.4% | 97.5% |
mAP of different models in different gallery sets (%).
| Model | Gallery 1 | Gallery 2 | Gallery 3 | Gallery 4 | Gallery 5 | Gallery 10 | Gallery 15 | Gallery 20 |
|---|---|---|---|---|---|---|---|---|
| GPN-ST (Ours) | 91.3% | 91.5% | 91.3% | 91.4% | 91.1% | 91.2% | 91.2% | 91.0% |
| GPN (Ours) | 89.1% | 88.6% | 88.7% | 88.7% | 88.3% | 88.3% | 88.4% | 88.3% |
| ResNet50 | 83.3% | 83.6% | 83.0% | 83.0% | 82.5% | 82.6% | 82.7% | 82.6% |
| MiddleNet | 85.5% | 85.8% | 85.1% | 85.6% | 84.8% | 84.8% | 85.0% | 84.8% |
| PCB | 87.3% | 87.3% | 87.3% | 87.3% | 87.0% | 87.0% | 87.1% | 87.0% |
| MGN | 90.8% | 90.2% | 90.3% | 90.8% | 90.7% | 91.3% | 91.8% | 91.5% |
Results of different module combinations of the GPN model.
| Model | Rank-1 | Rank-5 | Rank-10 | mAP |
|---|---|---|---|---|
| GPN (Ours) | 87.2% | 95.6% | 97.4% | 89.1% |
| Only GAP | 84.4% | 94.0% | 96.4% | 86.6% |
| Only GMP | 83.8% | 94.0% | 96.4% | 86.0% |
| Simple Global | 80.9% | 92.2% | 94.7% | 83.3% |
| Simple Part | 85.3% | 94.4% | 96.2% | 87.3% |
| Global + Part | 86.1% | 94.8% | 96.4% | 88.1% |
| Global + Middle | 86.2% | 94.4% | 96.4% | 88.1% |
| Part + Middle | 85.7% | 94.9% | 96.9% | 87.7% |
Figure 7The extracted region visualization of the GPN and GPN-ST models. The left and right column of (a) separately represents the input image and the even partitions. The left and right columns of (b,c) separately represent the input images and the extracted the local regions with STN module.
Figure 8The visualization of failures in the extracted regions of the GPN-ST model. The left columns of (a,b) are the same cow with different angles. The right columns of (a,b) separately represent the extracted the local regions with STN module.