| Literature DB >> 35812846 |
Tom Menaker1, Joke Monteny2, Lin Op de Beeck2, Anna Zamansky1.
Abstract
Traditional methods of data analysis in animal behavior research are usually based on measuring behavior by manually coding a set of chosen behavioral parameters, which is naturally prone to human bias and error, and is also a tedious labor-intensive task. Machine learning techniques are increasingly applied to support researchers in this field, mostly in a supervised manner: for tracking animals, detecting land marks or recognizing actions. Unsupervised methods are increasingly used, but are under-explored in the context of behavior studies and applied contexts such as behavioral testing of dogs. This study explores the potential of unsupervised approaches such as clustering for the automated discovery of patterns in data which have potential behavioral meaning. We aim to demonstrate that such patterns can be useful at exploratory stages of data analysis before forming specific hypotheses. To this end, we propose a concrete method for grouping video trials of behavioral testing of animal individuals into clusters using a set of potentially relevant features. Using an example of protocol for testing in a "Stranger Test", we compare the discovered clusters against the C-BARQ owner-based questionnaire, which is commonly used for dog behavioral trait assessment, showing that our method separated well between dogs with higher C-BARQ scores for stranger fear, and those with lower scores. This demonstrates potential use of such clustering approach for exploration prior to hypothesis forming and testing in behavioral research.Entities:
Keywords: Data Science; animal behavior; behavioral testing; clustering; machine learning
Year: 2022 PMID: 35812846 PMCID: PMC9260587 DOI: 10.3389/fvets.2022.884437
Source DB: PubMed Journal: Front Vet Sci ISSN: 2297-1769
Figure 1A high level overview of the general approach.
Figure 2(A) detection; (B) trajectory extraction.
Features for stranger test.
|
|
|
|
|---|---|---|
| Time until approach | Seconds | Time from start to first approach of |
| Duration of approach | Seconds | Time from start of the approach to |
| Speed of first | Pixels/Seconds | Average speed of first approach |
| Trajectory length | Pixels | Length of all traveled trajectory |
| Trajectory length | Pixels | Length of the trajectory from start to |
| Area | Pixels2 | Approximation of the area |
| covered by the dog, using convex | ||
| Intensity of use | Integer | Ratio between the total trajectory |
| Total contact | Seconds | Time spent in proximity to stranger |
| Straightness | Decimal | Ratio between the distance from |
| Straightness until | Decimal | Ratio between the distance from |
| Contact ratio | Decimal | Percentage of frames in proximity to |
Figure 3Descriptive statistics (before normalization and scaling).
Generated clustering scenarios list.
|
|
|
|
| ||
|---|---|---|---|---|---|
|
| 1 | 0.537 | 4 | 2 | 21 - 14(c-0), 7(c-3) |
|
| 2 | 0.324 | 4 | 3 | 27 - 11 (c-0), 8 (c-1), 8 (c-3) |
| 3 | 3 | 0.272 | 5 | 2 | 23 - 8 (c-0), 14 (c-1) |
| 4 | 4 | 0.255 | 6 | 3 | 24 - 6 (c-0), 10 (c-1), 8 (c-3) |
Bold are the chosen clusters for analysis.
Figure 4Patterns of Scenario 1: Cluster 0 red dots, Cluster 3 green dots.
Figure 5Results of clustering scenario 1 along the axes of total contact and duration of approach.
Figure 6Results of clustering scenario 2 along the axes of contact ratio and area.
Figure 7C-BARQ factors descriptive statistics for scenario 1.
Figure 10PS comparison between C0 and C3 (scenario 1), dotted line is the median, solid line in the box is the mean.