| Literature DB >> 29563648 |
L R Brewster1,2,3, J J Dale4, T L Guttridge1, S H Gruber1,5, A C Hansell6, M Elliott2, I G Cowx3, N M Whitney7, A C Gleiss8.
Abstract
Discerning behaviours of free-ranging animals allows for quantification of their activity budget, providing important insight into ecology. Over recent years, accelerometers have been used to unveil the cryptic lives of animals. The increased ability of accelerometers to store large quantities of high resolution data has prompted a need for automated behavioural classification. We assessed the performance of several machine learning (ML) classifiers to discern five behaviours performed by accelerometer-equipped juvenile lemon sharks (Negaprion brevirostris) at Bimini, Bahamas (25°44'N, 79°16'W). The sharks were observed to exhibit chafing, burst swimming, headshaking, resting and swimming in a semi-captive environment and these observations were used to ground-truth data for ML training and testing. ML methods included logistic regression, an artificial neural network, two random forest models, a gradient boosting model and a voting ensemble (VE) model, which combined the predictions of all other (base) models to improve classifier performance. The macro-averaged F-measure, an indicator of classifier performance, showed that the VE model improved overall classification (F-measure 0.88) above the strongest base learner model, gradient boosting (0.86). To test whether the VE model provided biologically meaningful results when applied to accelerometer data obtained from wild sharks, we investigated headshaking behaviour, as a proxy for prey capture, in relation to the variables: time of day, tidal phase and season. All variables were significant in predicting prey capture, with predations most likely to occur during early evening and less frequently during the dry season and high tides. These findings support previous hypotheses from sporadic visual observations.Entities:
Year: 2018 PMID: 29563648 PMCID: PMC5842499 DOI: 10.1007/s00227-018-3318-y
Source DB: PubMed Journal: Mar Biol ISSN: 0025-3162 Impact factor: 2.573
Juvenile lemon sharks that exhibited the five behaviours for classification during semi-captive trials for development of an acceleration ethogram
| PIT tag ID | Sex | Total length (cm) | Weight (kg) |
|---|---|---|---|
| 985121031792723 | Female | 82.6 | 3.75 |
| 4C4A2D3A12 | Female | 80.5 | 3.10 |
| 4C3A6C313A | Male | 79.2 | 3.15 |
| 4C3B312275 | Male | 85.2 | 3.75 |
Wild lemon sharks tagged with the accelerometer data logger/acoustic transmitter package. Seasons are split into wet (April–September; n = 10) and dry season (October–March; n = 10)
| Pit tag# | Season | Sex | Dates | Total length (TL) (cm) | Weight (kg) |
|---|---|---|---|---|---|
| 4A0A043D40a | Wet | F | 29/07/12–03/08/12 | 77.5 | 3.25 |
| 4A73536511 | Wet | F | 29/07/12–03/08/12 | 83.6 | 3.00 |
| 4A66401437 | Wet | F | 29/07/12–03/08/12 | 78.1 | 2.80 |
| 4A44545C6C | Wet | M | 31/08/12–05/09/12 | 82.4 | 3.25 |
| 4A63380105b | Wet | F | 31/08/12–05/09/12 | 81.4 | 2.75 |
| 4C3B211816 | Wet | F | 31/08/12–05/09/12 | 81.0 | 2.10 |
| 4B7B473332 | Wet | F | 31/08/12–05/09/12 | 83.1 | 3.40 |
| 4A68061232 | Wet | M | 31/08/12–05/09/12 | 74.3 | 2.60 |
| 4C3B086000 | Wet | M | 31/08/12–05/09/12 | 76.0 | 2.10 |
| 985121031823859 | Wet | F | 29/08/14–03/09/14 | 77.5 | 2.30 |
| 4B7B442028 | Dry | M | 12/01/13–17/01/13 | 80.6 | 2.75 |
| 4A63380105b | Dry | F | 12/01/13–17/01/13 | 82.5 | 3.50 |
| 4A603C232D | Dry | M | 12/01/13–17/01/13 | 87.3 | 3.25 |
| 4C3B2A712D | Dry | F | 26/03/14–31/03/14 | 88.3 | 3.17 |
| 4A0A043D40a | Dry | F | 26/03/14–31/03/14 | 89.0 | 4.10 |
| 4C3B032B0C | Dry | F | 26/03/14–31/03/14 | 86.5 | 3.10 |
| 4A5A577669 | Dry | M | 08/11/14–13/11/14 | 88.4 | 3.60 |
| 4B7B464873 | Dry | M | 08/11/14–13/11/14 | 87.1 | 3.50 |
| 4C497D6463 | Dry | F | 08/11/14–13/11/14 | 78.4 | 2.40 |
| 4C4A736341 | Dry | F | 08/11/14–13/11/14 | 81.7 | 3.00 |
a,bIndicate individuals tagged during both seasons
Fig. 1a Examples of the five behaviours for classification. Overall dynamic body acceleration (ODBA) is calculated as the sum of the absolute values of dynamic acceleration from the three axes. b Dynamic acceleration in the three orthogonal axes: sway (blue), heave (red) and surge (grey) during each behaviour and c corresponding wavelet spectrum generated from the sway axis showing increased signal strength amplitude during the burst and headshake event
Features extracted from acceleration data loggers and used to train the base learner classifiers (see Zheng et al. 2013 for equations)
| Parameter | Label | Definition |
|---|---|---|
| Static acceleration | Xstat, Ystat, Zstat | Static acceleration for each axis reflective of body orientation |
| Dynamic acceleration | Xdyn, Ydyn, Zdyn | 1 s means of body movement generated acceleration in each axis |
| Overall Dynamic Body Acceleration | ODBA | Sum of the absolute values from the three dynamic axis |
| Amplitude | Amp | Amplitude of the signal derived from the sway axis body movement |
| Frequency | Hz | Dominant tailbeat frequency from lateral acceleration |
| Standard deviation | XstatSD, YstatSD, ZstatSD, XdynSD, YdynSD, ZdynSD, ODBASD | Standard deviation of static and dynamic acceleration measures in each axis |
| Skewness | XstatSkew, YstatSkew, ZstatSkew, XdynSkew, YdynSkew, ZdynSkew, ODBASkew | A measure of the symmetry of the feature vector |
| Kurtosis | XstatKurt, YstatKurt, ZstatKurt, XdynKurt, YdynKurt, ZdynKurt, ODBAKurt | A measure of the tail shape of the feature vector |
| Maximum | XstatMax, YstatMax, ZstatMax, XdynMax, YdynMax, ZdynMax, ODBAMax | Maximum values per second for dynamic and static acceleration in each axis and for ODBA |
| Minimum | XstatMin, YstatMin, ZstatMin, XdynMin, YdynMin, ZdynMin, ODBAMin | Minimum values per second for dynamic and static acceleration in each axis and for ODBA |
| Frequencies from wavelet spectra | X.values | Amplitude for the relevant frequency obtained through the continuous wavelet transformation generated spectrogram |
Static acceleration was calculated from the raw acceleration using 3-s box smoothing, leaving dynamic acceleration remaining. Overall dynamic body acceleration (ODBA) is calculated as the sum of the absolute values of dynamic acceleration from the three axes
Covariates included in binomial generalized additive mixed model investigating headshake events in juvenile lemon sharks in Bimini, Bahamas
| Variable | Range | Description | Variable Type |
|---|---|---|---|
| Time of day | 0–23 h | 24-h day | Cyclic smoother |
| Season | Dry/wet | Season sharks were tagged | Categorical |
| Tidal phase | Ebb–Low–Flood–High | Tidal phase based on NOAA’s tidal charts. High and low tide were categorised as one hour either side of event | Categorical |
| Shark ID | 1–20 | Influence of individual shark | Random effect |
Fig. 2An example from the sway acceleration axis of a 63 s prey manipulation event, consisting of three headshakes (HS; totalling 19 s) and a brief burst event
Confusion matrix generated for the test set of the ground-truthed data
| Model | Predicted behaviours | |||||||
|---|---|---|---|---|---|---|---|---|
| Class | Swim | HS | Rest | Chafe | Burst | Class error | ||
| Actual behaviours | LR | Swim |
| 6 | 2 | 8 | 0 | 0.002 |
| HS | 4 |
| 0 | 1 | 2 | 0.304 | ||
| Rest | 15 | 0 |
| 0 | 0 | 0.170 | ||
| Chafe | 3 | 3 | 0 |
| 0 | 0.111 | ||
| Burst | 0 | 8 | 0 | 0 |
| 0.800 | ||
| ANN | Swim |
| 2 | 1 | 2 | 0 | 0.001 | |
| HS | 6 |
| 0 | 0 | 6 | 0.522 | ||
| Rest | 7 | 0 |
| 0 | 0 | 0.080 | ||
| Chafe | 6 | 3 | 0 |
| 1 | 0.185 | ||
| Burst | 0 | 2 | 0 | 0 |
| 0.200 | ||
| RFG | Swim |
| 20 | 0 | 9 | 2 | 0.004 | |
| HS | 1 |
| 0 | 1 | 2 | 0.174 | ||
| Rest | 3 | 0 |
| 0 | 0 | 0.034 | ||
| Chafe | 2 | 2 | 0 |
| 1 | 0.093 | ||
| Burst | 0 | 3 | 0 | 0 |
| 0.300 | ||
| RFE | Swim |
| 24 | 0 | 8 | 0 | 0.005 | |
| HS | 1 |
| 0 | 1 | 3 | 0.217 | ||
| Rest | 3 | 0 |
| 0 | 0 | 0.034 | ||
| Chafe | 2 | 3 | 0 |
| 1 | 0.111 | ||
| Burst | 0 | 3 | 0 | 0 |
| 0.300 | ||
| GB | Swim |
| 2 | 0 | 3 | 0 | 0.001 | |
| HS | 5 |
| 0 | 1 | 1 | 0.304 | ||
| Rest | 4 | 0 |
| 0 | 0 | 0.045 | ||
| Chafe | 2 | 0 | 0 |
| 1 | 0.056 | ||
| Burst | 5 | 0 | 0 | 0 |
| 0.500 | ||
| VE | Swim |
| 2 | 0 | 3 | 0 | 0.001 | |
| HS | 4 |
| 0 | 1 | 1 | 0.261 | ||
| Rest | 4 | 0 |
| 0 | 0 | 0.045 | ||
| Chafe | 2 | 0 | 0 |
| 1 | 0.056 | ||
| Burst | 2 | 1 | 0 | 0 |
| 0.300 | ||
Rows indicate actual observations and columns represent predicted behaviours
Values in italic are correctly classified behavioural observations
LR logistic regression, ANN artificial neural network, RFG random forest Gini, RFE random forest entropy, GB gradient tree boosting, VE voting ensemble, HS headshakes
Performance metrics of base learner models and voting ensemble model
| Model | Class | TP | FP | FN | Precision | Recall | Class | Macro-averaged |
|---|---|---|---|---|---|---|---|---|
| LR | Swim | 6984 | 22 | 16 | 0.997 | 0.998 | 0.997 | 0.723 |
| HS | 16 | 17 | 7 | 0.485 | 0.696 | 0.571 | ||
| Rest | 73 | 2 | 15 | 0.973 | 0.830 | 0.896 | ||
| Chafe | 48 | 9 | 6 | 0.842 | 0.889 | 0.865 | ||
| Burst | 2 | 2 | 8 | 0.500 | 0.200 | 0.286 | ||
| ANN | Swim | 6995 | 19 | 5 | 0.997 |
| 0.998 | 0.802 |
| HS | 11 | 7 | 12 | 0.611 | 0.478 | 0.537 | ||
| Rest | 81 | 1 | 7 | 0.988 | 0.920 | 0.953 | ||
| Chafe | 44 | 2 | 10 |
| 0.815 | 0.880 | ||
| Burst | 8 | 7 | 2 | 0.533 |
| 0.640 | ||
| RFG | Swim | 6969 | 6 | 31 |
| 0.996 | 0.997 | 0.810 |
| HS | 19 | 25 | 4 | 0.432 |
| 0.567 | ||
| Rest | 85 | 0 | 3 |
|
|
| ||
| Chafe | 49 | 10 | 5 | 0.831 | 0.907 | 0.867 | ||
| Burst | 7 | 5 | 3 | 0.583 | 0.700 | 0.636 | ||
| RFE | Swim | 6968 | 6 | 32 |
| 0.995 | 0.997 | 0.804 |
| HS | 18 | 30 | 5 | 0.375 | 0.783 | 0.507 | ||
| Rest | 85 | 0 | 3 |
|
|
| ||
| Chafe | 48 | 9 | 6 | 0.842 | 0.889 | 0.865 | ||
| Burst | 7 | 4 | 3 | 0.636 | 0.700 | 0.667 | ||
| GB | Swim | 6995 | 16 | 5 | 0.998 |
| 0.999 | 0.856 |
| HS | 16 | 2 | 7 |
| 0.696 | 0.780 | ||
| Rest | 84 | 0 | 4 |
| 0.955 | 0.977 | ||
| Chafe | 51 | 4 | 3 | 0.927 |
| 0.936 | ||
| Burst | 5 | 2 | 5 | 0.714 | 0.500 | 0.588 | ||
| VE | Swim | 6995 | 12 | 5 | 0.998 |
|
|
|
| HS | 17 | 3 | 6 | 0.850 | 0.739 |
| ||
| Rest | 84 | 0 | 4 |
| 0.955 | 0.977 | ||
| Chafe | 51 | 4 | 3 | 0.927 |
|
| ||
| Burst | 7 | 2 | 3 |
| 0.700 |
|
The values in italic show optimum values for each metric
LR logistic regression, ANN artificial neural network, RFG random forest Gini, RFE random forest entropy, GB gradient tree boosting, VE voting ensemble, HS headshake class; TP true positive, FP false positive, FN false negative
Log-likelihoods scores for models investigating the occurrence of headshakes in lemon sharks in Bimini, Bahamas
| Covariates | Log-likelihood |
|---|---|
| s(Hour) + factor(Tide) + (Season) | − 1265.815a |
| s(Hour) + factor(Season) | − 1268.462 |
| s(Hour) + factor(Tide) | − 1269.209 |
| s(Hour) | − 1271.833 |
| factor(Tide) + (Season) | − 1300.566 |
| factor(Tide) | − 1304.014 |
| factor(Season) | − 1311.745 |
aOptimal model
Results of the final binomial generalized additive model investigating the presence of headshaking by lemon sharks in Bimini, Bahamas
| Covariate | edf | ref.df | X2 | |||
|---|---|---|---|---|---|---|
| Hour | 5.903 | 8 | 73.82 | ≤ 0.05 |
Outcomes of the smoother hour include: covariate, effective degrees of freedom (edf), reference degrees of freedom (ref.df), Chi squared value (χ2), p value. Outcomes of factors include covariate, level, coefficient, standard error (SE), z value and p value. The overall adjusted R2 value is also displayed
Fig. 3Estimated smoother for the effect of hour of day on the probability of headshaking behaviour occurring by the juvenile lemon shark in Bimini, Bahamas. The lowest and highest probabilities of a headshake occurring are around 0800 and 1700 h, respectively. Estimates are based on final binomial generalized additive mixed model. The solid line is the smoother. Dark grey shaded area surrounding the smoother represent 95% confidence intervals. The light grey shaded area represents the range of sunset times throughout the deployments. The dashed line represents the mean likelihood of a headshaking occurring. The blue dots represent mean hourly temperature (°C), calculated from the temperature sensor in the acceleration data logger (ADL) packages, across all deployments