Literature DB >> 34223900

Disentangling data dependency using cross-validation strategies to evaluate prediction quality of cattle grazing activities using machine learning algorithms and wearable sensor data.

Leonardo Augusto Coelho Ribeiro1, Tiago Bresolin2, Guilherme Jordão de Magalhães Rosa2, Daniel Rume Casagrande1, Marina de Arruda Camargo Danes1, João Ricardo Rebouças Dórea2.   

Abstract

Wearable sensors have been explored as an alternative for real-time monitoring of cattle feeding behavior in grazing systems. To evaluate the performance of predictive models such as machine learning (ML) techniques, data cross-validation (CV) approaches are often employed. However, due to data dependencies and confounding effects, poorly performed validation strategies may significantly inflate the prediction quality. In this context, our objective was to evaluate the effect of different CV strategies on the prediction of grazing activities in cattle using wearable sensor (accelerometer) data and ML algorithms. Six Nellore bulls (average live weight of 345 ± 21 kg) had their behavior visually classified as grazing or not-grazing for a period of 15 d. Elastic Net Generalized Linear Model (GLM), Random Forest (RF), and Artificial Neural Network (ANN) were employed to predict grazing activity (grazing or not-grazing) using 3-axis accelerometer data. For each analytical method, three CV strategies were evaluated: holdout, leave-one-animal-out (LOAO), and leave-one-day-out (LODO). Algorithms were trained using similar dataset sizes (holdout: n = 57,862; LOAO: n = 56,786; LODO: n = 56,672). Overall, GLM delivered the worst prediction accuracy (53%) compared with the ML techniques (65% for both RF and ANN), and ANN performed slightly better than RF for LOAO (73%) and LODO (64%) across CV strategies. The holdout yielded the highest nominal accuracy values for all three ML approaches (GLM: 59%, RF: 76%, and ANN: 74%), followed by LODO (GLM: 49%, RF: 61%, and ANN: 63%) and LOAO (GLM: 52%, RF: 57%, and ANN: 57%). With a larger dataset (i.e., more animals and grazing management scenarios), it is expected that accuracy could be increased. Most importantly, the greater prediction accuracy observed for holdout CV may simply indicate a lack of data independence and the presence of carry-over effects from animals and grazing management. Our results suggest that generalizing predictive models to unknown (not used for training) animals or grazing management may incur poor prediction quality. The results highlight the need for using management knowledge to define the validation strategy that is closer to the real-life situation, i.e., the intended application of the predictive model.
© The Author(s) 2021. Published by Oxford University Press on behalf of the American Society of Animal Science. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Keywords:  accelerometer; grazing; machine learning; validation

Mesh:

Year:  2021        PMID: 34223900      PMCID: PMC8418637          DOI: 10.1093/jas/skab206

Source DB:  PubMed          Journal:  J Anim Sci        ISSN: 0021-8812            Impact factor:   3.338


  13 in total

1.  Predicting the dry matter intake of grazing dairy cows using infrared reflectance spectroscopy analysis.

Authors:  B Lahart; S McParland; E Kennedy; T M Boland; T Condon; M Williams; N Galvin; B McCarthy; F Buckley
Journal:  J Dairy Sci       Date:  2019-07-24       Impact factor: 4.034

2.  Beef cattle responses to pre-grazing sward height and low level of energy supplementation on tropical pastures.

Authors:  João R R Dórea; Vinícius N Gouvêa; Luiz Roberto D Agostinho Neto; Sila C Da Silva; Geoffrey E Brink; Alexandre V Pires; Flávio A P Santos
Journal:  J Anim Sci       Date:  2020-06-01       Impact factor: 3.159

3.  Dairy cattle behavior classifications based on decision tree learning using 3-axis neck-mounted accelerometers.

Authors:  Tomoya Tamura; Yuki Okubo; Yoshitaka Deguchi; Shizu Koshikawa; Masahiro Takahashi; Yasushi Chida; Keiji Okada
Journal:  Anim Sci J       Date:  2019-02-17       Impact factor: 1.749

4.  Prediction and validation of residual feed intake and dry matter intake in Danish lactating dairy cows using mid-infrared spectroscopy of milk.

Authors:  N Shetty; P Løvendahl; M S Lund; A J Buitenhuis
Journal:  J Dairy Sci       Date:  2016-11-17       Impact factor: 4.034

5.  Validation strategy can result in an overoptimistic view of the ability of milk infrared spectra to predict methane emission of dairy cattle.

Authors:  Qiuyu Wang; Henk Bovenhuis
Journal:  J Dairy Sci       Date:  2019-05-02       Impact factor: 4.034

6.  An artificial neural network to estimate physical activity energy expenditure and identify physical activity type from an accelerometer.

Authors:  John Staudenmayer; David Pober; Scott Crouter; David Bassett; Patty Freedson
Journal:  J Appl Physiol (1985)       Date:  2009-07-30

Review 7.  Invited review: sensors to support health management on dairy farms.

Authors:  C J Rutten; A G J Velthuis; W Steeneveld; H Hogeveen
Journal:  J Dairy Sci       Date:  2013-02-22       Impact factor: 4.034

8.  Development and application of a machine learning algorithm for classification of elasmobranch behaviour from accelerometry data.

Authors:  L R Brewster; J J Dale; T L Guttridge; S H Gruber; A C Hansell; M Elliott; I G Cowx; N M Whitney; A C Gleiss
Journal:  Mar Biol       Date:  2018-03-08       Impact factor: 2.573

9.  Identification of the Rumination in Cattle Using Support Vector Machines with Motion-Sensitive Bolus Sensors.

Authors:  Andrew W Hamilton; Chris Davison; Christos Tachtatzis; Ivan Andonovic; Craig Michie; Holly J Ferguson; Laura Somerville; Nicholas N Jonsson
Journal:  Sensors (Basel)       Date:  2019-03-07       Impact factor: 3.576

10.  BIG DATA ANALYTICS AND PRECISION ANIMAL AGRICULTURE SYMPOSIUM: Machine learning and data mining advance predictive big data analysis in precision animal agriculture.

Authors:  Gota Morota; Ricardo V Ventura; Fabyano F Silva; Masanori Koyama; Samodha C Fernando
Journal:  J Anim Sci       Date:  2018-04-14       Impact factor: 3.159

View more
  1 in total

1.  Integrating diverse data sources to predict disease risk in dairy cattle-a machine learning approach.

Authors:  Jana Lasser; Caspar Matzhold; Christa Egger-Danner; Birgit Fuerst-Waltl; Franz Steininger; Thomas Wittek; Peter Klimek
Journal:  J Anim Sci       Date:  2021-11-01       Impact factor: 3.338

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.