| Literature DB >> 36172463 |
L Heinen1, P A Lancaster1, B J White1, E Zwiefel2.
Abstract
Changes in feeding behavior and intake have been used to predict the onset of bovine respiratory disease in individual animals but have not been applied to cohort-level data. Correctly identifying high morbidity cohorts of cattle early in the feeding period could facilitate the administration of interventions to improve health and economic outcomes. The study objective was to determine the ability of feed delivery data from the first 15 days of feed to predict total feeding period morbidity. Data consisted of 518 cohorts (10 feedlots, 56,796 animals) of cattle of varying sex, age, arrival weight, and arrival time of year over a 2-year period. Overall cohort-level morbidity was classified into high (≥15% total morbidity) or low categories with 18.5% of cohorts having high morbidity. Five predictive models (advanced perceptron, decision forest, logistic regression, neural network, and boosted decision tree) were created to predict overall morbidity given cattle characteristics at arrival and feeding characteristics from the first 15 days. The dataset was split into training and testing subsets (75% and 25% of original, respectively), stratified by the outcome of interest. Predictive models were generated in Microsoft Azure using the training set and overall predictive performance was evaluated using the testing set. Performance in the testing set (n = 130) was measured based on final accuracy, sensitivity (Sn, the ability to accurately detect high morbidity cohorts), and specificity (Sp, the ability to accurately detect low morbidity cohorts). The decision forest had the highest Sp (97%) with the greatest ability to accurately identify low morbidity lots (103 of 106 identified correctly), but this model had low Sn (33%). The logistic regression and neural network had similar Sn (both 63%) and Sp (69% and 72%, respectively) with the best ability to correctly identify high morbidity cohorts (15 of 24 correctly identified). Predictor variables with the greatest importance in the predictive models included percent change in feed delivery between days and 4-day moving averages. The most frequent variable with a high level of importance among models was the percent change in feed delivered from d 2 to 3 after arrival. In conclusion, feed delivery data during the first 15 days on feed was a significant predictor of total cohort-level morbidity over the entire feeding period with changes in feed delivery providing important information.Entities:
Keywords: feeding patterns; machine learning; predictive analytics
Year: 2022 PMID: 36172463 PMCID: PMC9512095 DOI: 10.1093/tas/txac121
Source DB: PubMed Journal: Transl Anim Sci ISSN: 2573-2102
Figure 1.Timeline schematic demonstrating the feed delivery data corresponding to various feeding predictor variables. Triangles indicate data on dry matter delivered as percentage of arrival body weight (DMI-BW). Arrows indicate data on percent change in DMI-BW from day to day.
Complete overview of variables included in the final dataset to train and test the five predictive models
| Variable category | Variable name | Description of variable |
|---|---|---|
| Feeding variables | DMI-BW | Feed intake measured by DMI-BW for each day starting on the day of arrival (0) to day 15 (16 total measurements) |
| Percent change in DMI-BW from one day to the next for days 0 through 15 | Percent change in feed intake measured by DMI-BW between sequential days (15 total measurements) | |
| 2-day increment rolling averages of percent change in DMI-BW | Rolling averages in 2-day increments of percent change in DMI-BW (14 total measurements) | |
| 3-day increment rolling averages of percent change in DMI-BW | Rolling averages in 3-day increments of percent change in DMI-BW (13 total measurements) | |
| 4-day increment rolling averages of percent change in DMI-BW | Rolling averages in 4-day increments of percent change in DMI-BW (12 total measurements) | |
| 5-day increment rolling averages of percent change in DMI-BW | Rolling averages in 5-day increments of percent change in DMI-BW (11 total measurements) | |
| 6-day increment rolling averages of percent change in DMI-BW | Rolling averages in 6-day increments of percent change in DMI-BW (10 total measurements) | |
| 7-day increment rolling averages of percent change in DMI-BW | Rolling averages in 7-day increments of percent change in DMI-BW (9 total measurements) | |
| Arrival characteristics | Arrival date | Date of arrival for the cohort, format: MM/DD/YYYY |
| Average arrival weight | Average weight at arrival of the cohort in pounds | |
| Sex | Sex of the cohort, could be heifer, steer, mixed | |
| Arrival animal count | Number of animals in the cohort upon arrival | |
| Outcome variable | Total morbidity category | High (≥ 15%) or low (< 15%) based on morbidity for any diagnosis as a percentage of arrival animal count |
DMI-BW indicates the feed intake on a dry matter basis given as a percentage of the average arrival weight of the cohort.
Figure 2.Illustration of data management, and model training and evaluation process.
Figure 3.Receiver operating characteristic (ROC) curves for five predictive models trained to predict high (≥15%) morbidity cohorts of feedlot cattle. The five predictive models are Advanced Perceptron, Logistic Regression, Neural Network, Decision Tree, and Decision Forest. Perf. Pred. represents the perfect predictive model.
Model evaluation of five predictive models trained to predict high (≥15%) morbidity cohorts of feedlot cattle
| Model | AUC | Acc (%) | Sn (%) | Sp (%) | PPV | NPV |
|---|---|---|---|---|---|---|
| Advanced perceptron | 0.653 | 18.5 | 100.0 | 0.0 | 0.18 | — |
| Logistic regression | 0.675 | 67.7 | 62.5 | 68.9 | 0.31 | 0.89 |
| Neural network | 0.691 | 70.0 | 62.5 | 71.7 | 0.33 | 0.89 |
| Decision tree | 0.691 | 78.5 | 29.2 | 89.6 | 0.39 | 0.85 |
| Decision forest | 0.671 | 85.4 | 33.3 | 97.2 | 0.73 | 0.87 |
AUC, area under the receiver operator characteristic (ROC) curve; Acc, overall accuracy; Sn, sensitivity, ability to predict high morbidity cohorts; Sp, specificity, ability to predict low morbidity cohorts; PPV, positive predictive value, probability that predicted high morbidity cohorts are truly high morbidity cohorts; NPV, negative predictive value, probability that predicted low morbidity cohorts are truly low morbidity cohorts.