| Literature DB >> 35948988 |
Xingdong Zhou1, Ran Guan2, Hongbo Cai3, Pei Wang4, Yongchun Yang1, Xiaodu Wang5, Xiaowen Li6, Houhui Song7.
Abstract
BACKGROUND: The purpose of this study was to analyze the relationship between different productive factors and piglets weaned per sow per year (PSY) in 291 large-scale pig farms and analyze the impact of the changes in different factors on PSY. We chose nine different algorithm models based on machine learning to calculate the influence of each variable on every farm according to its current situation, leading to personalize the improvement of the impact in the specific circumstances of each farm, proposing a production guidance plan of PSY improvement for every farm. According to the comparison of mean absolute error (MAE), 95% confidence interval (CI) and R2, the optimal solution was conducted to calculate the influence of 17 production factors of each pig farm on PSY improvement, finding out the bottleneck corresponding to each pig farm. The level of PSY was further analyzed when the bottleneck factor of each pig farm changed by 0.5 standard deviation (SD).Entities:
Keywords: Correlation coefficient; Gradient boosting regressor model; Machine learning; PSY; Personalized improvement
Year: 2022 PMID: 35948988 PMCID: PMC9364547 DOI: 10.1186/s40813-022-00280-z
Source DB: PubMed Journal: Porcine Health Manag ISSN: 2055-5660
Fig. 1Relationship between 17 production factors and PSY in 291 large-scale pig farms
Fig. 2Correlation coefficient matrix of 18 production factors in 291 large-scale pig farms
Evaluation factors and 95% confidence interval of different models
| Model | MAE | 95% CI | R2 | MSE | MAPE |
|---|---|---|---|---|---|
| Gradient boosting regressor | 1.6047 | 4.262 | 0.7432 | 5.0377 | 10.70% |
| Hist gradient boosting regressor | 1.6211 | 4.5973 | 0.7399 | 4.8847 | 10.68% |
| Extra trees regressor | 1.6318 | 4.488 | 0.7353 | 5.0112 | 10.84% |
| Random forest regressor | 1.7397 | 4.642 | 0.7103 | 5.5809 | 11.64% |
| Bayesian ridge | 1.7691 | 4.6014 | 0.7098 | 5.5517 | 11.73% |
| Linear regression | 1.7744 | 4.8353 | 0.7048 | 5.6470 | 11.76% |
| Bagging regressor | 1.8329 | 5.3001 | 0.7034 | 6.0023 | 12.06% |
| Ada Boost regressor | 2.0326 | 5.08 | 0.6679 | 6.4055 | 12.84% |
| Elastic net | 2.36 | 6.0404 | 0.5354 | 8.8888 | 15.38% |
MAE = mean absolute error; CI = confidence interval; MSE = mean square error; MAPE = mean absolute percentage error
Fig. 3Scatter plot of actual versus predicted values of PSY derived from gradient boosting regressor model. The solid line represents the regression curve, and dotted lines represent the 95% confidence interval
Personalized bottleneck calculated by gradient boosting regressor model*
| Factors | Number of pig farms | The proportion of pig farms | 0.5 SD | Average | Median | Max | Min |
|---|---|---|---|---|---|---|---|
| Production days | 101 | 34.7% | 163.60 | 1.41 | 1.44 | 2.92 | 0.29 |
| Number of weaned piglets per litter | 60 | 20.6% | 0.46 | 1.14 | 1.15 | 1.97 | 0.22 |
| Mating rate within 7 days after weaning | 45 | 15.5% | 6.99% | 1.63 | 1.58 | 3.31 | 0.50 |
| Farrowing rate | 24 | 8.2% | 4.06% | 1.01 | 1.07 | 1.76 | 0.28 |
| Designed stock | 17 | 5.8% | 299.90 | 1.33 | 1.29 | 2.37 | 0.61 |
| Number of piglets born alive per litter | 15 | 5.2% | 0.40 | 1.04 | 1.05 | 1.45 | 0.61 |
| Weaning to breeding interval | 11 | 3.8% | 4.39 | 1.84 | 1.82 | 2.57 | 1.16 |
| Non-productive days | 4 | 1.4% | 14.15 | 1.35 | 1.06 | 2.93 | 0.35 |
| Actual stock | 3 | 1.0% | 300.01 | 1.56 | 1.50 | 2.74 | 0.42 |
| Full load rate | 3 | 1.0% | 4.58% | 0.32 | 0.32 | 0.36 | 0.28 |
| Birth weight of piglets | 2 | 0.7% | 0.06 | 0.67 | 0.67 | 1.01 | 0.33 |
| Mummified piglet rate | 2 | 0.7% | 0.36% | 0.27 | 0.27 | 0.38 | 0.15 |
| Total number of piglets per litter | 2 | 0.7% | 0.40 | 0.80 | 0.80 | 1.21 | 0.38 |
| 21-day adjusted weight of piglets | 1 | 0.3% | 0.18 | 0.34 | 0.34 | 0.34 | 0.34 |
| Stillbirth rate | 1 | 0.3% | 1.01% | 0.28 | 0.28 | 0.28 | 0.28 |
For PSY improvement of 291 large-scale pig farms, each factor was increased by the absolute value of 0.5 standard deviation
SD = standard deviation
*Return-service rate and farm type did not affect the PSY improvement by gradient boosting regressor