Sherif A Moawed, Ayman H Abd El-Aziz.
Abstract
The incorporation of novel technologies such as artificial intelligence, data mining, and advanced statistical methodologies has received wide attention from researchers. This study was designed to model the factors affecting the actual milk yield of Holstein-Friesian cows using the proportional odds ordered logit model (OLM). A total of 8300 lactation records were collected for cows that calved between 2005 and 2019. The actual milk yield, the outcome variable, was categorized into three levels: low (< 4500 kg), medium (4500-7500 kg), and high (> 7500 kg). The predictor variables studied were age at first calving (AFC), lactation order (LO), days open (DO), lactation period (LP), peak milk yield (PMY), and dry period (DP). The proportional odds assumption under the logit link function was verified for the current dataset. The goodness-of-fit measures indicated that the ordered logit model suited the structure of the dataset. Results showed that cows that were younger at first calving had about twice the odds of a higher milk yield. Also, longer days open were associated with higher milk yield. The highest milk yield was associated with longer lactation periods (> 250 days). Peak milk yield (kg) was significantly related to the actual yield (P < 0.05). Moreover, shorter dry periods were associated with about 1.5 times higher odds of a higher milk yield. The greatest yield was observed in the 2nd and 4th parities, with an odds ratio (OR) equal to 1.75, on average. In conclusion, OLM can be used for analyzing dairy cow data and provides useful information compared with other classical regression models. In addition, the current study showed the applicability of OLM in understanding and analyzing livestock datasets for planning effective breeding programs.
Keywords: Holstein–Friesian dairy cows; Milk production; Odds ratio; Ordered logit models; Predicted probability; Proportional odds
Year: 2022 PMID: 36242599 PMCID: PMC9569299 DOI: 10.1007/s11250-022-03329-x
Source DB: PubMed Journal: Trop Anim Health Prod ISSN: 0049-4747 Impact factor: 1.893
The variables studied and data coding
| Variable | Role in the model | Categories and codes |
|---|---|---|
| Actual milk yield (kg) | Dependent variable (response or outcome) | < 4500 (low); 4500–7500 (medium); > 7500 (high) |
| Age at first calving (months) | Explanatory variable (covariate) | < 22; 22–25; > 25 |
| Lactation order (parity) | Explanatory variable | P1; P2; P3; P4; P5; P6 |
| Days open | Explanatory variable | < 100; 100–200; 201–400; > 400 |
| Lactation period (days) | Explanatory variable | < 150; 150–250; > 250 |
| Peak milk yield (kg) | Explanatory variable | < 35; 35–50; > 50 |
| Dry period (days) | Explanatory variable | < 60; 60–80; > 80 |
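The coding of the outcome variable in the table above can be sketched in a few lines of Python. This is only an illustration: the function name `yield_category` and the integer codes are hypothetical, and treating exactly 4500 kg and 7500 kg as "medium" is an assumption read off the "4500–7500" label rather than something the source states.

```python
def yield_category(milk_kg: float) -> tuple[int, str]:
    """Map an actual milk yield (kg) to an ordered code and label.

    Assumption: the boundary values 4500 and 7500 belong to the
    medium class, following the "4500-7500" label in the table.
    """
    if milk_kg < 0:
        raise ValueError("milk yield cannot be negative")
    if milk_kg < 4500:
        return 1, "low"
    if milk_kg <= 7500:
        return 2, "medium"
    return 3, "high"


# Example: the three class means reported in the summary statistics table.
examples = [yield_category(kg) for kg in (2954.80, 5963.51, 11666.56)]
```

The integer codes keep the natural ordering low < medium < high, which is what an ordered logit model requires of its response.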
The most common models used for modeling dependent variables with multiple levels
| Model name^a | Type of dependent variable | Assumptions |
|---|---|---|
| Multinomial logistic regression (logit) | Categorical, unordered | a. The characteristics of individuals should be provided; b. The dependent variable categories should be independent |
| Multinomial logistic regression (probit) | Categorical, unordered | a. The characteristics of individuals should be provided |
| Ordinal logistic regression (ordered logit) | Categorical, ordered | a. The characteristics of individuals should be provided; b. Proportional odds assumption is required |
| Ordinal logistic regression (ordered probit) | Categorical, ordered | a. The characteristics of individuals should be provided; b. Proportional odds assumption is required |
| Nested logit | Nested nominal design | a. The inclusive value (IV) should be positive |
| Conditional logit | Categorical, nominal | a. The characteristics of both categories and individuals should be provided |
^a According to previous authors (Akkus and Ozkoc 2012; Akkus and Sevinc 2019)
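For orientation, the ordered logit model with three outcome levels can be written as below. This is a sketch under one common sign convention; the source does not state its parameterization, and the symbols α_j (thresholds), β (location coefficients), and x (predictor vector) are introduced here for illustration.

```latex
\operatorname{logit}\, P(Y \le j \mid \mathbf{x})
  = \ln \frac{P(Y \le j \mid \mathbf{x})}{P(Y > j \mid \mathbf{x})}
  = \alpha_j - \mathbf{x}^{\top}\boldsymbol{\beta},
  \qquad j = 1, 2
```

The proportional odds assumption is that the same β applies at both cut points (j = 1 and j = 2), so only the thresholds α_1 < α_2 differ. Under this convention, a positive coefficient shifts probability toward the higher yield categories, and exp(β_k) is the odds ratio for predictor k.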
Summary statistics and descriptive measures for the lifetime investigated variables
| Dependent and independent variables | Percent (%) | Mean | Standard deviation |
|---|---|---|---|
| Actual milk yield (kg) | |||
| < 4500 | 22.72 | 2954.80 | 1048.47 |
| 4500–7500 | 28.55 | 5963.51 | 881.89 |
| > 7500 | 48.73 | 11,666.56 | 3870.11 |
| Age at first calving (month) | |||
| < 22 | 15.40 | 21.86 | 0.34 |
| 22–25 | 71.75 | 23.56 | 0.74 |
| > 25 | 12.86 | 28.23 | 3.19 |
| Days open | |||
| < 100 | 32.14 | 76.81 | 12.84 |
| 100–200 | 37.97 | 144.79 | 27.78 |
| 201–400 | 20.33 | 284.57 | 54.46 |
| > 400 | 7.47 | 492.26 | 88.86 |
| Lactation period (days) | |||
| < 150 | 19.43 | 120.12 | 25.37 |
| 150–250 | 37.52 | 200.71 | 28.31 |
| > 250 | 40.96 | 399.01 | 127.29 |
| Peak milk yield (kg) | |||
| < 35 | 8.97 | 31.14 | 4.57 |
| 35–50 | 60.24 | 43.50 | 3.86 |
| > 50 | 30.79 | 56.15 | 4.03 |
| Dry period (days) | |||
| < 60 | 49.78 | 55.29 | 3.80 |
| 60–80 | 40.36 | 65.54 | 4.61 |
| > 80 | 9.72 | 103.90 | 26.74 |
Test of parallel lines (proportional odds assumption)
| Tested model | -2 log-likelihood* | Chi-square | Degrees of freedom (df) | P-value |
|---|---|---|---|---|
| Null hypothesis | 374.9 | | | |
| General | 351.1 | 23.84 | 16 | 0.093 NS |
NS: non-significant at 5% significance level (P > 0.05)
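The test of parallel lines compares the -2 log-likelihood of the proportional odds model (null hypothesis) with that of a general model that allows separate coefficients at each cut point. The sketch below recomputes the statistic and its P-value from the reported figures. It uses the closed-form chi-square survival function that holds for even degrees of freedom, so it needs only the standard library; the helper name `chi2_sf_even_df` is hypothetical.

```python
import math


def chi2_sf_even_df(x: float, df: int) -> float:
    """Upper-tail probability P(X > x) for a chi-square variable.

    Valid only for even df, where the tail has the closed form
    exp(-x/2) * sum_{i=0}^{df/2 - 1} (x/2)^i / i!.
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("closed form requires a positive even df")
    half = x / 2.0
    term = 1.0   # (x/2)^0 / 0!
    total = 1.0
    for i in range(1, df // 2):
        term *= half / i
        total += term
    return math.exp(-half) * total


neg2ll_null = 374.9      # proportional odds model
neg2ll_general = 351.1   # general (non-parallel) model
df = 16

# Difference of the rounded -2LL values; the table reports 23.84.
chi_square_from_table = neg2ll_null - neg2ll_general
p_value = chi2_sf_even_df(23.84, df)
```

A P-value above 0.05 means the general model does not fit significantly better, so the proportional odds assumption is retained.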
Model fitting, validity, and goodness-of-fit information based on the logit link function
| Model fitting information | | | | |
|---|---|---|---|---|
| Model | -2 log-likelihood | Chi-square | DF | P-value |
| Intercept only | 1213.7 | | | |
| Final | 374.9 | 838.8 | 16 | 0.0001* |
| Goodness-of-fit measures | | | | |
| Statistic | Chi-square | DF | P-value | |
| Pearson | 378.4 | 598 | 1.00 | |
| Deviance | 286.5 | 598 | 1.00 | |
| Pseudo R-square | | | | |
| Cox and Snell | 0.722 | | | |
| Nagelkerke | 0.823 | | | |
| McFadden | 0.609 | | | |
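The model chi-square in the fitting table is the likelihood-ratio statistic, that is, the drop in -2 log-likelihood from the intercept-only model to the final model. As a worked check using the reported values (the subscripted symbols are notation introduced here):

```latex
\chi^2_{\mathrm{LR}}
  = \left(-2\ln L_{\text{intercept only}}\right) - \left(-2\ln L_{\text{final}}\right)
  = 1213.7 - 374.9
  = 838.8,
  \qquad \mathrm{df} = 16
```

The 16 degrees of freedom correspond to the number of location parameters in the model table: 2 (age at first calving) + 3 (days open) + 2 (lactation period) + 2 (peak milk yield) + 2 (dry period) + 5 (parity).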
Multivariable-ordered logit model for the analysis of factors influencing milk yield of Holstein-Friesians
| Dependent variable: actual milk yield (1: < 4500; 2: 4500–7500; 3: > 7500) | Coefficients: β (SE) | Wald | P-value | 95% CI of coefficients: lower limit | 95% CI of coefficients: upper limit | Odds ratio (OR) |
|---|---|---|---|---|---|---|
| Threshold parameters | | | | | | |
| Actual milk yield (1) | 0.948 | 806.6 | 0.0001** | −28.79 | −25.08 | - |
| Actual milk yield (2) | 0.834 | 737.8 | 0.0001** | −24.29 | −21.02 | - |
| Location parameters | | | | | | |
| Age at first calving | | | | | | |
| < 22 | 0.730 (0.456) | 2.567 | 0.039* | −1.163 | 1.623 | 2.08 |
| 22–25 | 0.358 (0.367) | 0.948 | 0.330 NS | −0.362 | 1.077 | 1.43 |
| > 25 (base group) | | | | | | |
| Days open | | | | | | |
| < 100 | −18.54 (0.391) | 2247.9 | 0.001** | −19.31 | −17.78 | 0.432 |
| 100–200 | −19.01 (0.373) | 2602.4 | 0.001** | −19.75 | −18.28 | 0.369 |
| 201–400 | −17.76 (0.965) | 338.56 | 0.048* | −19.26 | −16.26 | 0.721 |
| > 400 (base group) | | | | | | |
| Lactation period (DIM) | | | | | | |
| < 150 | −9.78 (0.654) | 224.19 | 0.001** | −11.07 | −8.51 | 0.509 |
| 150–250 | −4.71 (0.393) | 143.62 | 0.001** | −5.48 | −3.94 | 0.489 |
| > 250 (base group) | | | | | | |
| Peak milk yield | | | | | | |
| < 35 | −5.39 (0.512) | 110.98 | 0.001** | −6.39 | −4.39 | 0.691 |
| 35–50 | −2.58 (0.349) | 54.71 | 0.042* | −3.26 | −1.89 | 0.856 |
| > 50 (reference) | | | | | | |
| Dry period | | | | | | |
| < 60 | 0.406 (0.448) | 0.823 | 0.044* | −0.471 | 1.284 | 1.50 |
| 60–80 | −0.031 (0.449) | 0.005 | 0.945 NS | −0.912 | 0.850 | 0.97 |
| > 80 (base group) | | | | | | |
| Lactation order (parity) | | | | | | |
| P1 | 0.255 (0.517) | 0.242 | 0.623 NS | −0.759 | 1.268 | 1.29 |
| P2 | 0.599 (0.518) | 1.341 | 0.047* | −0.415 | 1.614 | 1.82 |
| P3 | 0.045 (0.533) | 0.007 | 0.933 NS | −1.00 | 1.090 | 1.05 |
| P4 | 0.553 (0.575) | 0.924 | 0.036* | −0.574 | 1.679 | 1.74 |
| P5 | 0.148 (0.656) | 0.051 | 0.821 NS | −1.137 | 1.433 | 1.16 |
| P6 (base group) | | | | | | |
| Model validity | ||||||
| -2 log-likelihood = 374.9; LR chi-sq (16) = 838.8; prob > chi-square = 0.0001** | ||||||
| Correct classification rate = 82.32% | ||||||
** Coefficient is significant at a 0.01 level of significance (P < 0.01)
* Coefficient is significant at a 0.05 level of significance (P < 0.05)
NS: Coefficient is non-significant at a 0.05 level of significance (P > 0.05)
Reference categories: AFC (> 25 months), DO (> 400 days), LP (> 250 days), PMY (> 50 kg), DP (> 80 days), lactation order or parity (P6)
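In an ordered logit model, the odds ratio for a category relative to its reference category is the exponential of the coefficient, OR = exp(β). The sketch below applies this to the age at first calving, dry period, and parity coefficients reported in the table and compares the results with the reported odds ratios. The dictionary keys are labels chosen here for illustration.

```python
import math

# Coefficients (beta) copied from the model table, keyed by illustrative labels.
coefficients = {
    "AFC < 22": 0.730,
    "AFC 22-25": 0.358,
    "DP < 60": 0.406,
    "DP 60-80": -0.031,
    "P1": 0.255,
    "P2": 0.599,
    "P3": 0.045,
    "P4": 0.553,
    "P5": 0.148,
}

# Odds ratios as reported in the same table rows.
reported_or = {
    "AFC < 22": 2.08,
    "AFC 22-25": 1.43,
    "DP < 60": 1.50,
    "DP 60-80": 0.97,
    "P1": 1.29,
    "P2": 1.82,
    "P3": 1.05,
    "P4": 1.74,
    "P5": 1.16,
}

# OR = exp(beta), rounded to two decimals as in the table.
odds_ratios = {name: round(math.exp(beta), 2) for name, beta in coefficients.items()}

# Collect any rows where the recomputed value differs from the reported one.
mismatches = {
    name: (odds_ratios[name], reported_or[name])
    for name in coefficients
    if odds_ratios[name] != reported_or[name]
}
```

An odds ratio above 1 means the category has higher odds of falling in a higher milk yield class than the reference category; for example, exp(0.730) corresponds to the roughly twofold odds reported for cows first calving before 22 months.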