| Literature DB >> 29642673 |
Daniel Zaborski1, Witold S Proskura1, Wilhelm Grzesiak1.
Abstract
OBJECTIVE: The aim of this study was to verify the usefulness of artificial neural networks (ANN), multivariate adaptive regression splines (MARS), naïve Bayes classifier (NBC), general discriminant analysis (GDA), and logistic regression (LR) for dystocia detection in Polish Holstein-Friesian Black-and-White heifers and cows and to indicate the most influential predictors of calving difficulty.Entities:
Keywords: Dairy Cattle; Dystocia; Prediction; Statistical Analysis
Year: 2018 PMID: 29642673 PMCID: PMC6212759 DOI: 10.5713/ajas.17.0780
Source DB: PubMed Journal: Asian-Australas J Anim Sci ISSN: 1011-2367 Impact factor: 2.509
Probabilities for individual models on the training (n = 671), validation (n = 335), and test (n = 336) sets (two-class system, heifer dataset)
| Model | SET | Se | Sp | Acc | P(FP) | P(FN) | P(PSTP) | P(PSTN) |
|---|---|---|---|---|---|---|---|---|
| LN | L | 0.7707 | 0.8219 | 0.8063 | 0.1781 | 0.2293 | 0.6556 | 0.8907 |
| V | 0.6875 | 0.8475 | 0.7940 | 0.1525 | 0.3125 | 0.6937 | 0.8438 | |
| L+V | 0.7413 | 0.8302 | 0.8022 | 0.1698 | 0.2587 | 0.6676 | 0.8746 | |
| T | 0.7818 | 0.7832 | 0.7827 | 0.2168 | 0.2182 | 0.6370 | 0.8806 | |
| MLP1 | L | 0.8146 | 0.8305 | 0.8256 | 0.1695 | 0.1854 | 0.6789 | 0.9106 |
| V | 0.7768 | 0.8251 | 0.8090 | 0.1749 | 0.2232 | 0.6905 | 0.8804 | |
| L+V | 0.8013 | 0.8287 | 0.8201 | 0.1713 | 0.1987 | 0.6828 | 0.9006 | |
| T | 0.8545 | 0.7743 | 0.8006 | 0.2257 | 0.1455 | 0.6483 | 0.9162 | |
| MLP2 | L | 0.8098 | 0.8219 | 0.8182 | 0.1781 | 0.1902 | 0.6667 | 0.9076 |
| V | 0.7589 | 0.8206 | 0.8000 | 0.1794 | 0.2411 | 0.6800 | 0.8714 | |
| L+V | 0.7918 | 0.8215 | 0.8121 | 0.1785 | 0.2082 | 0.6711 | 0.8956 | |
| T | 0.8455 | 0.7655 | 0.7917 | 0.2345 | 0.1545 | 0.6370 | 0.9105 | |
| RBF | L | 0.8146 | 0.8155 | 0.8152 | 0.1845 | 0.1854 | 0.6601 | 0.9091 |
| V | 0.7589 | 0.8251 | 0.8030 | 0.1749 | 0.2411 | 0.6855 | 0.8720 | |
| L+V | 0.7950 | 0.8186 | 0.8111 | 0.1814 | 0.2050 | 0.6684 | 0.8967 | |
| T | 0.8273 | 0.7566 | 0.7798 | 0.2434 | 0.1727 | 0.6233 | 0.9000 | |
| MARS | L+V | 0.6215 | 0.9173 | 0.8241 | 0.0827 | 0.3785 | 0.7756 | 0.8404 |
| T | 0.6545 | 0.8628 | 0.7946 | 0.1372 | 0.3455 | 0.6990 | 0.8369 | |
| NBC | L+V | 0.2397 | 0.9927 | 0.7555 | 0.0073 | 0.7603 | 0.9383 | 0.7395 |
| T | 0.2818 | 0.9690 | 0.7440 | 0.0310 | 0.7182 | 0.8158 | 0.7349 | |
| GDA | L+V | 0.6057 | 0.9042 | 0.8101 | 0.0958 | 0.3943 | 0.7442 | 0.8329 |
| T | 0.7000 | 0.8673 | 0.8125 | 0.1327 | 0.3000 | 0.7196 | 0.8559 | |
| LR | L+V | 0.6341 | 0.9028 | 0.8181 | 0.0972 | 0.3659 | 0.7500 | 0.8428 |
| T | 0.6909 | 0.8673 | 0.8095 | 0.1327 | 0.3091 | 0.7170 | 0.8522 |
Se, sensitivity; Sp, specificity; Acc, accuracy; P(FP), false positive rate; P(FN), false negative rate; P(PSTP), a posteriori probability of true positives; P(PSTN), a posteriori probability of true negatives.
Model: LN, linear networks; MLP1, multilayer perceptrons with one hidden layer; MLP2, multilayer perceptrons with two hidden layers; RBF, radial basis function networks; MARS, multivariate adaptive regression splines; NBC, naïve Bayes classifier; GDA, general discriminant analysis; LR, logistic regression.
Dataset: L, training set; V, validation set; T, test set.
Values within columns (and within sets) with different superscripts differ significantly (p<0.05).
Quality measures for the models predicting calving difficulty in heifers and cows (two- and three-class system)
| Model | Heifers | Cows | ||||
|---|---|---|---|---|---|---|
|
|
| |||||
| AIC | BIC | G2 | AIC | BIC | G2 | |
| Two-class system | ||||||
| LN | −1,887.16 | −1,857.68 | 446.22 | −4,461.26 | −4,409.76 | 1,477.22 |
| MLP1 | −1,989.56 | −1,915.85 | 399.33 | −4,552.60 | −4,495.95 | 1,527.00 |
| MLP2 | −1,908.76 | −1,673.11 | 418.99 | −4,257.93 | −3,521.05 | 1,446.49 |
| RBF | −1,701.15 | −1,166.05 | 421.34 | −4,440.15 | −4,311.38 | 1,690.78 |
| MARS | −1,729.76 | −1,451.99 | 420.59 | −912.21 | 1,108.09 | 70.24 |
| NBC | −1,632.62 | −1,583.48 | 915.50 | −4,285.61 | −4,192.90 | - |
| GDA | −2,019.97 | −1,970.83 | 456.65 | −4,391.31 | −4,298.59 | - |
| LR | −2,034.55 | −2,009.98 | 429.82 | −4,409.51 | −4,363.15 | - |
| Three-class system | ||||||
| LN | −1,663.94 | −1,575.49 | 1,395.31 | −2,308.71 | −2,125.43 | 1,003.61 |
| MLP1 | −1,638.86 | −1,365.75 | 1,181.25 | −1,650.23 | −279.29 | 870.53 |
| MLP2 | −621.73 | 750.33 | 1,172.18 | −1,989.35 | −1,064.54 | 892.54 |
| RBF | −1,561.60 | −1,219.08 | 1,282.70 | −1,171.75 | 544.35 | 960.53 |
| MARS | −1,589.96 | −1,425.34 | 1,104.91 | −1,524.42 | −202.24 | 894.38 |
| NBC | −1,734.65 | −1,660.94 | 1,527.62 | −2,177.10 | −2,038.03 | 1,347.23 |
| GDA | −1,714.63 | −1,640.93 | 1,218.13 | −2,361.01 | −2,206.49 | 972.12 |
AIC, Akaike information criterion; BIC, Bayesian information criterion; G2, G-squared statistic.
Model: LN, linear networks; MLP1, multilayer perceptrons with one hidden layer; MLP2, multilayer perceptrons with two hidden layers; RBF, radial basis function networks; MARS, multivariate adaptive regression splines; NBC, naïve Bayes classifier; GDA, general discriminant analysis; LR, logistic regression.
Values could not be calculated.
Probabilities for individual models on the training (n = 850), validation (n = 425), and test (n = 424) sets (two-class system, cow dataset)
| Model | SET | Se | Sp | Acc | P(FP) | P(FN) | P(PSTP) | P(PSTN) |
|---|---|---|---|---|---|---|---|---|
| LN | L | 0.7333 | 0.6439 | 0.6471 | 0.3561 | 0.2667 | 0.0701 | 0.9851 |
| V | 0.6154 | 0.6359 | 0.6353 | 0.3641 | 0.3846 | 0.0506 | 0.9813 | |
| L+V | 0.6977 | 0.6412 | 0.6431 | 0.3588 | 0.3023 | 0.0636 | 0.9838 | |
| T | 0.4667 | 0.6675 | 0.6604 | 0.3325 | 0.5333 | 0.0490 | 0.9715 | |
| MLP1 | L | 0.7333 | 0.6220 | 0.6259 | 0.3780 | 0.2667 | 0.0663 | 0.9846 |
| V | 0.6154 | 0.6578 | 0.6565 | 0.3422 | 0.3846 | 0.0537 | 0.9819 | |
| L+V | 0.6977 | 0.6339 | 0.6361 | 0.3661 | 0.3023 | 0.0624 | 0.9836 | |
| T | 0.4667 | 0.6504 | 0.6439 | 0.3496 | 0.5333 | 0.0467 | 0.9708 | |
| MLP2 | L | 0.7000 | 0.6634 | 0.6647 | 0.3366 | 0.3000 | 0.0707 | 0.9837 |
| V | 0.5385 | 0.6408 | 0.6376 | 0.3592 | 0.4615 | 0.0452 | 0.9778 | |
| L+V | 0.6512 | 0.6558 | 0.6557 | 0.3442 | 0.3488 | 0.0619 | 0.9818 | |
| T | 0.6000 | 0.6479 | 0.6462 | 0.3521 | 0.4000 | 0.0588 | 0.9779 | |
| RBF | L | 0.7667 | 0.5890 | 0.5953 | 0.4110 | 0.2333 | 0.0639 | 0.9857 |
| V | 0.5385 | 0.6286 | 0.6259 | 0.3714 | 0.4615 | 0.0438 | 0.9774 | |
| L+V | 0.6977 | 0.6023 | 0.6055 | 0.3977 | 0.3023 | 0.0577 | 0.9828 | |
| T | 0.4000 | 0.6235 | 0.6156 | 0.3765 | 0.6000 | 0.0375 | 0.9659 | |
| MARS | L+V | 0.4419 | 1.0000 | 0.9812 | 0.0000 | 0.5581 | 1.0000 | 0.9809 |
| T | 0.0000 | 0.9878 | 0.9528 | 0.0122 | 1.0000 | 0.0000 | 0.9642 | |
| NBC | L+V | 0.0000 | 1.0000 | 0.9663 | 0.0000 | 1.0000 | - | 0.9663 |
| T | 0.0000 | 1.0000 | 0.9646 | 0.0000 | 1.0000 | - | 0.9646 | |
| GDA | L+V | 0.0000 | 1.0000 | 0.9663 | 0.0000 | 1.0000 | - | 0.9663 |
| T | 0.0000 | 1.0000 | 0.9646 | 0.0000 | 1.0000 | - | 0.9646 | |
| LR | L+V | 0.0000 | 1.0000 | 0.9663 | 0.0000 | 1.0000 | - | 0.9663 |
| T | 0.0000 | 1.0000 | 0.9646 | 0.0000 | 1.0000 | - | 0.9646 |
Se, sensitivity; Sp, specificity; Acc, accuracy; P(FP), false positive rate; P(FN), false negative rate; P(PSTP), a posteriori probability of true positives; P(PSTN), a posteriori probability of true negatives.
Model: LN, linear networks; MLP1, multilayer perceptrons with one hidden layer; MLP2, multilayer perceptrons with two hidden layers; RBF, radial basis function networks; MARS, multivariate adaptive regression splines; NBC, naïve Bayes classifier; GDA, general discriminant analysis; LR, logistic regression.
Dataset: L, training set; V, validation set; T, test set.
Values of the test statistic could not be calculated.
Values of the test statistic could not be calculated for the comparisons between NBC, GDA, and LR.
P(PSTP) values could not be calculated.
Values within columns (and within sets) with different superscripts differ significantly (p<0.05).
Proportions of correctly classified cases for the three categories of calving difficulty in heifers and cows (three-class system)
| Model | SET | Heifers | Cows | ||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
| ||||||||
| Easy | Mod | Diff | Acc | Easy | Mod | Diff | Acc | ||
| LN | L | 0.2844 | 0.5242 | 0.8195 | 0.5365 | 0.6395 | 0.7023 | 0.0000 | 0.6494 |
| V | 0.2909 | 0.6106 | 0.7500 | 0.5522 | 0.6022 | 0.6926 | 0.0000 | 0.6329 | |
| L+V | 0.2866 | 0.5512 | 0.7950 | 0.5417 | 0.6275 | 0.6990 | 0.0000 | 0.6439 | |
| T | 0.2432 | 0.4783 | 0.8273 | 0.5149 | 0.6786 | 0.6526 | 0.0000 | 0.6415 | |
| MLP1 | L | 0.2982 | 0.7661 | 0.7659 | 0.6140 | 0.6526 | 0.7591 | 0.0000 | 0.6847 |
| V | 0.3000 | 0.7257 | 0.7143 | 0.5821 | 0.6630 | 0.7143 | 0.0000 | 0.6706 | |
| L+V | 0.2988 | 0.7535 | 0.7476 | 0.6034 | 0.6560 | 0.7437 | 0.0000 | 0.6800 | |
| T | 0.2072 | 0.7130 | 0.8000 | 0.5744 | 0.6122 | 0.6995 | 0.0000 | 0.6344 | |
| MLP2 | L | 0.3303 | 0.6895 | 0.8000 | 0.6066 | 0.6526 | 0.7386 | 0.0000 | 0.6741 |
| V | 0.3182 | 0.6726 | 0.7232 | 0.5731 | 0.6575 | 0.7229 | 0.0000 | 0.6729 | |
| L+V | 0.3262 | 0.6842 | 0.7729 | 0.5954 | 0.6542 | 0.7332 | 0.0000 | 0.6737 | |
| T | 0.2613 | 0.6696 | 0.8091 | 0.5804 | 0.6582 | 0.6854 | 0.0000 | 0.6486 | |
| RBF | L | 0.2936 | 0.7056 | 0.7415 | 0.5827 | 0.6658 | 0.7091 | 0.0000 | 0.6647 |
| V | 0.3182 | 0.6549 | 0.6161 | 0.5313 | 0.6077 | 0.6926 | 0.0000 | 0.6353 | |
| L+V | 0.3018 | 0.6898 | 0.6972 | 0.5656 | 0.6471 | 0.7034 | 0.0000 | 0.6549 | |
| T | 0.2523 | 0.5826 | 0.7636 | 0.5327 | 0.6633 | 0.6808 | 0.0000 | 0.6486 | |
| MARS | L+V | 0.3659 | 0.6648 | 0.7886 | 0.6064 | 0.7094 | 0.7854 | 0.1163 | 0.7294 |
| T | 0.2883 | 0.6348 | 0.8455 | 0.5893 | 0.6633 | 0.6714 | 0.0667 | 0.6462 | |
| NBC | L+V | 0.1677 | 0.8283 | 0.7066 | 0.5746 | 0.3102 | 0.8927 | 0.0000 | 0.6063 |
| T | 0.1081 | 0.8000 | 0.7636 | 0.5595 | 0.3622 | 0.8779 | 0.0000 | 0.6085 | |
| GDA | L+V | 0.4116 | 0.5235 | 0.7666 | 0.5636 | 0.6292 | 0.7139 | 0.0000 | 0.6525 |
| T | 0.3784 | 0.4522 | 0.7818 | 0.5357 | 0.6888 | 0.6526 | 0.0000 | 0.6462 | |
Easy, easy calving; Mod, moderate calving; Diff, difficult calving; Acc, accuracy.
Model: LN, linear networks; MLP1, multilayer perceptrons with one hidden layer; MLP2, multilayer perceptrons with two hidden layers; RBF, radial basis function networks; MARS, multivariate adaptive regression splines; NBC, naïve Bayes classifier; GDA, general discriminant analysis; LR, logistic regression.
Dataset: L, training set; V, validation set; T, test set.
Values of the test statistic could not be calculated.
Values within columns (and within sets) with different superscripts differ significantly (p<0.05).
The most influential predictors of calving difficulty for heifers and cows
| Variable | LN | MLP1 | MLP2 | RBF | MARS | GDA | LR |
|---|---|---|---|---|---|---|---|
| Heifers, two-class system | |||||||
| Mean | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| Farm | 2 | 3 | 3 | 3 | 4 | 2 | 2 |
| Age | 3 | 2 | 2 | 2 | 2 | 3 | 3 |
| Season | 4 | 4 | 4 | 4 | 5 | 4 | 4 |
| Sex | 5 | 5 | 5 | 5 | 3 | 5 | 5 |
| Heifers, three-class system | |||||||
| Mean | 1 | 1 | 1 | 1 | 1 | 1 | - |
| Farm | 2 | 2 | 2 | 3 | 2 | 2 | - |
| Age | 3 | 3 | 3 | 2 | 2 | 3 | - |
| Season | 4 | 4 | 4 | 5 | 3 | 4 | - |
| Sex | 5 | 5 | Ex | 4 | 4 | 5 | - |
| Cows, two-class system | |||||||
| Mean | 3 | 1 | 4 | Ex | 3 | 2 | 2 |
| Farm | 9 | Ex | Ex | Ex | 8 | 8 | 7 |
| Age | 7 | 5 | 5 | Ex | 1 | 7 | 8 |
| Season | 8 | 6 | 3 | 3 | 6 | 6 | 6 |
| Sex | 1 | 4 | 2 | 1 | 2 | 1 | 1 |
| MDM | 2 | 2 | 6 | Ex | 4 | 3 | 3 |
| MAST | 4 | 8 | 8 | 2 | 9 | 4 | 4 |
| PCALV | 5 | 7 | 1 | 4 | 7 | 5 | 5 |
| CI | 6 | 3 | 7 | 5 | 5 | 9 | 9 |
| Cows, three-class system | |||||||
| Mean | 1 | 1 | 1 | 3 | 3 | 1 | - |
| Farm | 5 | 6 | 6 | 6 | 6 | 5 | - |
| Age | 3 | 5 | 3 | 5 | 1 | 3 | - |
| Season | 9 | 3 | 4 | 1 | 7 | 8 | - |
| Sex | 2 | 4 | 5 | 2 | 5 | 2 | - |
| MDM | 6 | 7 | 7 | 7 | 2 | 4 | - |
| MAST | 8 | 8 | 9 | 9 | 8 | 7 | - |
| PCALV | 4 | 2 | 2 | 4 | 2 | 6 | - |
| CI | 7 | 9 | 8 | 8 | 4 | 9 | - |
LN, linear networks; MLP1, multilayer perceptrons with one hidden layer; MLP2, multilayer perceptrons with two hidden layers; RBF, radial basis function networks; MARS, multivariate adaptive regression splines; GDA, general discriminant analysis; LR, logistic regression.
Mean, mean calving difficulty for the dam’s sire; Farm, herd milk yield category; AGE, calving age; Season, calving season; Sex, calf sex; MDM, mean daily milk yield for the previous lactation; MAST, mastitis during pregnancy; PCALV, preceding calving difficulty; CI, preceding calving interval; Ex, excluded.
Figure 1The ROC curves for different models (two-class system, heifer dataset). ROC, receiver operating characteristic; LN, linear networks; MLP1, multilayer perceptrons with one hidden layer; MLP2, multilayer perceptrons with two hidden layers; RBF, radial basis function networks; MARS, multivariate adaptive regression splines; NBC, naïve Bayes classifier; GDA, general discriminant analysis; LR, logistic regression.
Figure 2The ROC curves for different models (two-class system, cow dataset). ROC, receiver operating characteristic; LN, linear networks; MLP1, multilayer perceptrons with one hidden layer; MLP2, multilayer perceptrons with two hidden layers; RBF, radial basis function networks; MARS, multivariate adaptive regression splines; NBC, naïve Bayes classifier; GDA, general discriminant analysis; LR, logistic regression.