| Literature DB >> 29685441 |
Nils Fall1, Anna Ohlson2, Ulf Emanuelson3, Ian Dohoo4.
Abstract
The use of routinely recorded data for research purposes and disease surveillance is an attractive proposition. However, this requires that the validity and reliability of the data be evaluated for the purpose for which they are to be used. This manuscript reports an evaluation of milk shipment data for evaluating their usefulness in disease monitoring and the resilience of organic and conventional dairy herds in Sweden. A large number of inconsistencies were observed in the data, necessitating substantial efforts to "clean" the data. Given that the selection of rules used in the cleaning process was subjective in nature, a sensitivity analysis was carried out to determine if different cleaning routines produced substantially different results. Despite the cleaning efforts we observed far more large residuals at the shipment level than expected. Thus, it was concluded that the data were too "noisy" to be used for identification of short term impacts on milk production. Resilience was evaluated by examining the residual variance in milk shipped per cow per day under the assumption that herds with high resilience would have lower residual variance. The effects on residual variance of organic status or whether or not the herd used an automatic milking system were evaluated in models in which the residual variance was stratified or not by these factors. We did not find consistent evidence to suggest that organic herds had higher resilience than conventional herds, but this could be partly due to using residual variance as the measure indicating resilience.Entities:
Keywords: Dairy; Disease monitoring; Milk shipment data; Resilience, organic
Mesh:
Year: 2018 PMID: 29685441 PMCID: PMC7114288 DOI: 10.1016/j.prevetmed.2018.03.012
Source DB: PubMed Journal: Prev Vet Med ISSN: 0167-5877 Impact factor: 2.670
Steps in preparation of data set cln20.
| Data compilation steps | # of herds | # of observations |
|---|---|---|
| Initial milk shipment data file | 137 | 95858 |
| Remove exact duplicate records (n = 3243) | 137 | 92615 |
| Combine multiple shipments within one day into a single daily total (5393 combined into 2615 records) | 137 | 89837 |
| Drop duplicate records on consecutive days (n = 164) | 137 | 89673 |
| Drop all milk shipment data after 26 November 2015 (n = 9257) and (38) herds (n = 20,760) with no herd size information | 99 | 59656 |
| Merge with herd managment file and drop (5) herds (n = 646) with no management information | 94 | 59010 |
| Data cleaning steps | ||
| Identify observations meeting supect rule #1 (n = 886) | ||
| Identify observations meeting supect rule #2 (n = 2226) | ||
| Identify observations meeting supect rule #3 (n = 2806) | ||
| Drop observations flagged by any of the preceding three rules plus two shipments on either side (n = 7240) | 94 | 51770 |
| Drop one herd subsequently identified as having seasonal calving (n = 627) | 93 | 51143 |
Fig. 1Proportional change in milk shipped per cow per day before and after data cleaning. (See text for description of data cleaning process).
Estimated shipment-day variance from random effects linear regression models of milk shipped per cow per day, as a measure of resilience. Model 1–3 applied different stratifications, model 4-6 compared different cut-offs for removing suspect records from the data set, models 7 and 9 applied an alternative correlation structure while poorly fitting herds were removed in models 8 and 10.
| Model | Dataset | Correlation structure | n | Lowest level variance | |||
|---|---|---|---|---|---|---|---|
| non-organic | organic | ||||||
| no ams | ams | no ams | ams | ||||
| 1 | cln20 | ar1 | 51014 | 3.74 | |||
| 2 | cln20 | ar1 | 51014 | 3.74 | 3.76 | ||
| 3 | cln20 | ar1 | 51014 | 3.70 | 3.82 | 3.69 | 3.87 |
| 4 | cln25 | ar1 | 52951 | 3.76 | 3.99 | 3.81 | 4.00 |
| 5 | cln15 | ar1 | 43289 | 3.75 | 3.39 | 3.58 | 3.89 |
| 6 | cln60/20 | ar1 | 50269 | 3.69 | 3.65 | 3.69 | 3.92 |
| 7 | cln20 | exp | 51014 | 3.76 | 4.00 | 3.82 | 3.97 |
| 8 | cln20r | exp | 45110 | 3.00 | 3.74 | 3.63 | 4.28 |
| 9 | cln15 | exp | 43289 | 3.70 | 3.39 | 3.75 | 3.84 |
| 10 | cln15r | exp | 38702 | 3.71 | 3.84 | 3.58 | 4.14 |
see text for description of construction of the various data sets.
Herd level characteristics of the 93 study herds in data set cln20. Herd size, milk shipped and within herd SD are averages over the study period.
| automated milking system (ams) | |||||
|---|---|---|---|---|---|
| Production system | parameter | no | transition during study | AMS for full period | Total |
| Non-Organic | number of herds | 29 | 8 | 2 | 39 |
| herd size | 89 | 111 | 204 | 99 | |
| milk shipped | 28.91 | 30.35 | 28.68 | 29.19 | |
| within-herd SD | 2.11 | 2.28 | 2.07 | 2.14 | |
| Organic | number of herds | 20 | 19 | 15 | 54 |
| herd size | 80 | 76 | 72 | 76 | |
| milk shipped | 26.26 | 26.6 | 27.19 | 26.64 | |
| within-herd SD | 2.19 | 2.06 | 2.36 | 2.19 | |
| Total | number of herds | 49 | 27 | 17 | 93 |
| herd size | 85 | 87 | 87 | 86 | |
| milk shipped | 27.83 | 27.71 | 27.36 | 27.71 | |
| within-herd SD | 2.14 | 2.13 | 2.33 | 2.17 | |
estimated number of cows milked.
milk shipped per cow per day.
within herd standard deviation of milk shipment weights.
Final linear regression model evaluating the effect of selected factors on milk shipped per cow per day, based on data set cln20, with the lowest level variance stratified by both AMS and organic and imposing an exponential correlation structure on the lowest level residuals.
| Variable | Level | Estimate | S.E. | 95% Confidence interval |
|---|---|---|---|---|
| Intercept | 27.53 | 0.78 | 26.01 ; 29.05 | |
| Year | 2012 | baseline | ||
| 2013 | 0.22 | 0.07 | 0.09 ; 0.36 | |
| 2014 | 0.67 | 0.08 | 0.51 ; 0.82 | |
| 2015 | 1.17 | 0.09 | 0.99 ; 1.34 | |
| Cows | 0.02 | 0.03 | −0.04 ; 0.08 | |
| Ams (vs non-AMS) | Yes | 0.69 | 0.52 | −0.34 ; 1.71 |
| Organic (vs non-organic) | Yes | −2.81 | 0.53 | −3.85 ; -1.78 |
| Time of year | Cos | 0.29 | 0.04 | 0.2 ; 0.37 |
| Sin | 1.29 | 0.04 | 1.2 ; 1.37 | |
| Region | South Sweden | baseline | ||
| Central Sweden | −0.5 | 0.59 | −1.67 ; 0.66 | |
| North Sweden | −0.48 | 0.68 | −1.82 ; 0.85 | |
| Breed | Swedish Red | baseline | ||
| Swedish Holstein | 3.74 | 0.83 | 2.11 ; 5.38 | |
| Mixed Red and Holstein | 0.98 | 0.7 | −0.39 ; 2.34 | |
| Other | 0.28 | 0.82 | −1.33 ; 1.89 | |
| Random effects | ||||
| Herd | 6.756 | 0.931 | 5.157 ; 8.850 | |
| Non AMS, non organic | Rho | 0.947 | 0.002 | 0.944 ; 0.950 |
| Residual variance | 3.754 | 0.119 | 3.528 ; 3.995 | |
| Non AMS, organic | Rho | 0.939 | 0.002 | 0.935 ; 0.942 |
| Residual variance | 3.819 | 0.116 | 3.598 ; 4.052 | |
| AMS, non organic | Rho | 0.894 | 0.007 | 0.880 ; 0.906 |
| Residual variance | 3.996 | 0.238 | 3.556 ; 4.491 | |
| AMS, organic | Rho | 0.905 | 0.003 | 0.900 ; 0.911 |
| Residual variance | 3.973 | 0.114 | 3.756 ; 4.204 | |
per 10 cows.