| Literature DB >> 19323968 |
Martin J Green1, Graham F Medley, William J Browne.
Abstract
Assessing the fit of a model is an important final step in any statistical analysis, but this is not straightforward when complex discrete response models are used. Cross validation and posterior predictions have been suggested as methods to aid model criticism. In this paper a comparison is made between four methods of model predictive assessment in the context of a three level logistic regression model for clinical mastitis in dairy cattle; cross validation, a prediction using the full posterior predictive distribution and two "mixed" predictive methods that incorporate higher level random effects simulated from the underlying model distribution. Cross validation is considered a gold standard method but is computationally intensive and thus a comparison is made between posterior predictive assessments and cross validation. The analyses revealed that mixed prediction methods produced results close to cross validation whilst the full posterior predictive assessment gave predictions that were over-optimistic (closer to the observed disease rates) compared with cross validation. A mixed prediction method that simulated random effects from both higher levels was best at identifying the outlying level two (farm-year) units of interest. It is concluded that this mixed prediction method, simulating random effects from both higher levels, is straightforward and may be of value in model criticism of multilevel logistic regression, a technique commonly used for animal health data with a hierarchical structure.Entities:
Mesh:
Year: 2009 PMID: 19323968 PMCID: PMC2675184 DOI: 10.1051/vetres/2009013
Source DB: PubMed Journal: Vet Res ISSN: 0928-4249 Impact factor: 3.683
Figure 1.Plots of observed against predicted farm-year incidence rates of clinical mastitis (cases per cow at risk per year).
Figure 2.Plots of cross validatory predictions of farm-year clinical mastitis incidence against full and mixed predictive methods of farm-year clinical mastitis incidence (cases per cow at risk per year).
Figure 3.Comparison of MCMC P values from cross validation (for values > 0.80 and < 0.20) and from different methods of predictive assessment for farm-year incidence of clinical mastitis.
Sensitivity and specificity of MCMC P values for each prediction method (full = full posterior predictive method, mix 1 and mix 2 = mixed predictive methods 1 and 2 respectively) compared to MCMC P values for cross validation, at different P value thresholds (as specified).
| Cross validation | Total | Sens (%) | Spec (%) | |||
|---|---|---|---|---|---|---|
| 0 | 1 | |||||
| full | 0 | 86 | 14 | 100 | 17.6 | 100.0 |
| 1 | 0 | 3 | 3 | |||
| Total | 86 | 17 | 103 | |||
| mix 1 | 0 | 84 | 3 | 87 | 82.4 | 97.7 |
| 1 | 2 | 14 | 16 | |||
| Total | 86 | 17 | 103 | |||
| mix 2 | 0 | 86 | 10 | 96 | 41.2 | 100.0 |
| 1 | 0 | 7 | 7 | |||
| Total | 86 | 17 | 103 | |||
| full | 0 | 93 | 8 | 101 | 20.0 | 100.0 |
| 1 | 0 | 2 | 2 | |||
| Total | 93 | 10 | 103 | |||
| mix 1 | 0 | 90 | 5 | 95 | 50.0 | 96.8 |
| 1 | 3 | 5 | 8 | |||
| Total | 93 | 10 | 103 | |||
| mix 2 | 0 | 93 | 7 | 100 | 30.0 | 100.0 |
| 1 | 0 | 3 | 3 | |||
| Total | 93 | 10 | 103 | |||
| full | 0 | 98 | 5 | 103 | 0.0 | 100.0 |
| 1 | 0 | 0 | 0 | |||
| Total | 98 | 5 | 103 | |||
| mix 1 | 0 | 98 | 2 | 100 | 60.0 | 100.0 |
| 1 | 0 | 3 | 3 | |||
| Total | 98 | 5 | 100 | |||
| mix 2 | 0 | 98 | 4 | 102 | 20.0 | 100.0 |
| 1 | 0 | 1 | 1 | |||
| Total | 98 | 5 | 103 | |||