| Literature DB >> 20591149 |
Kadir Kizilkaya1, Dorian J Garrick, Rohan L Fernando, Burcu Mestav, Mehmet A Yildiz.
Abstract
BACKGROUND: The distribution of residual effects in linear mixed models in animal breeding applications is typically assumed normal, which makes inferences vulnerable to outlier observations. In order to mute the impact of outliers, one option is to fit models with residuals having a heavy-tailed distribution. Here, a Student's-t model was considered for the distribution of the residuals with the degrees of freedom treated as unknown. Bayesian inference was used to investigate a bivariate Student's-t (BSt) model using Markov chain Monte Carlo methods in a simulation study and analysing field data for gestation length and birth weight permitted to study the practical implications of fitting heavy-tailed distributions for residuals in linear mixed models.Entities:
Mesh:
Year: 2010 PMID: 20591149 PMCID: PMC2909158 DOI: 10.1186/1297-9686-42-26
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Summary statistics for gestation length (GL) and birth weight (BW) in Italian Piemontese cattle.
| Trait | N | Mean | Minimum | Maximum | SD |
|---|---|---|---|---|---|
| GL (day) | 7,883 | 290 | 260 | 320 | 8.1 |
| BW (kg) | 7,883 | 39.6 | 22 | 56 | 4.1 |
Comparisons of average predictive log-likelihood1 (PLL) from ten replicates between bivariate Student's-t (BSt) and normal (BN) fitted models (in column) for different true simulated models (in rows) with varying residual degrees of freedom (DF).
| Fitted Model3 | ||
|---|---|---|
| True Model2-DF | BS | BN |
| BS | -1,483 | -1,988 |
| BS | -718 | -754 |
| BN-∞ | -284 | -284 |
1Predictive log-likelihood values were reported after adding 14,000
2Used to simulate data
3Used in analysis of simulated data
Average posterior inference on degrees of freedom from ten replicates using the bivariate Student's-t (BSt) fitted model.
| BS | ||||
|---|---|---|---|---|
| True Parameters | True Model1 | PM ± SE3 | 95% PPI4 | ESS5 |
| BS | 4.1 ± 0.06 | [3.6, 4.6] | 1,594 | |
| BS | 13.3 ± 1.18 | [9.8, 19.1] | 294 | |
| BN-∞ | 2377 ± 654 | [2140, 3365] | 14 | |
1Used to simulate data
2Used in analysis of simulated data
3Posterior mean ± Standard Error
495% equal-tailed posterior probability interval based on the 2.5and 97.5percentiles of the posterior density
5Effective sample size
Average posterior inference on sire (co)variances from ten replicates using the bivariate Student's-t (BSt) and normal (BN) fitted models with different residual degrees of freedom (DF).
| Fitted Model2 | |||||||
|---|---|---|---|---|---|---|---|
| BS | BN | ||||||
| True Parameters | True Model1 | PM ± SE3 | 95% PPI4 | ESS5 | PM ± SE | 95% PPI | ESS |
| BS | 2.24 ± 0.16 | [1.32, 3.62] | 20,983 | 2.31 ± 0.18 | [1.30, 3.83] | 20,694 | |
| BS | 2.44 ± 0.14 | [1.48, 3.88] | 24,767 | 2.39 ± 0.14 | [1.44, 3.81] | 25,794 | |
| BN-∞ | 2.49 ± 0.17 | [1.52, 3.93] | 26,715 | 2.49 ± 0.17 | [1.53, 3.93] | 28,154 | |
| BS | 1.41 ± 0.17 | [0.40, 2.76] | 23,326 | 1.43 ± 0.18 | [0.33, 2.89] | 23,146 | |
| BS | 1.93 ± 0.19 | [0.83, 3.47] | 27,428 | 1.89 ± 0.19 | [0.80, 3.42] | 28,683 | |
| BN-∞ | 1.77 ± 0.22 | [0.72, 3.23] | 28,441 | 1.77 ± 0.21 | [0.71, 3.22] | 30,064 | |
| BS | 4.23 ± 0.30 | [2.59, 6.70] | 24,192 | 4.33 ± 0.33 | [2.57, 6.99] | 23,978 | |
| BS | 4.77 ± 0.29 | [2.97, 7.48] | 27,674 | 4.80 ± 0.30 | [2.98, 7.55] | 29,102 | |
| BN-∞ | 4.46 ± 0.39 | [2.79, 6.96] | 29,013 | 4.45 ± 0.39 | [2.78, 6.98] | 29,365 | |
1Used to simulate data
2Used in analysis of simulated data
3Posterior mean ± Standard Error
495% equal-tailed posterior probability interval based on the 2.5and 97.5percentiles of the posterior density
5Effective sample size
Average posterior inference on herd variances from ten replicates using the bivariate Student's-t (BSt) and normal (BN) fitted models with different residual degrees of freedom (DF).
| Fitted Model2 | |||||||
|---|---|---|---|---|---|---|---|
| BS | BN | ||||||
| True Parameters | True Model1 | PM ± SE3 | 95% PPI4 | ESS5 | PM ± SE | 95% PPI | ESS |
| BS | 1.71 ± 0.10 | [1.07, 2.56] | 12,027 | 1.74 ± 0.14 | [1.01, 2.71] | 10,437 | |
| BS | 1.82 ± 0.12 | [1.19, 2.65] | 16,133 | 1.82 ± 0.12 | [1.18, 2.65] | 16,802 | |
| BN-∞ | 1.71 ± 0.08 | [1.12, 2.48] | 17,543 | 1.70 ± 0.08 | [1.12, 2.47] | 17,537 | |
| BS | 6.33 ± 0.30 | [4.47, 8.79] | 24,120 | 6.49 ± 0.28 | [4.47, 9.17] | 22,810 | |
| BS | 6.69 ± 0.28 | [4.77, 9.22] | 27,704 | 6.72 ± 0.24 | [4.79, 9.28] | 27,956 | |
| BN-∞ | 6.33 ± 0.27 | [4.55, 8.71] | 29,800 | 6.33 ± 0.27 | [4.55, 8.71] | 29,881 | |
1Used to simulate data
2Used in analysis of simulated data
3Posterior mean ± Standard Error
495% equal-tailed posterior probability interval based on the 2.5and 97.5percentiles of the posterior density
5Effective sample size
Average posterior inference on marginal error (co)variances from ten replicates using the bivariate Student's-t (BSt) and normal (BN) fitted models with different residual degrees of freedom (DF).
| Fitted Model2 | |||||||
|---|---|---|---|---|---|---|---|
| BS | BN | ||||||
| True Parameters | True Model1 | PM ± SE3 | 95% PPI4 | ESS5 | PM ± SE | 95% PPI | ESS |
| BS | 30.45 ± 0.51 | [27.44, 34.05] | 3,336 | 30.29 ± 0.44 | [28.61, 32.07] | 42,430 | |
| BS | 17.87 ± 0.17 | [16.75, 19.07] | 9,516 | 17.82 ± 0.16 | [16.83, 18.87] | 43,135 | |
| BN-∞ | 14.94 ± 0.11 | [14.10, 15.82] | 43,387 | 14.94 ± 0.11 | [14.11, 15.82] | 43,204 | |
| BS | 8.20 ± 0.38 | [6.45, 10.10] | 11,384 | 8.37 ± 0.66 | [6.95, 9.83] | 44,553 | |
| BS | 4.61 ± 0.13 | [3.69, 5.56] | 32,416 | 4.57 ± 0.12 | [3.72, 5.44] | 44,370 | |
| BN-∞ | 3.94 ± 0.10 | [3.23, 4.67] | 43,729 | 3.94 ± 0.10 | [3.23, 4.66] | 44,238 | |
| BS | 40.07 ± 0.65 | [35.98, 44.94] | 3,566 | 39.57 ± 0.74 | [37.37, 41.90] | 45,168 | |
| BS | 24.60 ± 0.31 | [23.04, 26.27] | 10,145 | 24.54 ± 0.28 | [23.17, 25.98] | 45,130 | |
| BN-∞ | 20.13 ± 0.18 | [19.00, 21.31] | 42,782 | 20.12 ± 0.18 | [19.00, 21.31] | 45,079 | |
1Used to simulate data
2Used in analysis of simulated data
3Posterior mean ± Standard Error
495% equal-tailed posterior probability interval based on the 2.5and 97.5percentiles of the posterior density
5Effective sample size
Average correlations between true and predicted sire effects from ten replicates using the bivariate Student's-t (BSt) and normal (BN) fitted models with different residual degrees of freedom (DF).
| Fitted Model2 | ||||
|---|---|---|---|---|
| Trait1 | Trait2 | |||
| True Model1-DF | BS | BN | BS | BN |
| BS | 0.90 | 0.87 | 0.92 | 0.90 |
| BS | 0.93 | 0.93 | 0.95 | 0.95 |
| BN-∞ | 0.94 | 0.94 | 0.95 | 0.95 |
1Used to simulate data
2Used in analysis of simulated data
Prediction error variance of sire effects using the bivariate Student's-t (BSt) and normal (BN) fitted models with different residual degrees of freedom (DF).
| Fitted Model2 | ||||
|---|---|---|---|---|
| Trait1 | Trait2 | |||
| True Model1-DF | BS | BN | BS | BN |
| BS | 0.36 | 0.44 | 0.51 | 0.67 |
| BS | 0.29 | 0.30 | 0.41 | 0.44 |
| BN-∞ | 0.23 | 0.23 | 0.33 | 0.33 |
1Used to simulate data
2Used in analysis of simulated data
Posterior inference on sire-MGS (co)variances for gestation length (GL) and birth weight (BW) using the bivariate Student's-t (BSt) and normal (BN) models.
| BS | BN | |||||
|---|---|---|---|---|---|---|
| Parameters | PM1 | 95% PPI2 | ESS3 | PM | 95% PPI | ESS |
| 8.42 | [6.65, 10.43] | 894 | 8.13 | [6.27, 10.31] | 384 | |
| 0.13 | [-0.43, 0.72] | 774 | 0.16 | [-0.48, 0.81] | 496 | |
| 2.75 | [1.77, 3.76] | 567 | 2.73 | [1.63, 3.81] | 323 | |
| -0.54 | [-1.04, -0.04] | 524 | -0.74 | [-1.32, -0.21] | 405 | |
| 1.02 | [0.68, 1.43] | 528 | 1.12 | [0.75, 1.55] | 550 | |
| 0.26 | [-0.13, 0.69] | 429 | 0.40 | [-0.09, 0.90] | 230 | |
| 0.36 | [0.15, 0.58] | 428 | 0.39 | [0.18, 0.62] | 484 | |
| 2.24 | [1.47, 3.16] | 389 | 2.04 | [1.17, 3.05] | 232 | |
| 0.27 | [-0.03, 0.57] | 430 | 0.32 | [-0.01, 0.69] | 336 | |
| 0.53 | [0.34, 0.74] | 457 | 0.59 | [0.38, 0.87] | 371 | |
1Posterior mean
295% equal-tailed posterior probability interval based on the 2.5and 97.5percentiles of the posterior density
3Effective sample size
Posterior inference on herd-year-season (co)variances for gestation length (GL) and birth weight (BW) using the bivariate Student's-t (BSt) and normal (BN) models.
| BS | BN | |||||
|---|---|---|---|---|---|---|
| Parameters | PM1 | 95% PPI2 | ESS3 | PM | 95% PPI | ESS |
| 4.00 | [3.02, 5.10] | 2,122 | 4.21 | [3.00, 5.60] | 1,661 | |
| 2.43 | [2.04, 2.85] | 3,282 | 2.56 | [2.14, 3.00] | 3,403 | |
1Posterior mean
295% equal-tailed posterior probability interval based on the 2.5and 97.5percentiles of the posterior density
3Effective sample size
Posterior inference on marginal residual (co)variances for gestation length (GL) and birth weight (BW) using the bivariate Student's-t (BSt) and normal (BN) models.
| BS | BN | |||||
|---|---|---|---|---|---|---|
| Parameters | PM1 | 95% PPI2 | ESS3 | PM | 95% PPI | ESS |
| 51.86 | [48.43, 55.84] | 2,376 | 48.90 | [47.22, 50.64] | 9,039 | |
| 3.77 | [3.01, 4.56] | 8,213 | 3.20 | [2.63, 3.78] | 11,671 | |
| 13.37 | [12.47, 14.41] | 2,367 | 11.14 | [10.76, 11.53] | 15,789 | |
1Posterior mean
295% equal-tailed posterior probability interval based on the 2.5and 97.5percentiles of the posterior density
3Effective sample size
Figure 1Posterior densities of degrees of freedom obtained from bivariate Student's-. M represents posterior mean, L represents the 2.5percentiles of the posterior density, U represent 97.5percentiles of the posterior density.
Figure 2Posterior densities of direct (D) and maternal (M) heritabilities of gestation length (GL) and birth weight (BW) obtained from bivariate Student's-. h2D and h2M represent direct and maternal heritabilities.
Figure 3Posterior densities of genetic correlations between direct (D) and maternal (M) effects for gestation length (GL) and birth weight (BW) obtained from bivariate Student's-.
Figure 4Distribution of outlier posterior mean values of scale . Distribution of posterior mean values of λless than 0.3 on the left. Distribution of posterior mean values of λless than 0.2 on the right.
Figure 5Scatter plots of posterior means of all and top 100 sire effects for gestation length (GL) and birth weight (BW) in Italian Piemontese cattle, obtained by bivariate Student's-.