| Literature DB >> 26582509 |
Boby Mathew1, Anna Marie Holand2, Petri Koistinen3, Jens Léon4, Mikko J Sillanpää5.
Abstract
KEY MESSAGE: A novel reparametrization-based INLA approach as a fast alternative to MCMC for the Bayesian estimation of genetic parameters in multivariate animal model is presented. ABSTRACT: Multi-trait genetic parameter estimation is a relevant topic in animal and plant breeding programs because multi-trait analysis can take into account the genetic correlation between different traits and that significantly improves the accuracy of the genetic parameter estimates. Generally, multi-trait analysis is computationally demanding and requires initial estimates of genetic and residual correlations among the traits, while those are difficult to obtain. In this study, we illustrate how to reparametrize covariance matrices of a multivariate animal model/animal models using modified Cholesky decompositions. This reparametrization-based approach is used in the Integrated Nested Laplace Approximation (INLA) methodology to estimate genetic parameters of multivariate animal model. Immediate benefits are: (1) to avoid difficulties of finding good starting values for analysis which can be a problem, for example in Restricted Maximum Likelihood (REML); (2) Bayesian estimation of (co)variance components using INLA is faster to execute than using Markov Chain Monte Carlo (MCMC) especially when realized relationship matrices are dense. The slight drawback is that priors for covariance matrices are assigned for elements of the Cholesky factor but not directly to the covariance matrix elements as in MCMC. Additionally, we illustrate the concordance of the INLA results with the traditional methods like MCMC and REML approaches. We also present results obtained from simulated data sets with replicates and field data in rice.Entities:
Mesh:
Year: 2015 PMID: 26582509 PMCID: PMC4733146 DOI: 10.1007/s00122-015-2622-x
Source DB: PubMed Journal: Theor Appl Genet ISSN: 0040-5752 Impact factor: 5.699
Fig. 1Box plots for the estimation error (difference between the true and estimated values) of the variance components using 50 simulation replicates with the high heritability dataset. Here the Y-axis scale corresponds to the difference between the true simulated values and the estimated values, whereas X-axis corresponds to different estimation methods
Fig. 2Box plots for the estimation error (difference between the true and estimated values) of the covariance components using 50 simulation replicates with the high heritability dataset. Here the Y-axis scale corresponds to the difference between the true simulated values and the estimated values, whereas X-axis corresponds to different estimation methods
Fig. 3Box plots for the estimation error (difference between the true and estimated values) of the variance components using 50 simulation replicates with the low heritability dataset. Here the Y-axis scale corresponds to the difference between the true simulated values and estimated values, whereas X-axis corresponds to different estimation methods
Fig. 4Box plots for the estimation error (difference between the true and estimated values) of the covariance components using 50 simulation replicates with the low heritability dataset. Here the Y-axis scale corresponds to the difference between the true simulated values and the estimated values, whereas X-axis corresponds to different estimation methods
Narrow-sense heritability estimates () for the simulated datasets (averaged over 50 simulation replicates) and the real dataset with the different estimation methods
| REML | MCMC | INLA | True | |
|---|---|---|---|---|
| High heritability dataset | ||||
| Trait1 | 0.49 | 0.49 | 0.49 | 0.50 |
| Trait2 | 0.61 | 0.62 | 0.61 | 0.60 |
| Trait3 | 0.71 | 0.72 | 0.71 | 0.71 |
| Low heritability dataset | ||||
| Trait1 | 0.15 | 0.21 | 0.20 | 0.20 |
| Trait2 | 0.17 | 0.23 | 0.22 | 0.20 |
| Trait3 | 0.17 | 0.23 | 0.21 | 0.22 |
| Real data |
Spindel et al. ( | |||
| PH | 0.35 | 0.35 | 0.35 | 0.35 |
| FL | 0.44 | 0.44 | 0.44 | 0.43 |
| YLD | 0.32 | 0.32 | 0.35 | 0.32 |
Posterior mean estimates of R-INLA and posterior modes from MCMCglmm were used for the calculation. Additionally, the true heritability estimates for the simulated dataset and the heritability estimates reported by Spindel et al. (2015) are also shown
Estimated genetic and residual correlation coefficients (), between each traits ( to ) for both simulated dataset using REML, INLA and MCMC estimates
| Correlation | REML | MCMC | INLA | True |
|---|---|---|---|---|
| High heritability dataset | ||||
| | 0.31(0.28, 0.32) | 0.32(0.28, 0.33) | 0.30(0.27, 0.32) | 0.33 |
| | 0.41(0.37, 0.43) | 0.40(0.37, 0.43) | 0.40(0.38, 0.43) | 0.38 |
| | 0.45(0.44, 0.48) | 0.45(0.44, 0.47) | 0.47(0.44, 0.48) | 0.46 |
| | 0.51(0.50, 0.53) | 0.52(0.51, 0.53) | 0.52(0.51, 0.53) | 0.50 |
| | 0.48(0.46, 0.50) | 0.49(0.46, 0.51) | 0.49(0.47, 0.50) | 0.50 |
| | 0.50(0.48, 0.51) | 0.51(0.48, 0.52) | 0.50(0.48, 0.52) | 0.50 |
| Low heritability dataset | ||||
| | −0.19(−0.41, −0.03) | −0.40(−0.50, −0.31) | −0.38(−0.49, −0.29) | −0.34 |
| | 0.65(0.64, 0.66) | 0.44(0.39, 0.47) | 0.47(0.43, 0.51) | 0.42 |
| | 0.49(0.48, 0.50) | 0.43(0.39, 0.44) | 0.42(0.40, 0.45) | 0.45 |
| | −0.22(−0.24, −0.19) | −0.19(−0.21, −0.17) | −0.20(−0.21, −0.17) | −0.21 |
| | 0.02(0.01, 0.04) | 0.03(0.01, 0.04) | 0.03(0.02, 0.04) | 0.04 |
| | 0.11(0.09, 0.12) | 0.10(0.08, 0.11) | 0.10(0.08, 0.11) | 0.10 |
In order to calculate genetic (a) and residual (e) correlation coefficients, 50 simulation replicates were used and the true simulated values are also given. Additionally, the empirical 95 % confidence intervals for estimates are given in brackets
The additive genetic variance ( and the error variance () components obtained using a univariate INLA analysis (INLA-U) using the simulated dataset with negative covariance
| Variance parameter | INLA-U | INLA-M | True |
|---|---|---|---|
|
| 5.03 | 4.92 | 5.00 |
|
| 5.99 | 6.45 | 7.00 |
|
| 10.15 | 10.34 | 10.00 |
|
| 20.03 | 20.16 | 20.00 |
|
| 28.91 | 28.93 | 28.00 |
|
| 35.08 | 35.30 | 35.00 |
In order to calculate the INLA estimates, mean of 50 simulation replicates were used. True simulated values and the estimates from the multivariate INLA (INLA-M) analysis are also given
The additive genetic variance () and the error variance () for the field data obtained from the REML analysis and the posterior mode estimates obtained from the MCMCglmm package along with R-INLA posterior mean estimates are presented
| (Co)variance parameter | REML | MCMC | INLA |
|---|---|---|---|
|
| 22.92 | 21.85 | 24.09 |
|
| 6.50 | 6.56 | 6.58 |
|
| 34,285.92 | 35,890.51 | 40,218.69 |
|
| 42.34 | 40.91 | 43.68 |
|
| 8.17 | 8.23 | 8.47 |
|
| 71,698.64 | 73,983.14 | 74,211.42 |
|
| 4.57 | 4.48 | 4.60 |
|
| −285.05 | −237.21 | −263.99 |
|
| −158.50 | −120.01 | −161.97 |
|
| 2.41 | 2.78 | 2.57 |
|
| 106.36 | 76.83 | 83.21 |
|
| −38.75 | −23.82 | −44.09 |
The additive genetic covariance () and the error covariance () between each pair of three quantitative traits (PH, FL, YLD) are also shown