| Literature DB >> 20380760 |
Matthew A Cleveland1, Selma Forni1, Nader Deeb1, Christian Maltecca2.
Abstract
BACKGROUND: Bayesian approaches for predicting genomic breeding values (GEBV) have been proposed that allow for different variances for individual markers resulting in a shrinkage procedure that uses prior information to coerce negligible effects towards zero. These approaches have generally assumed application to high-density genotype data on all individuals, which may not be the case in practice. In this study, three approaches were compared for their predictive power in computing GEBV when training at high SNP marker density and predicting at high or low densities: the well- known Bayes-A, a generalization of Bayes-A where scale and degrees of freedom are estimated from the data (Student-t) and a Bayesian implementation of the Lasso method. Twelve scenarios were evaluated for predicting GEBV using low-density marker subsets, including selection of SNP based on genome spacing or size of additive effect and the inclusion of unknown genotype information in the form of genotype probabilities from pedigree and genotyped ancestors.Entities:
Year: 2010 PMID: 20380760 PMCID: PMC2857848 DOI: 10.1186/1753-6561-4-S1-S6
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Number of SNPs included in the calculation of genomic breeding values in each low-density scenario
| Scenario | Evenly-spaceda | Largest effectsb | Genotype | Total |
|---|---|---|---|---|
| probabilitiesc | ||||
| EVEN_19 | 19 | 19 | ||
| EVEN_38 | 38 | 38 | ||
| EVEN_76 | 76 | 76 | ||
| SIG_19 | 19 | 19 | ||
| SIG_38 | 38 | 38 | ||
| SIG_76 | 76 | 76 | ||
| EVEN_GP_19 | 19 | 434 | 453 | |
| EVEN_GP_38 | 38 | 415 | 453 | |
| EVEN_GP_76 | 76 | 377 | 453 | |
| SIG_GP_19 | 19 | 434 | 453 | |
| SIG_GP_38 | 38 | 415 | 453 | |
| SIG_GP_76 | 76 | 377 | 453 | |
aSelected by taking the mth SNPs from ordered list and thus SNP were approximately evenly-spaced
bSelected by taking the top m SNPs from a list ordered by absolute effect, for each analysis method
cGenotype probabilities were used in place of actual genotypes for all SNPs that don't fall into one of the other categories, within a scenario
Correlations between genomic breeding values and breeding values from a traditional animal model for animals in the prediction set (without phenotypes) and coefficients of regression of traditional on genomic breeding values, for t530 and t600.
| t530 | t600 | |||
|---|---|---|---|---|
| 0.673 | 0.893 | 0.674 | 0.880 | |
| 0.718 | 1.019 | 0.720 | 1.010 | |
| 0.736 | 1.061 | 0.737 | 1.072 | |
Correlations between genomic breeding values and breeding values from different low SNP-density approaches (and change in correlation compared to original full marker model), where all SNP effects are estimated in the same high SNP-density training set, for t530 and t600.
| t530 | t600 | |||||
|---|---|---|---|---|---|---|
| EVEN_19 | 0.255 | 0.142 | 0.195 | -0.128 | 0.098 | 0.173 |
| (-0.418) | (-0.846) | (-0.594) | (-0.532) | (-0.622) | (-0.564) | |
| EVEN_38 | 0.481 | 0.494 | 0.528 | 0.469 | 0.485 | 0.522 |
| (-0.192) | (-0.249) | (-0.242) | (-0.180) | (-0.235) | (-0.215) | |
| EVEN_76 | 0.490 | 0.544 | 0.586 | 0.472 | 0.532 | 0.584 |
| (-0.183) | (-0.246) | (-0.192) | (-0.130) | (-0.188) | (-0.153) | |
| SIG_19 | 0.663 | 0.699 | 0.709 | 0.669 | 0.692 | 0.709 |
| (-0.010) | (-0.049) | (-0.037) | (0.025) | (-0.028) | (-0.028) | |
| SIG_38 | 0.664 | 0.703 | 0.713 | 0.669 | 0.707 | 0.721 |
| (-0.009) | (-0.049) | (-0.033) | (0.029) | (-0.013) | (-0.016) | |
| SIG_76 | 0.667 | 0.709 | 0.711 | 0.672 | 0.712 | 0.729 |
| (-0.006) | (-0.046) | (-0.027) | (0.035) | (-0.008) | (-0.008) | |
| EVEN_GP_19 | 0.937 | 0.967 | 0.980 | 0.928 | 0.967 | 0.978 |
| (0.264) | (0.210) | (0.231) | (0.293) | (0.247) | (0.241) | |
| EVEN_GP_38 | 0.733 | 0.785 | 0.861 | 0.736 | 0.789 | 0.862 |
| (0.060) | (0.018) | (-0.049) | (0.111) | (0.069) | (0.125) | |
| EVEN_GP_76 | 0.733 | 0.786 | 0.854 | 0.736 | 0.789 | 0.856 |
| (0.060) | (0.018) | (-0.050) | (0.112) | (0.069) | (0.119) | |
| SIG_GP_19 | 0.674 | 0.730 | 0.802 | 0.675 | 0.735 | 0.798 |
| (0.001) | (0.043) | (-0.006) | (0.056) | (0.015) | (0.061) | |
| SIG_GP_38 | 0.673 | 0.728 | 0.783 | 0.675 | 0.731 | 0.791 |
| (0) | (-0.043) | (-0.008) | (0.054) | (0.011) | (0.054) | |
| SIG_GP_76 | 0.673 | 0.724 | 0.767 | 0.674 | 0.729 | 0.769 |
| (0) | (-0.044) | (-0.012) | (0.050) | (0.009) | (0.032) | |
Figure 1SNP effects estimated by Bayes-A, Student-t and Lasso for t600, by genome location (cM).
Accuracy of genomic breeding values using three methods, as the correlation between true and predicted breeding values, for animals in the prediction set using all markers (ALL) and using alternative low-density approaches, for t600.
| Scenario |
|
|
|
|---|---|---|---|
| ALL | 0.916 | 0.945 | 0.916 |
| EVEN_19 | 0.040 | 0.206 | 0.258 |
| EVEN_38 | 0.732 | 0.738 | 0.738 |
| EVEN_76 | 0.734 | 0.761 | 0.758 |
| SIG_19 | 0.913 | 0.931 | 0.910 |
| SIG_38 | 0.915 | 0.938 | 0.914 |
| SIG_76 | 0.915 | 0.943 | 0.921 |
| EVEN_GP_19 | 0.658 | 0.674 | 0.671 |
| EVEN_GP_38 | 0.833 | 0.84 | 0.817 |
| EVEN_GP_76 | 0.834 | 0.846 | 0.825 |
| SIG_GP_19 | 0.914 | 0.937 | 0.914 |
| SIG_GP_38 | 0.915 | 0.940 | 0.917 |
| SIG_GP_76 | 0.916 | 0.943 | 0.920 |