| Literature DB >> 21624169 |
Xiaochen Sun1, David Habier, Rohan L Fernando, Dorian J Garrick, Jack Cm Dekkers.
Abstract
BACKGROUND: Bayesian methods allow prediction of genomic breeding values (GEBVs) using high-density single nucleotide polymorphisms (SNPs) covering the whole genome with effective shrinkage of SNP effects using appropriate priors. In this study we applied a modification of the well-known BayesA and BayesB methods to estimate the proportion of SNPs with zero effects (π) and a common variance for non-zero effects. The method, termed BayesCπ, was used to predict the GEBVs of the last generation of the QTLMAS2010 data. The accuracy of GEBVs from various methods was estimated by the correlation with phenotypes in the last generation. The methods were BayesCPi and BayesB with different π values, both with and without polygenic effects, and best linear unbiased prediction using an animal model with a genomic or numerator relationship matrix. Positions of quantitative trait loci (QTLs) were identified based on the variances of GEBVs for windows of 10 consecutive SNPs. We also proposed a novel approach to set significance thresholds for claiming QTL in this specific case by using pedigree-based simulation of genotypes. All analyses were focused on detecting and evaluating QTL with additive effects.Entities:
Year: 2011 PMID: 21624169 PMCID: PMC3103198 DOI: 10.1186/1753-6561-5-S3-S13
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Prediction accuracy of GEBV, correlation of GEBV with TBV, correlation of GEBV with genotypic value (g), regression coefficient of phenotype (y) on GEBV, regression coefficient of TBV on GEBV, and regression coefficient of genotypic value on GEBV.
| Methods | Correlation of GEBV with | Regression coefficient on GEBV of | ||||
|---|---|---|---|---|---|---|
| TBV | TBV | |||||
| 0.545 | 0.410 | 0.538 | 1.156 | 1.003 | 1.005 | |
| No Poly | 0.746 | 0.610 | 0.753 | 1.006 | 0.949 | 0.895 |
| Poly | 0.737 | 0.597 | 0.752 | 0.961 | 0.898 | 0.863 |
| No Poly | 0.781 | 0.632 | 0.776 | 1.018 | 0.950 | 0.892 |
| Poly | 0.778 | 0.628 | 0.783 | 0.984 | 0.916 | 0.873 |
| No Poly | 0.788 | 0.640 | 0.787 | 1.023 | 0.960 | 0.901 |
| Poly | 0.784 | 0.634 | 0.793 | 0.983 | 0.916 | 0.875 |
| No Poly | 0.793 | 0.646 | 0.795 | 1.031 | 0.967 | 0.909 |
| Poly | 0.790 | 0.636 | 0.797 | 0.981 | 0.911 | 0.872 |
| No Poly | 0.796 | 0.650 | 0.800 | 1.011 | 0.952 | 0.895 |
| Poly | 0.796 | 0.642 | 0.804 | 0.989 | 0.921 | 0.880 |
| No Poly | – | 0.679 | 0.894 | – | 0.959 | 0.965 |
Results are based on training on the first three generations and validation on generation 4 using P-BLUP, G-BLUP, BayesB with different π's, and BayesCπ, and without (No Poly) and with (Poly) polygenic effects.
1Calculated as correlation of phenotype (y) with GEBV, divided by the square root of estimated heritability.
2Training on the first 4 generations and predicting generation 5.
Average number of SNPs (#SNP) fitted in the model, estimated variance components, and estimated heritability (Heritability).
| Methods | #SNP | Estimated variance components | Heritability | ||||
|---|---|---|---|---|---|---|---|
| Marker | Polygenic | Genetic1 | Residual | Total | |||
| – | – | 51.76 | 51.76 | 103.52 | 0.500 | ||
| – | – | 54.44 | 54.44 | 48.68 | 103.12 | 0.528 | |
| 10031 | |||||||
| No Poly | 44.54 | – | 44.54 | 54.84 | 99.38 | 0.448 | |
| Poly | 38.53 | 12.09 | 50.62 | 49.04 | 99.66 | 0.508 | |
| 2508 | |||||||
| No Poly | 44.28 | – | 44.28 | 54.08 | 98.36 | 0.450 | |
| Poly | 39.05 | 11.06 | 50.11 | 48.32 | 98.43 | 0.509 | |
| 502 | |||||||
| No Poly | 43.96 | – | 43.96 | 54.16 | 98.12 | 0.448 | |
| Poly | 38.05 | 12.80 | 50.85 | 47.59 | 98.44 | 0.517 | |
| 100 | |||||||
| No Poly | 43.44 | – | 43.44 | 54.58 | 98.02 | 0.443 | |
| Poly | 37.43 | 12.35 | 49.78 | 48.30 | 98.09 | 0.508 | |
| No Poly | 124 | 45.68 | – | 45.68 | 53.63 | 99.31 | 0.460 |
| Poly | 80 | 40.21 | 10.33 | 50.54 | 48.58 | 99.12 | 0.510 |
| No Poly | 92 | 47.13 | – | 47.13 | 53.48 | 100.61 | 0.468 |
Results are based on training on the first three generations and validation on generation 4 using P-BLUP, G-BLUP, BayesB with different π’s, and BayesCπ, and without (No Poly) and with (Poly) polygenic effects.
1Total genetic variance = marker variance + polygenic variance.
2Total QTL variance = residual variance = 51.76 in the QTLMAS2010 dataset.
3Training on the first 4 generations.
Figure 1Variances of GEBVs of 10-SNP windows across the genome. Data sets were generated by permutation (Permuted dataset), simulation with linkage equilibrium in founders (LE simulation dataset), and simulation with initial linkage disequilibrium (LD simulation dataset). The bottom panel show window variances obtained for the original QTLMAS 2010 dataset (Original dataset), as well as the location and variances of true QTLs, along with their mode of inheritance (Additive = additive QTL, Epistatic = epistatic QTL, Imprinted = imprinted QTL). Horizontal lines show the 10% (solid) and 20% (dash) chromosome-wise thresholds for window variance derived from the LD simulation.
Variance components estimated from datasets generated by permutation, simulation with linkage equilibrium in founders (LE simulation), and simulation with initial linkage disequilibrium (LD simulation), and thresholds for 10-SNP window variances based on 10% and 20% chromosome-wise type I error rates.
| Methods | Variance Components | Window variance threshold | |||
| Genotypic | Residual | Total | 10% | 20% | |
| Permutation | 3.15 | 98.59 | 101.74 | 0.0011 | 0.0009 |
| LE simulation | 20.83 | 79.84 | 100.67 | 0.0204 | 0.0094 |
| LD simulation | 17.14 | 83.40 | 100.55 | 0.1645 | 0.0887 |
| Original1 | 47.13 | 53.48 | 100.61 | – | – |
1Estimated from the original QTLMAS2010 dataset using BayesCπ, training on the first 4 generations.