| Literature DB >> 25873147 |
J Jiang1, Q Zhang1, L Ma2, J Li3, Z Wang4, J-F Liu1.
Abstract
Predicting organismal phenotypes from genotype data is important for preventive and personalized medicine as well as plant and animal breeding. Although genome-wide association studies (GWAS) for complex traits have discovered a large number of trait- and disease-associated variants, phenotype prediction based on associated variants is usually in low accuracy even for a high-heritability trait because these variants can typically account for a limited fraction of total genetic variance. In comparison with GWAS, the whole-genome prediction (WGP) methods can increase prediction accuracy by making use of a huge number of variants simultaneously. Among various statistical methods for WGP, multiple-trait model and antedependence model show their respective advantages. To take advantage of both strategies within a unified framework, we proposed a novel multivariate antedependence-based method for joint prediction of multiple quantitative traits using a Bayesian algorithm via modeling a linear relationship of effect vector between each pair of adjacent markers. Through both simulation and real-data analyses, our studies demonstrated that the proposed antedependence-based multiple-trait WGP method is more accurate and robust than corresponding traditional counterparts (Bayes A and multi-trait Bayes A) under various scenarios. Our method can be readily extended to deal with missing phenotypes and resequence data with rare variants, offering a feasible way to jointly predict phenotypes for multiple complex traits in human genetic epidemiology as well as plant and livestock breeding.Entities:
Mesh:
Year: 2015 PMID: 25873147 PMCID: PMC4815501 DOI: 10.1038/hdy.2015.9
Source DB: PubMed Journal: Heredity (Edinb) ISSN: 0018-067X Impact factor: 3.821
Number of used markers and LD level across 30 replicates under varied scenarios
| 1 (default) | 0.5 | 0 | 30 | All | 4119 (159) | 0.333 (0.016) |
| Every 10th | 411 (16) | 0.220 (0.018) | ||||
| Every 25th | 164 (6) | 0.136 (0.015) | ||||
| 2 | 0.5 | 0 | 300 | All | 4140 (126) | 0.333 (0.015) |
| 3 | 0.2 | 0 | 30 | All | 4044 (162) | 0.336 (0.013) |
| 4 | 0.5 | −0.2 | 30 | All | 4030 (164) | 0.338 (0.015) |
| 5 | 0.5 | 0.2 | 30 | All | 4071 (141) | 0.334 (0.013) |
Abbreviation: QTLs, quantitative trait loci.
Average=sum(Ai)/30 and std.=sqrt(sum((Ai-Average)2)/29), where Ai is the average LD level in the ith replicate.
Figure 1Prediction accuracies of various models for the high-heritability trait (a) and the low-heritability trait (b) under different LD levels of adjacent markers.
Figure 2Prediction accuracies of various models for the high-heritability trait (a) and the low-heritability trait (b) under scenarios with a varied number of underlying QTLs.
Figure 3Prediction accuracies of various models for the high-heritability trait (a) and the low-heritability trait (b) under scenarios with varied genetic correlations between traits.
Figure 4Prediction accuracies of various models for the high-heritability trait (a) and the low-heritability trait (b) under scenarios with varied error correlations between traits.
Figure 5Prediction accuracies of various models for the 16th QTL-MAS workshop data set. aMeans the best single-trait method officially reported in the 16th QTL-MAS workshop.
Predictive ability of various methods for the heterogeneous stock mice data across 20 replicates
| 1 | %CD4+/CD3+ | 0.609 | 0.616 | 0.620 | 0.625 |
| %CD8+ | 0.663 | 0.664 | 0.668 | 0.671 | |
| 2 | %CD4+ | 0.358 | 0.372 | 0.375 | 0.375 |
| %CD8+ | 0.665 | 0.667 | 0.670 | 0.670 | |