| Literature DB >> 28340551 |
Alencar Xavier1, Shizhong Xu2, William Muir3, Katy Martin Rainey4.
Abstract
BACKGROUND: Genome-wide assisted selection is a critical tool for the genetic improvement of plants and animals. Whole-genome regression models in Bayesian framework represent the main family of prediction methods. Fitting such models with a large number of observations involves a prohibitive computational burden. We propose the use of subsampling bootstrap Markov chain in genomic prediction. Such method consists of fitting whole-genome regression models by subsampling observations in each round of a Markov Chain Monte Carlo. We evaluated the effect of subsampling bootstrap on prediction and computational parameters.Entities:
Keywords: Bayesian analysis; Bootstrapping; Genome-wide selection
Mesh:
Year: 2017 PMID: 28340551 PMCID: PMC5366167 DOI: 10.1186/s12859-017-1582-3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Summary of datasets used in this study
| Species | Population type | Trait | n |
|
| Source |
|---|---|---|---|---|---|---|
| Mouse | Heterogeneous stock | Body mass index | 1814 | 5173 | 0.146 | Legarra et al. [ |
| Soybean | Nested Ass. Panel | Grain yield | 1079 | 4307 | 0.345 | Xavier et al. [ |
| Wheat | Diverse panel | Grain yield | 599 | 1209 | 0.434 | Crossa et al. [ |
| Simulation | Experimental F2 | Simulated | 400 | 500 | 0.516 | Technow [ |
Fig. 1Prediction metrics (y axis) varying the amount of data under subsampling (x axis). Average across four datasets. a Bias as the slope between predicted and observed values; b Predictive ability as the correlation between predicted and observed values; c Mean squared prediction error; and d Computation time to fit the linear model
Summary of prediction metrics with for the complete dataset (Complete), and subsampling 50% with replacement (wR), and 33 and 50% without replacement (woR)
| Time (min.) |
| MSPE |
| |
|---|---|---|---|---|
| Complete | 55.90 | 0.4814 | 0.7431 | 1.0058 |
| woR 33% | 27.90 | 0.4794 | 0.7454 | 1.0239 |
| woR 50% | 35.32 | 0.4794 | 0.7447 | 1.0642 |
| wR 50% | 41.84 | 0.4802 | 0.7562 | 0.8161 |
, correlation between observed and predicted value; MSPE, mean squared prediction error; , Prediction bias
Optimal sampling observed for individual datasets to enhance predictive ability (PA) and mean squared prediction error (MSPE). Subsampling performed with (wR) and without replacement (woR)
| Optimal PA | Increase in PA | Optimal MSPE | Decrease in MSPE | |
|---|---|---|---|---|
| Mouse | wR 66% | 2.5% | woR 32% | <0.1% |
| Soybean | woR 25% | 0.1% | woR 25% | 0.1% |
| Wheat | woR 34% | 0.7% | woR 33% | 0.5% |
| Simulated F2 | wR 87% | 0.1% | wR 66% | 0.2% |