| Literature DB >> 29298666 |
Peng Guo1,2, Bo Zhu1, Hong Niu1, Zezhao Wang1, Yonghu Liang1, Yan Chen1, Lupei Zhang1, Hemin Ni3, Yong Guo3, El Hamidi A Hay4, Xue Gao1, Huijiang Gao1, Xiaolin Wu5,6, Lingyang Xu7, Junya Li8.
Abstract
BACKGROUND: Running multiple-chain Markov Chain Monte Carlo (MCMC) provides an efficient parallel computing method for complex Bayesian models, although the efficiency of the approach critically depends on the length of the non-parallelizable burn-in period, for which all simulated data are discarded. In practice, this burn-in period is set arbitrarily and often leads to the performance of far more iterations than required. In addition, the accuracy of genomic predictions does not improve after the MCMC reaches equilibrium.Entities:
Keywords: Bayesian models; Convergence diagnosis; Genomic prediction; High-performance computing; Tunable burn-in
Mesh:
Year: 2018 PMID: 29298666 PMCID: PMC5751823 DOI: 10.1186/s12859-017-2003-3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Speedup ratios of parallel MCMC (>1 chain) over sequential MCMC (1 chain) in simulation studies under the three scenarios: a Scenario 1 (h2 = 0.1; 4000 SNPs; 200 QTLs), b Scenario 2 (h2 = 0.1; 10,000 SNPs; 200 QTLs), and c Scenario 3 (h2 = 0.1; 40,000 SNPs; 200 QTLs). Expected speedup ratios were calculated under the assumption that MCMC chains were 100% parallelizable (i.e., without burn-in)
Fig. 2Trace plots of the simulated residual variance using various computational forms of (a) BayesA and (b) BayesCπ. The total length of MCMC was 50,000 iterations, and the length of the fixed burn-in period was 5000 iterations
Fig. 3Correlations of the estimated SNP effects in simulation scenario 2. Seq-BayesA = sequential BayesA; Seq-BayesCπ = sequential BayesCπ; FBM-BayesA = fixed burn-in, multiple-chain BayesA; TBM-BayesA = tunable burn-in, multiple-chain BayesA; FBM-BayesCπ = fixed burn-in, multiple-chain BayesCπ; TBM-BayesCπ = tunable burn-in, multiple-chain BayesCπ
Genomic predictive accuracies obtained using FBM-BayesA, TBM-BayesA, FBM-BayesCπ, TBM-BayesCπ, and GBLUP in Scenario 1
| Chains | FBM-BayesA | TBM-BayesA | FBM-BayesCπ | TBM-BayesCπ | GBLUP |
|---|---|---|---|---|---|
| 1 | 0.5239 | 0.5231 | 0.6316 | 0.6317 | 0.6016 |
| 2 | 0.5227 | 0.5229 | 0.6301 | 0.6296 | |
| 4 | 0.5230 | 0.5229 | 0.6304 | 0.6314 | |
| 6 | 0.5230 | 0.5230 | 0.6310 | 0.6304 | |
| 8 | 0.5231 | 0.5232 | 0.6313 | 0.6311 | |
| 10 | 0.5232 | 0.5231 | 0.6309 | 0.6307 | |
| 12 | 0.5228 | 0.5232 | 0.6315 | 0.6305 | |
| 14 | 0.5230 | 0.5231 | 0.6306 | 0.6313 | |
| 16 | 0.5230 | 0.5231 | 0.6307 | 0.6303 | |
| 18 | 0.5230 | 0.5231 | 0.6310 | 0.6311 |
FBM-BayesA fixed burn-in, multiple-chain BayesA, TBM-BayesA tunable burn-in, multiple-chain BayesA, FBM-BayesCπ fixed burn-in, multiple-chain BayesCπ, TBM-BayesCπ tunable burn-in, multiple-chain BayesCπ, and Chains number of parallel MCMC running for each genomic prediction model. Simulation parameters are as follows: population size = 1000; number of QTL = 200; heritability = 0.1; number of chromosomes = 5; and number of markers per chromosome = 4000
Genomic predictive accuracies obtained using FBM-BayesA, TBM-BayesA, FBM-BayesCπ, TBM-BayesCπ, and GBLUP in Scenario 2
| Chains | FBM-BayesA | TBM-BayesA | FBM-BayesCπ | TBM-BayesCπ | GBLUP |
|---|---|---|---|---|---|
| 1 | 0.846777 | 0.846761 | 0.937837 | 0.937312 | 0.833173 |
| 2 | 0.846731 | 0.846709 | 0.937741 | 0.937161 | |
| 4 | 0.846797 | 0.846855 | 0.937449 | 0.937160 | |
| 6 | 0.846904 | 0.846855 | 0.938563 | 0.938017 | |
| 8 | 0.846559 | 0.846884 | 0.938305 | 0.938319 | |
| 10 | 0.846770 | 0.846812 | 0.938232 | 0.938763 | |
| 12 | 0.846862 | 0.846824 | 0.938997 | 0.938347 | |
| 14 | 0.846814 | 0.846832 | 0.938535 | 0.938426 | |
| 16 | 0.846844 | 0.846657 | 0.938852 | 0.938385 | |
| 18 | 0.846820 | 0.846854 | 0.938823 | 0.938210 |
Simulation parameters are as follows: population size = 1000; number of QTL = 200; heritability = 0.5; number of chromosomes = 10; and number of markers per chromosome = 5000
Genomic predictive accuracies obtained using FBM-BayesA, TBM-BayesA, FBM-BayesCπ, TBM-BayesCπ, and GBLUP in Scenario 3
| Chains | FBM-BayesA | TBM-BayesA | FBM-BayesCπ | TBM-BayesCπ | GBLUP |
|---|---|---|---|---|---|
| 1 | 0.7717 | 0.7717 | 0.8415 | 0.8415 | 0.7632 |
| 2 | 0.7716 | 0.7717 | 0.8421 | 0.8416 | |
| 4 | 0.7716 | 0.7717 | 0.8417 | 0.8418 | |
| 6 | 0.7715 | 0.7717 | 0.8413 | 0.8418 | |
| 8 | 0.7718 | 0.7719 | 0.8415 | 0.8514 | |
| 10 | 0.7720 | 0.7720 | 0.8411 | 0.8419 | |
| 12 | 0.7720 | 0.7720 | 0.8418 | 0.8415 | |
| 14 | 0.7720 | 0.7716 | 0.8413 | 0.8418 | |
| 16 | 0.7718 | 0.7720 | 0.8415 | 0.8417 | |
| 18 | 0.7718 | 0.7718 | 0.8411 | 0.8416 |
Simulation parameters are as follows: population size = 2000; number of QTL = 200; heritability = 0.3; number of chromosomes = 5; and number of markers per chromosome = 40,000
Heritability estimates and predictive accuracies of four quantitative traits in a Chinese Simmental cattle population
| Trait | h2 | Correlations | Correlations divided by square root of heritability | ||||
|---|---|---|---|---|---|---|---|
| GBLUP | TBM-BayesA | TBM-BayesCπ | GBLUP | TBM-BayesA | TBM-BayesCπ | ||
| CBFT | 0.10 | 0.100 | 0.105 | 0.106 | 0.316 | 0.332 | 0.337 |
| CW | 0.45 | 0.266 | 0.271 | 0.268 | 0.397 | 0.404 | 0.399 |
| SLW | 0.24 | 0.202 | 0.213 | 0.215 | 0.413 | 0.435 | 0.440 |
| ADG | 0.47 | 0.214 | 0.204 | 0.206 | 0.312 | 0.297 | 0.301 |
| Mean | 0.196 | 0.198 | 0.199 | 0.360 | 0.367 | 0.369 | |
CBFT carcass back fat thickness, SLW strip loin weight, CW carcass weight, ADG average daily gain, and h heritability