| Literature DB >> 34643760 |
Haixiao Hu1, Malachy T Campbell2, Trevor H Yeats2, Xuying Zheng2, Daniel E Runcie3, Giovanny Covarrubias-Pazaran4, Corey Broeckling5, Linxing Yao5, Melanie Caffe-Treml6, Lucı A Gutiérrez7, Kevin P Smith8, James Tanaka2, Owen A Hoekenga9, Mark E Sorrells2, Michael A Gore2, Jean-Luc Jannink2,10.
Abstract
KEY MESSAGE: Integration of multi-omics data improved prediction accuracies of oat agronomic and seed nutritional traits in multi-environment trials and distantly related populations in addition to the single-environment prediction. Multi-omics prediction has been shown to be superior to genomic prediction with genome-wide DNA-based genetic markers (G) for predicting phenotypes. However, most of the existing studies were based on historical datasets from one environment; therefore, they were unable to evaluate the efficiency of multi-omics prediction in multi-environment trials and distantly related populations. To fill those gaps, we designed a systematic experiment to collect omics data and evaluate 17 traits in two oat breeding populations planted in single and multiple environments. In the single-environment trial, transcriptomic BLUP (T), metabolomic BLUP (M), G + T, G + M, and G + T + M models showed greater prediction accuracy than GBLUP for 5, 10, 11, 17, and 17 traits, respectively, and metabolites generally performed better than transcripts when combined with SNPs. In the multi-environment trial, multi-trait models with omics data outperformed both counterpart multi-trait GBLUP models and single-environment omics models, and the highest prediction accuracy was achieved when modeling genetic covariance as an unstructured covariance model. We also demonstrated that omics data can be used to prioritize loci from one population with omics data to improve genomic prediction in a distantly related population using a two-kernel linear model that accommodated both likely casual loci with large-effect and loci that explain little or no phenotypic variance. We propose that the two-kernel linear model is superior to most genomic prediction models that assume each variant is equally likely to affect the trait and can be used to improve prediction accuracy for any trait with prior knowledge of genetic architecture.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34643760 PMCID: PMC8580906 DOI: 10.1007/s00122-021-03946-4
Source DB: PubMed Journal: Theor Appl Genet ISSN: 0040-5752 Impact factor: 5.699
Fig. 1Neighbor-joining tree of 568 oat lines in the Diversity and Elite panels. Different panels are shown in different colors (darkblue, Diversity panel; red, Elite panel, light blue, lines in common)
Fig. 2Distribution of prediction accuracy of the 17 phenotypic traits in the Diversity panel across 50 re-sampling runs. For each trait, boxplots with different colors represent prediction models. Medians of percent change in prediction accuracy of omics models relative to GBLUP are indicated below each box in blue if positive and in red if negative. The Wilcoxon Signed Rank was applied to test difference in prediction accuracy between each omics model and the GBLUP model, and significance levels are indicated above each box. *** = significant at P < 0.001, ** = significant at P < 0.01, * = significant at P < 0.05, NS = not significant. G = genomic BLUP, T = transcriptomic BLUP, M = metabolomic BLUP
Fig. 3Distribution of prediction accuracy of the 15 phenotypic traits in the Elite panel across 50 re-sampling runs estimated by multi-trait models. For each trait, boxplots with different colors represent models. Medians of percent change in prediction accuracy of M and G + M models relative to the G model are indicated below each box in blue if positive and in red if negative. For each model, the uppercase letters before and after the hyphen represent genetic and residual covariance structures: D = diagonal, UN = unstructured, FA = factor-analytic. The Wilcoxon Signed Rank was applied to test difference in prediction accuracy between each omics model and the GBLUP model, and significance levels are indicated above each box. *** = significant at P < 0.001, ** = significant at P < 0.01, * = significant at P < 0.05, NS = not significant
Fig. 4Prediction accuracy of the 10 fatty acid traits in the Elite panel estimated by GBLUP, BayesB and two-kernel BLUP models across 50 re-sampling runs. For each trait, barplots with different colors represent models. Means of percent change in prediction accuracy of all other models relative to GBLUP are indicated above each bar (in blue if positive, in red if negative, and in black if zero). MK-Network = network-based multiple-kernel prediction. The Wilcoxon Signed Rank was applied to test difference in prediction accuracy between other models and the GBLUP model, and significance levels are indicated on each bar. *** = significant at P < 0.001, ** = significant at P < 0.01, * = significant at P < 0.05, NS = not significant