| Literature DB >> 30321482 |
Zhiwu Dan1, Yunping Chen1, Yanghong Xu1, Junran Huang1, Jishuai Huang1, Jun Hu1, Guoxin Yao2, Yingguo Zhu1, Wenchao Huang1.
Abstract
Marker-based prediction holds great promise for improving current plant and animal breeding efficiencies. However, the predictabilities of complex traits are always severely affected by negative factors, including distant relatedness, environmental discrepancies, unknown population structures, and indeterminate numbers of predictive variables. In this study, we utilised two independent F1 hybrid populations in the years 2012 and 2015 to predict rice thousand grain weight (TGW) using parental untargeted metabolite profiles with a partial least squares regression method. A stable predictive model for TGW was built based on hybrids from the population in 2012 (r = 0.75) but failed to properly predict TGW for hybrids from the population in 2015 (r = 0.27). After integrating hybrids from both populations into the training set, the TGW of hybrids could be predicted but was largely dependent on population structures. Then, core hybrids from each population were determined by principal component analysis and the TGW of hybrids in both environments were successfully predicted (r > 0.60). Moreover, adjusting the population structures and numbers of predictive analytes increased TGW predictability for hybrids in 2015 (r = 0.72). Our study demonstrates that the TGW of F1 hybrids across environments can be accurately predicted based on parental untargeted metabolite profiles with a core hybridisation strategy in rice. Metabolic biomarkers identified from early developmental stage tissues, which are grown under experimental conditions, may represent a workable approach towards the robust prediction of major agronomic traits for climate-adaptive varieties.Entities:
Keywords: core hybrids; grain weight; metabolic markers; partial least squares regression; prediction; rice (Oryza sativa)
Mesh:
Substances:
Year: 2018 PMID: 30321482 PMCID: PMC6587747 DOI: 10.1111/pbi.13024
Source DB: PubMed Journal: Plant Biotechnol J ISSN: 1467-7644 Impact factor: 9.803
Figure 1Grain performance and selection of metabolic markers for thousand grain weight. Eighteen representative rice cultivars (including typical indica, intermediate types and typical japonica) were selected as parents of a hybrid population with a complete diallel cross design in 2012. These cultivars had different grain performance (a) and thousand grain weight (b). The order of grains displayed in (a) is the same as that of grain weight in (b). The white bar in (a) represents 1 cm. (c) Box plot for thousand grain weight (TGW) of hybrids from the population in 2012. The hybrid population was divided into three subgroups according to values of TGW, namely, large (>27 g/1000 grains), medium (24–27 g/1000 grains), and small (<24 g/1000 grains). (d) Distribution of hybrids with large and small TGW. PCA and PLS‐DA were performed with predictive analytes for hybrids with large and small TGW. Pairwise score plots between the top four components of PCA and PLS‐DA are shown on the left and right respectively. Green crisscrosses represent hybrids with small TGW, and red triangles represent hybrids with large TGW.
Figure 2Population relatedness, environmental effects, and population structures had negative effects on the predictability of thousand grain weight. (a) Correlation between the observed and predicted thousand grain weight with all hybrids from the population in 2012 as the training set. (b) Correlation between the observed and predicted thousand grain weight with the first half of the population in 2012 and half of the hybrids from the population in 2015 as the training set. (c) Correlation between the observed and predicted thousand grain weight when reversing the validation set in (b) as the training set. Red circles represent hybrids from the population in 2015, and blue squares represent hybrids from the population in 2012.
Figure 3Prediction of thousand grain weight with core hybrids as the training set. (a) Distribution of core hybrids and “non‐core” hybrids. PCA was performed on predictive analytes, which were calculated for hybrids from both populations. The selected core hybrids are indicated by red triangles, and “non‐core” hybrids are indicated by green crisscrosses. (b) Correlation between the observed and predicted thousand grain weight with core hybrids from both populations (3N2012 & 3N2015) as the training set. Predictabilities of thousand grain weight for hybrids from populations in 2012 and 2015 with All2012 & 2N2015 (c), 2N2012 & 2N2015 (d) and 3N2012 & 2N2015 (e). These three different population structures are shown to further investigate the effect of population structures on predictability.
Figure 4Effect of the numbers of metabolic markers on the predictability of thousand grain weight. (a) The relationship between the numbers of predictive analytes and predictabilities of TGW for hybrids from both populations. (b) Correlations between the observed and predicted thousand grain weights for hybrids from both populations with 401 metabolic markers. Based on the selected 401 analytes, PLS was conducted again to further filter low contribution analytes with VIP thresholds corresponding to 0.35 (c), 0.79 (d), and 1.0 (e).