| Literature DB >> 34608951 |
Zhiwu Dan1, Yunping Chen1, Hui Li1, Yafei Zeng1, Wuwu Xu1, Weibo Zhao1, Ruifeng He2, Wenchao Huang1.
Abstract
Understanding the molecular mechanisms underlying complex phenotypes requires systematic analyses of complicated metabolic networks and contributes to improvements in the breeding efficiency of staple cereal crops and diagnostic accuracy for human diseases. Here, we selected rice (Oryza sativa) heterosis as a complex phenotype and investigated the mechanisms of both vegetative and reproductive traits using an untargeted metabolomics strategy. Heterosis-associated analytes were identified, and the overlapping analytes were shown to underlie the association patterns for six agronomic traits. The heterosis-associated analytes of four yield components and plant height collectively contributed to yield heterosis, and the degree of contribution differed among the five traits. We performed dysregulated network analyses of the high- and low-better parent heterosis hybrids and found multiple types of metabolic pathways involved in heterosis. The metabolite levels of the significantly enriched pathways (especially those from amino acid and carbohydrate metabolism) were predictive of yield heterosis (area under the curve = 0.907 with 10 features), and the predictability of these pathway biomarkers was validated with hybrids across environments and populations. Our findings elucidate the metabolomic landscape of rice heterosis and highlight the potential application of pathway biomarkers in achieving accurate predictions of complex phenotypes.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34608951 PMCID: PMC8491067 DOI: 10.1093/plphys/kiab273
Source DB: PubMed Journal: Plant Physiol ISSN: 0032-0889 Impact factor: 8.340
Figure 1Identification of heterosis-associated analytes for six agronomic traits. A, Heterosis of six agronomic traits at the population and individual levels. Five reproductive traits (including yield and four yield components) and one vegetative trait (PH) were recorded. Bars represent standard errors. B, Number of correlations between transformed parental metabolite levels and heterosis. The means of, differences in, and ratios of parental metabolite levels were calculated to perform Pearson correlations with heterosis of the six traits. Correlations with P <0.05 were considered significant. N = 3,746. C, Changes in r values with different numbers of predictive analytes in the PLS regressions. The optimal number of predictive analytes for each trait is marked with a black arrow. D–E, Correlations between the observed and predicted values of heterosis for PH (D) and yield (E) with correspondingly identified heterosis-associated analytes. F, MS/MS spectra of an analyte with peak tag M163T337_NEG and 4-hydroxycinnamic acid standard. G, Correlation between metabolite levels of 4-hydroxycinnamic acid and PH heterosis. H, Venn diagram of heterosis-associated analytes for yield and four yield components. In (A, D, and E) and (G) N = 287. SSR, seed setting rate; TGW, thousand-grain weight; GNP, grain number per panicle; TPP, tiller number per plant; YPP, yield per plant.
Figure 2Connections of heterosis-associated analytes among traits. A, Correlations among heterosis of five reproductive traits and PH. Partial correlations were performed to investigate the contribution of four yield components and PH to yield heterosis. Pearson correlations were conducted to analyze the relationship among the four yield components and PH. Correlation coefficients of the partial and Pearson correlations are indicated with R and r, respectively. *, **, statistically significant at 0.05 and 0.01 levels, respectively; ns, no statistically significant correlation. B, Correlations between metabolite levels of M853T560_NEG and heterosis of seed setting rate and yield. C, Correlations between metabolite levels of M131T16_NEG and heterosis of seed setting rate and tiller number. D, Correlation between the observed and predicted values of yield heterosis based on heterosis of the four yield components and PH. An equation was obtained through stepwise regression analysis: BPH-YPP = BPH-SSR*1.674 + BPH-TPP*0.949 + BPH-TGW*0.571 + BPH-GNP*0.533 + BPH-PH*0.504 + 0.299. E, Correlation between the observed and predicted values of yield heterosis based on heterosis-associated analytes of the four yield components and PH with the equation in Figure 2D. In (A–E), N = 287.
Figure 3Enriched metabolic pathways for heterosis. A, Overlap of analytes between PLS regression and Bayesian method. B, Venn diagram of enriched pathways for heterosis of the five reproductive traits. The percentages of overlapping pathways for each of the four yield components with yield heterosis are correspondingly shown in brackets. The numbers of overlapping and per se enriched pathways for the four yield components are indicated at the left and right side of the slash, respectively. NA, not applicable. C, Comparison of metabolite levels of pentose and glucuronate interconversions between the high- and low-BPH-YPP hybrids. Independent samples t test, two-tailed. N = 72. The center line of each boxplot represents the 50th percentile. The bottom and top of each boxplot represent the 25th and 75th percentiles, respectively. The whiskers represent the minimum and maximum values. The circles represent outliers. D, Correlation between metabolite levels of pentose and glucuronate interconversions and yield heterosis. N = 144. E, Correlation pattern of significantly enriched pathways for yield heterosis. A total of 17 pathways were significantly enriched for yield heterosis, and Pearson correlations were performed among these pathways based on their quantitative information. The purple and green arrows indicate that the high-BPH-YPP hybrids had high or low metabolite levels, respectively. The percentages of regulated pathways from amino acid metabolism and carbohydrate metabolism are shown in brackets. The correlation between cyanoamino acid metabolism and propanoate metabolism is highlighted with a black square. F, Correlations between metabolite levels of the citrate cycle and two pathways from amino acid and carbohydrate metabolism. N = 144.
The enriched metabolic pathways for yield heterosis
| Pathway name |
|
| Metabolite level | Previously known metabolites | Species |
|---|---|---|---|---|---|
| Tyrosine Metabolism | 0.046378 | 3.47E-10 | Low | Succinic acid, tyrosine, maleic acid, dopamine, fumarate |
|
| Pantothenate and CoA Biosynthesis | 0.001271 | 3.54E-04 | Low | Aspartate, valine |
|
| Propanoate Metabolism | 0.001338 | 1.81E-04 | Low | Succinic acid |
|
| Nicotinate and Nicotinamide Metabolism | 0.014907 | 2.60E-04 | Low | Succinic acid, aspartate, fumarate, nicotinate, gamma-aminobutyric acid |
|
| C5-Branched Dibasic Acid Metabolism | 0.017663 | 1.31E-05 | Low | Glutamate, 2-oxoglutarate, itaconate |
|
| Citrate Cycle | 0.00471 | 2.95E-08 | Low | Succinic acid, citric acid, fumarate, malate |
|
| Glyoxylate and Dicarboxylate Metabolism | 0.021109 | 1.72E-03 | Low | Succinic acid, glutamine, citric acid, serine, glycine, 2-oxoglutarate, malate, glyceric acid, glutamate |
|
| Butanoate Metabolism | 0.00072 | 0.17 | Low | Succinic acid, maleic acid, glutamate, 2-oxoglutarate, fumarate, gamma-aminobutyric acid |
|
| Galactose Metabolism | 0.008015 | 5.09E-05 | High | Glycerol, raffinose, galactinol, glucose | Maize ( |
| Pentose and Glucuronate Interconversions | 0.014907 | 5.06E-04 | High | Glycerol, xylose, xylitol | Maize ( |
| Sulfur Metabolism | 0.017663 | 4.67E-04 | High | Succinic acid | Maize ( |
| Cysteine and Methionine Metabolism | 0.022276 | 2.28E-05 | High | Aspartate | Maize ( |
| Pentose Phosphate Pathway | 0.022462 | 1.16E-06 | High | Glycerate, glucose | Maize ( |
| Monobactam Biosynthesis | 0.029861 | 2.33E-03 | High | Aspartate, threonine | Maize ( |
| Tropane, Piperidine and Pyridine alkaloid Biosynthesis | 0.030102 | 5.58E-04 | High | Putrescine, nicotinate, nicotinate |
|
| Lysine Degradation | 0.001925 | 9.41E-03 | High | Succinic acid | Maize ( |
| Valine, Leucine and Isoleucine Biosynthesis | 0.00705 | 2.89E-03 | High | Valine, threonine | Maize ( |
| Cyanoamino acid Metabolism | 0.043081 | 2.85E-02 | High | Glycine, tyrosine, asparagine |
|
| Phenylalanine, Tyrosine and Tryptophan Biosynthesis | 0.022462 | 0.08 | High | – | – |
| Glycine, Serine and Threonine Metabolism | 0.01074 | 0.33 | High | Glycerate, threonine, aspartate, | Maize ( |
| Pyruvate Metabolism | 0.001879 | 0.32 | High | Succinic acid, fumarate | Maize ( |
| Phenylalanine Metabolism | 0.036123 | 0.67 | High | Benzoic acid, succinic acid, fumarate |
|
| Synthesis and Degradation of Ketone Bodies | 0.004325 | – | – | – | – |
The pathway name, P-value, metabolite level, previously known metabolites, and corresponding species are provided. Since two pathways have no reported metabolites and one pathway’s quantitative information is not available, corresponding areas are marked with horizontal lines.
Figure 4Metabolomic landscape of heterosis for six agronomic traits. The landscape of heterosis was created by the overlapping metabolic pathways between traits. All the significantly enriched pathways from amino acid metabolism were positively correlated with heterosis of grain weight, and all the pathways from carbohydrate metabolism were negatively correlated. Similarly, four of six significantly enriched pathways from amino acid metabolism displayed positive correlations with heterosis of seed setting rate, and one out of four pathways from carbohydrate metabolism displayed a negative correlation. Eight significantly enriched pathways for grain number (namely, zeatin biosynthesis, two pathways in amino acid metabolism, and five in carbohydrate metabolism) showed negative relationships, and the pentose phosphate pathway showed a positive correlation. Only one pathway was significantly enriched for tiller heterosis, and the metabolite levels of pentose and glucuronate interconversions were positively correlated with tiller heterosis. In contrast to the above-mentioned correlation patterns, five out of six significantly enriched pathways in amino acid metabolism showed negative correlations with heterosis of PH, and three out of four pathways in carbohydrate metabolism showed positive correlations. Pearson correlation analysis was performed based on the metabolite levels of the significantly enriched pathways, and a correlation was significant when the P <0.05. Positive and negative correlations are indicated in different colors. The metabolic pathways from different types are marked correspondingly. Purple and green arrows indicate high-BPH hybrids with high or low metabolite levels, respectively. Numbers in brackets represent percentages of regulated pathways from amino acid and carbohydrate metabolism.
Figure 5The enriched pathways are predictive of yield heterosis. A, AUC for the ratio of tyrosine metabolism to sulfur metabolism. Univariate ROC curve analysis was performed on high- and low-BPH-YPP hybrids from the diallel cross population to identify biomarkers. The shadow is the computed 95% confidence band. B, Box plot of ratios of tyrosine metabolism to sulfur metabolism. The red line indicates the optimal cutoff value. N = 72. C, AUC for the top 10 features based on the multivariate ROC curve analysis. D, Predictive accuracies with different numbers of features. E, Average importance of the top 10 features. Met = metabolism. F, Correlation between the metabolite levels of l-tyrosine and yield heterosis. N = 287. G, Correlation between the average metabolite levels of the five annotated metabolites in tyrosine metabolism and yield heterosis. N = 287. H, Comparison of yield heterosis for 34 hybrids across growth conditions. Paired samples t test, two-tailed. N = 33. I, Correlation between the metabolite levels of tyrosine metabolism and yield heterosis of the 34 hybrids grown under different conditions. N = 34. J, Comparison of the metabolite levels of tyrosine metabolism between the high- and low-BPH-YPP hybrids (N = 53 and 54, respectively) from a testcross population. K, Correlation between the metabolite levels of tyrosine metabolism and yield heterosis of the testcross population (N = 107). The center line of each boxplot represents the 50th percentile. The bottom and top of each boxplot represent the 25th and 75th percentiles, respectively. The whiskers represent the minimum and maximum values. The circles represent outliers.