| Literature DB >> 27729832 |
Olivier Fernandez1, Maria Urrutia1, Stéphane Bernillon2, Catherine Giauffret3, François Tardieu4, Jacques Le Gouis5, Nicolas Langlade6, Alain Charcosset7, Annick Moing2, Yves Gibon2.
Abstract
BACKGROUND: In the last decade, metabolomics has emerged as a powerful diagnostic and predictive tool in many branches of science. Researchers in microbes, animal, food, medical and plant science have generated a large number of targeted or non-targeted metabolic profiles by using a vast array of analytical methods (GC-MS, LC-MS, 1H-NMR….). Comprehensive analysis of such profiles using adapted statistical methods and modeling has opened up the possibility of using single or combinations of metabolites as markers. Metabolic markers have been proposed as proxy, diagnostic or predictors of key traits in a range of model species and accurate predictions of disease outbreak frequency, developmental stages, food sensory evaluation and crop yield have been obtained. AIM OF REVIEW: (i) To provide a definition of plant performance and metabolic markers, (ii) to highlight recent key applications involving metabolic markers as tools for monitoring or predicting plant performance, and (iii) to propose a workable and cost-efficient pipeline to generate and use metabolic markers with a special focus on plant breeding. KEY MESSAGE: Using examples in other models and domains, the review proposes that metabolic markers are tending to complement and possibly replace traditional molecular markers in plant science as efficient estimators of performance.Entities:
Keywords: Breeding; Metabolic marker; Metabolomics; Plant performance; Prediction
Year: 2016 PMID: 27729832 PMCID: PMC5025497 DOI: 10.1007/s11306-016-1099-1
Source DB: PubMed Journal: Metabolomics ISSN: 1573-3882 Impact factor: 4.290
List of associations of metabolic markers and plant performance in recent literature
| Plant | Trait | Numbers of markers | Statistical approach | Association | Statistical validation | Reference |
|---|---|---|---|---|---|---|
| Arabidopsis | Biomass | 181 | CCA/PLS | corr = 0.73/0.58 | Random permutation/train-test sets | Meyer et al. |
| Dry weight | 181 | OLS/PLS | Q2Y = 0.11 (C24) and 0.11 (Columbia)/Q2Y = 0.12 (C24) and 0.23 (Columbia) | Random permutation/random permutation | Steinfath et al. | |
| Dry weight | 9 (Columbia) 13 (C24) | PLS | Q2Y = 0.26/0.38 | Random permutation | ||
| Barley | Several malt quality traits | 216 | O2PLS | Q2Y = 0.17 to 0.79 | – | Heuberger et al. |
| Maize | Several performance traits | 130 | RR-BLUP | r(ĝ,g) = 0.61 to 0.79 | Fivefold cross validation | Riedelsheimer et al. |
| Dry matter yield | 7 | Pearson correlation | corr = −0.35 to 0.12 |
| Riedelsheimer et al. | |
| Lignin content | 7 | Pearson correlation | corr = −0.20 to 0.15 | p value < 0.01 | ||
| Plant height | 5 | Pearson correlation | corr = −0.23 to 0.16 | p value < 0.008 | ||
| GCA for several performance traits | 1/563 | Pearson correlation/RR-BLUP | corr = −0.54 to 0.48/r(ĝ,g) = 0.47 to 0.78 | Fivefold cross validation | Riedelsheimer et al. | |
| Grain yield under drought stress | 5 | Pearson correlation | corr = −0.47 to −0.54 | p value < 0.01 | Obata et al. | |
| Pine | Plant height | 11 | Pearson correlation | corr = 0.13 to 0.35 | p value < 0.05 | Kang et al. |
| Stem dry mass | 11 | Pearson correlation | corr = 0.15 to 0.34 | p value < 0.05 | ||
| Potato | Chip property | 2 | PLS and VIP selection | 0.66 to 0.75 | Random permutation | Steinfath et al. |
| Susceptibility to blackspotedness | 5 | PLS and VIP selection | 0.53 to 0.82 | Random permutation | ||
| Rice | Tolerance to mild salinity stress | 2 | t-test(foldchange)/PLS-DA | Delta log2(FCh) > 1/Q2Y = 0.49 | p value < 0.05/random permutation | Nam et al. |
| Yield under drought stress | 16 | Pearson correlation | corr = −0.71 to 0.53 | p value < 0.05 | Degenkolbe et al. | |
| yield under drought stress | 5 | Pearson correlation | corr = −0.72 to 0.45 | p value < 0.05 | ||
| Tomato | Firmess and shelf life | 2 | Correlation network | – | p value < 0.001 | López et al. |
| TYLCV resistance | 120 | FCH/Correlation network | 0.88 to 1.43/mean r2 = 0.62 | p value < 0.05/– | Sade et al. | |
| Wheat | Fusarium graminearum resistance | 60 | FCH/Correlation network | – | – | Cuperlovic-Culf et al. |
| Grape | Esca disease sensitivity | 34 | PCA | – | – | Lima et al. |
CCA canonical correlation analysis, corr correlation, FCH fold change, GCA general combining ability, O2PLS orthogonal partial least squares projections to latent structures, OLS ordinary least squares, PCA principal component analysis, PLS partial least squares to latent structures, Q2Y cumulative predictive explained variation, r(ĝ,g) correlation between predicted and unobserved true values, RR-BLUP ridge regression-best linear unbiased prediction, VIP variables importance in the projection
Fig. 1Strategy combining phenotyping, metabotyping and modeling for selection in order to find a few performing genotypes from a full panel of genotypes for a given criterion. Metabolic marker may optimize cost and speed of the process by (A) “metabotyping” and precision phenotyping of a diversity subpanel in a series of representative environmental conditions, (B) using collected data to model genotype performance. The model would generate a workable combination of (C) adapted growth scenario, sampling procedure and a small cost-efficient set of metabolic markers which would be used for (D) validation on the full panel of genotypes or for a further selection program. For the purpose of estimating costs, we consider 1 sample per genotype as a pool of 5 plants
Fig. 2Key milestones for improving and developing the use of metabolic markers