| Literature DB >> 28873633 |
Elena Cubero-Leon1, Olivier De Rudder2, Alain Maquet3.
Abstract
Increasing demand for organic products and their premium prices make them an attractive target for fraudulent malpractices. In this study, a large-scale comparative metabolomics approach was applied to investigate the effect of the agronomic production system on the metabolite composition of carrots and to build statistical models for prediction purposes. Orthogonal projections to latent structures-discriminant analysis (OPLS-DA) was applied successfully to predict the origin of the agricultural system of the harvested carrots on the basis of features determined by liquid chromatography-mass spectrometry. When the training set used to build the OPLS-DA models contained samples representative of each harvest year, the models were able to classify unknown samples correctly (100% correct classification). If a harvest year was left out of the training sets and used for predictions, the correct classification rates achieved ranged from 76% to 100%. The results therefore highlight the potential of metabolomic fingerprinting for organic food authentication purposes.Entities:
Keywords: Authenticity; Carrots; Chemometrics; Conventional agriculture; Mass spectrometry; Metabolomics; Organic agriculture
Mesh:
Year: 2017 PMID: 28873633 PMCID: PMC5611763 DOI: 10.1016/j.foodchem.2017.06.161
Source DB: PubMed Journal: Food Chem ISSN: 0308-8146 Impact factor: 7.514
Fig. 1(a–c) PCA score plots of the first principal component (t1) versus the second principal component (t2) from the combined fraction’s (methanol/water and chloroform) mass spectra of the entire set of carrots under the (a) Electrospray ionization (ESI)+ mode. Quality control (QC) clusters (filled triangles) are highlighted within the ellipses. (b) ESI− mode and (c) ESI− with acid in the mobile phase. Organic samples (filled circles), conventional samples (filled squares). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 2PCA score plots from the combined fraction's mass spectra using the Electrospray ionization (ESI)− mode (as Fig. 1b). The first and the second principal components (t1 and t2) are shown. Each year is represented with a different symbol. In picture legend 1: year 2005; 2: year 2006; 3: year 2007; 4: year 2008.
Summary of OPLS-DA models for the distinction of carrot samples from organic and conventional agricultural systems and production year.
| Model number | Classification based on | Ionisation Mode | Variables | Number of components (predictive component + orthogonal in X component + orthogonal in Y component) | R2Y | Q2 (cum) | Classification rate (Internal cross-validation) | Validation Samples (Test set) | Classification rate (external validation) |
|---|---|---|---|---|---|---|---|---|---|
| 1 | Year | All | 11451 | 3 + 2 + 0 | 0.911 | 0.916 | 100% | – | – |
| 2 | C vs O | ESI+ | 4083 | 1 + 5 + 0 | 0.972 | 0.561 | 100% | Random | 80% |
| 3 | C vs O | ESI− | 1462 | 1 + 4 + 0 | 0.907 | 0.516 | 97.7% | Random | 77.3% |
| 4 | C vs O | ESI− | 1478 | 1 + 2 + 0 | 0.752 | 0.408 | 94.7% | Random | 83.7% |
| 5 | C vs O | All | 20622 | 1 + 4 + 0 | 0.855 | 0.641 | 100% | Random | 83.8% |
| 6 | C vs O | All | 20622 | 1 + 5 + 0 | 0.89 | 0.34 | 98.32% | 2005 | 60% |
| 7 | C vs O | All | 20622 | 1 + 6 + 0 | 0.911 | 0.725 | 100% | 2006 | 48.3% |
| 8 | C vs O | All | 20622 | 1 + 3 + 0 | 0.868 | 0.548 | 98.98% | 2007 | 57.5% |
| 9 | C vs O | All | 20622 | 1 + 3 + 0 | 0.841 | 0.618 | 98.31% | 2008 | 55% |
| 10 | C vs O | All | 13250 | 1 + 4 + 0 | 0.95 | 0.655 | 100% | Random | 100% |
| 11 | C vs O | All | 13250 | 1 + 3 + 0 | 0.961 | 0.754 | 100% | 2005 | 88.2% |
| 12 | C vs O | All | 13250 | 1 + 4 + 0 | 0.939 | 0.733 | 100% | 2006 | 81.7% |
| 13 | C vs O | All | 13250 | 1 + 4 + 0 | 0.982 | 0.781 | 100% | 2007 | 75.7% |
| 14 | C vs O | All | 13250 | 1 + 3 + 0 | 0.957 | 0.761 | 100% | 2008 | 100% |
| 15 | C vs O | All | 13250 | 1 + 4 + 0 | 0.964 | 0.733 | 100% | Cultivar Namur | 90% |
| 16 | C vs O | All | 13250 | 1 + 4 + 0 | 0.966 | 0.756 | 100% | Close Environment | 88.6% |
Validation samples randomly selected (one third of the entire dataset).
Acid in the mobile phase; C: conventional samples; O: organic samples; R2Y: explained variation; Q2(cum): predictive ability; VIP: Variable Importance in the Projection; ESI: electrospray ionization mode.
Fig. 3Score plot of OPLS-DA of model 10 (Table 1). The first predictive component (t1) and the first orthogonal component (to1) are shown. R2Y: explained variation. Ellipse Hotelling’s T2 (95%). Organic samples (filled circles), conventional samples (filled squares). n = 130 (included validation samples/test set). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 4ROC curves of external validation of models 10–14 from Table 1. a: random samples used as validation set (Model 10); b: samples from 2005 as external validation test set (Model 11); c: samples from 2006 as external validation test set (Model 12), d: samples from 2007 as external validation test set (Model 13); e: samples from 2008 as external validation test set (Model 14). ROC = Receiver Operating Characteristic; AUC = Area Under the ROC Curve.
Markers of agricultural system discrimination in carrots. P values (uncorrected) are calculated from univariate statistics (unpaired Student’s t test).
| RT | Polarity Molecular Ion | Fraction | Observed Empirical formula | Δppm | Fragment ions | Metabolite | Ratio organic/conventional | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Formula | Δppm | |||||||||||
| 1.3 | + | MW/C | 233.0632 | C7H14O7Na | 0.4 | 193.0707 | C7H13O6 | 0 | Sedoheptulose | 1.7 | 1.17E−04 | |
| 1.3 | + | MW | 365.1054 | C12H22O11Na | 1.3 | 325.1124 | C12H21O10 | 0.8 | Dissaccharide | 1.3 | 1.02E−04 | |
| 1.5 | + | MW | 175.1190 | C6H15N4O2 | 1.1 | 116.0706 | C5H10NO2 | 1.1 | 2.0 | 2.51E−04 | ||
| 1.5 | + | MW/C | 432.1712 | C15H30NO13 | 0.3 | 127.0390 | C6H7O3 | 0 | Arabinofuranosyl-[a- | 1.5 | 4.86E−04 | |
| 1.9 | + | MW | 215.0162 | C6H8O7Na | 2.3 | 175.0237 | C6H7O7 | 2.7 | Citric acid | 1.5 | 1.93E−06 | |
| 7.0 | + | MW/C | 355.1024 | C16H19O9 | 0.3 | 163.0390 | C9H7O3 | 0.1 | Chlorogenic acid | 1.5 | 8.96E−03 | |
| 1.2 | – | MW/C | 179.0561 | C6H11O6 | 1.4 | 119.0350 | C4H7O4 | 0.7 | Monosaccharide | 1.2 | 3.23E−03 | |
| 1.3 | – | MW/C | 209.0660 | C7H13O7 | 3.4 | 119.0350 | C4H7O4 | 2.7 | Sedoheptulose | 1.7 | 3.17E−03 | |
| 1.3 | – | MW/C | 245.0434 | C7H14O7Cl | 1.6 | 209.0660 | C7H13O7 | 1.2 | Sedoheptulose | 1.6 | 2.00E−03 | |
| 7.0 | – | MW | 353.0878 | C16H17O9 | 0 | 191.0561 | C7H11O6 | 0.0 | Chlorogenic acid | 1.4 | 8.07E−03 | |
| 17.0 | + | MW/C | 518.3280 | C29H46NO4Na2 | 1.7 | Fatty acid ester | 2.0 | 4.90E−04 | ||||
| 6.8 | – | MW/C | 391.1610 | C17H27O10 | 2.4 | Terpene glycoside | 6.0 | 1.69E−09 | ||||
| 8.1 | – | MW | 373.1504 | C17H25O9 | 3.3 | Terpene glycoside | 2.4 | 5.03E−08 | ||||
| 8.9 | – | MW | 507.2083 | C22H35O13 | 0.5 | Oligosaccharide | 5.5 | 1.11E−07 | ||||
| 9.1 | – | MW | 369.1555 | C18H25O8 | 0.0 | 247.1187 | C11H19O6 | 1.9 | 12-Hydroxyjasmonic acid glucoside | 3.9 | 1.46E−08 | |
| 1.3 | + | MW/C | 413.1261 | Unknown | 2.8 | 8.49E−06 | ||||||
| 1.4 | + | MW/C | 397.1310 | Unknown | 1.9 | 9.67E−04 | ||||||
| 15.7 | + | C | 406.0746 | Unknown | 0.5 | 6.37E−06 | ||||||
| 1.2 | – | MW/C | 119.0374 | Unknown | 1.2 | 3.17E−03 | ||||||
| 1.7 | – | MW | 719.2690 | Unknown | 0.8 | 8.55E−03 | ||||||
| 6.6 | – | MW | 315.0723 | Unknown | 2.0 | 5.90E−04 | ||||||
| 9.5 | – | C | 325.1078 | Unknown | 0.6 | 4.38E−03 | ||||||
| 11.4 | – | MW | 555.1140 | Unknown | 0.7 | 3.65E−02 | ||||||
| 17.9 | – | C | 452.2785 | Unknown | 1.3 | 1.85E−02 | ||||||
Δppm: is the difference in exact mass between the measured value and the theoretical value. RT: retention time; MW: methanol/water fraction; C: chloroform fraction.
Ratio was calculated as peak area ratio from the arithmetic mean values of each group.
Significant p-values after Bonferroni corrections (p < 3.77E−06).