| Literature DB >> 31752349 |
Sanae Bikrani1, Ana M Jiménez-Carvelo2, Mounir Nechar1, M Gracia Bagur-González2, Badredine Souhail1, Luis Cuadros-Rodríguez2.
Abstract
Fat-spread products are a stabilized emulsion of water and vegetable oils. The whole fat content can vary from 10 to 90% (w/w). There are different kinds, which are differently named, and their composition depends on the country in which they are produced or marketed. Thus, having analytical solutions to determine geographical origin is required. In this study, some multivariate classification methods are developed and optimised to differentiate fat-spread-related products from different geographical origins (Spain and Morocco), using as an analytical informative signal the instrumental fingerprints, acquired by liquid chromatography coupled with a diode array detector (HPLC-DAD) in both normal and reverse phase modes. No sample treatment was applied, and, prior to chromatographic analysis, only the samples were dissolved in n‑hexane. Soft independent modelling of class analogy (SIMCA) and partial least squares-discriminant analysis (PLS-DA) were used as classification methods. In addition, several classification strategies were applied, and performance of the classifications was evaluated applying proper classification metrics. Finally, 100% of samples were correctly classified applying PLS-DA with data collected in reverse phase.Entities:
Keywords: food authentication; liquid chromatography fingerprinting; margarines and spreads; multivariate classification
Year: 2019 PMID: 31752349 PMCID: PMC6915439 DOI: 10.3390/foods8110588
Source DB: PubMed Journal: Foods ISSN: 2304-8158
Types of vegetable oils present in the samples.
| Sample No. | Origin | Vegetable Oil |
|---|---|---|
| 1 | Spain | Sunflower, palm, and corn |
| 2 | Sunflower and palm | |
| 3 | Sunflower and palm | |
| 4 | Sunflower, coconut, and canola | |
| 5 | Sunflower, linseed, coconut, and canola | |
| 6 | Sunflower, linseed, and palm | |
| 7 | Sunflower, linseed, palm, and canola | |
| 8 | Olive, sunflower, linseed, and palm | |
| 9 | Sunflower, linseed, and palm | |
| 10 | Sunflower, linseed, and palm | |
| 11 | Olive, sunflower, linseed, and palm | |
| 12 | Soybean, sunflower, linseed, and palm | |
| 13 | Sunflower and palm | |
| 14 | Olive, sunflower, linseed, coconut, and shea | |
| 15 | Sunflower, linseed, and palm | |
| 16 | Sunflower, coconut, canola, and shea | |
| 17 | Soybean, sunflower, linseed, and palm | |
| 18 | Morocco | - |
| 19 | - | |
| 20 | Soybean | |
| 21 | - | |
| 22 | - | |
| 23 | Soybean and corn | |
| 24 | Soybean | |
| 25 | - | |
| 26 | Corn | |
| 27 | Soybean | |
| 28 | Soybean | |
| 29 | Soybean | |
| 30 | Soybean | |
| 31 | The Netherlands | Sunflower, palm, and coconut |
| 32 | United Kingdom | Sunflower, palm, and rapeseed |
| 33 | France | Rapeseed, palm, olive, and sunflower |
| 34 | Germany | Soybean, palm, rapeseed, and coconut |
| 35 | Belgium | Palm, coconut, and canola |
The hyphen “-” signifies that the kind of vegetable oil present in the samples is unknown.
Figure 1Chromatogram of a margarine/spread samples showing the region of interest used to build the classification models: (a,b) Normal phase from Morocco and Spain, respectively; (c,d) reverse phase from Morocco and Spain, respectively.
Distribution of the samples used in the different classification datasets regarding geographical origin.
| SIMCA | PLS-DA | ||||
|---|---|---|---|---|---|
| Dataset | Origin | (1ic) | (2ic) | ( | (2ic) |
| Training set | Spain | 10 | 10 | 10 | 10 |
| Morocco | -- | 10 | -- | 10 | |
| Blank | -- | -- | 6 | -- | |
| Validation set | Spain | 7 | 7 | 7 | 7 |
| Morocco | 13 | 3 | 13 | 3 | |
| Prediction set | Europe (other than Spain) | -- | -- | -- | 5 |
Characteristics of the principal components analysis (PCA) models.
| (RP)HPLC-DAD | (NP)HPLC-DAD | |||
|---|---|---|---|---|
| Wavelength | PCs | % Var | PCs | % Var |
| 210 nm | 4 | 90.00 | 4 | 95.00 |
| 254 nm | 4 | 99.00 | 4 | 93.00 |
Figure 2PCA scores obtained from the fingerprint data of the 35 margarine samples: PC1–PC2 plane of the chromatogram acquired at 210 nm: (a) In reverse phase; (b) In normal phase; and acquired at 254 nm: (c) In reverse phase; (d) In normal phase.
Figure 3Coomans’ plot from (a) normal and (b) reverse liquid chromatography coupled with diode-array detector ((NP)HPLC-DAD and (RP)HPLC-DAD, respectively) datasets at 210 nm.
Soft independent modelling of class analogy (SIMCA) classification results for the validation set.
| 7 | 3 |
| ||||
|
|
| 0 | 1 | 1 | ||
|
| 2 | 2 | 4 | |||
|
| 0 | 0 | 0 | |||
|
| 5 | 0 | 5 | |||
|
|
| |||||
|
| ||||||
|
| ||||||
| 7 | 3 |
| ||||
|
|
| 0 | 0 | 0 | ||
|
| 1 | 3 | 4 | |||
|
| 0 | 0 | 0 | |||
|
| 6 | 0 | 6 | |||
|
|
| |||||
T: Target class (Spanish class); nT: Nontarget class (Moroccan class); I: Inconclusive samples; O: Samples not considered as belonging to any class.
Quality metrics of the SIMCA models.
| Parameter | (NP)HPLC-DAD | (RP)HPLC-DAD |
|---|---|---|
| Sensitivity (or Recall) | 0.71 | 0.86 |
| Specificity | 0.00 | 0.00 |
| Positive predictive value (Precision) | 1.00 | 1.00 |
| Negative predictive value | -- | -- |
| Youden index | −0.29 | −0.14 |
| Positive likelihood rate | 0.71 | 0.86 |
| Negative likelihood rate | -- | -- |
| 0.83 | 0.92 | |
| Discriminant power | -- | -- |
| Efficiency (or Accuracy) | 0.50 | 0.60 |
| AUC (Correctly classified rate) | 0.36 | 0.43 |
| Matthews correlation coefficient | -- | -- |
| Kappa coefficient | 0.23 | 0.31 |
The hyphen “-” signifies that the performance feature cannot be determined.
Figure 4Classification plot of the partial least squares-discriminant analysis (PLS-DA) models from liquid chromatography (LC) data obtained in (a) normal phase and (b) reverse phase at 210 nm. The orange area represents the inconclusive area.
PLS-DA classification results for the validation set.
| 7 | 3 |
| ||||
|
|
| 0 | 0 | 0 | ||
|
| 1 | 1 | 2 | |||
|
| 0 | 2 | 2 | |||
|
| 6 | 0 | 6 | |||
|
|
| |||||
|
| ||||||
|
| ||||||
| 7 | 3 |
| ||||
|
|
| 0 | 0 | 0 | ||
|
| 0 | 0 | 0 | |||
|
| 0 | 3 | 3 | |||
|
| 7 | 0 | 7 | |||
|
|
| |||||
T: Target class (Spanish class); nT: Nontarget class (Moroccan class); I: Inconclusive samples; O: Samples not considered as belonging to any class.
Quality metrics of the PLS-DA models.
| Parameter | (NP)HPLC-DAD | (RP)HPLC-DAD |
|---|---|---|
| Sensitivity (or Recall) | 0.86 | 1.00 |
| Specificity | 0.67 | 1.00 |
| Positive predictive value (Precision) | 1.00 | 1.00 |
| Negative predictive value | 1.00 | 1.00 |
| Youden index | 0.52 | 1.00 |
| Positive likelihood rate | 2.57 | -- |
| Negative likelihood rate | 0.21 | 0.00 |
| 0.92 | 1.00 | |
| Discriminant power | 0.60 | -- |
| Efficiency (or Accuracy) | 0.80 | 1.00 |
| AUC (Correctly classified rate) | 0.76 | 1.00 |
| Matthews correlation coefficient | 0.76 | 1.00 |
| Kappa coefficient | 0.62 | 1.00 |
The hyphen “-” is signifying that the performance feature cannot be determined.
Figure 5Class predictions plot for the samples from the European countries other than Spain.