| Literature DB >> 25407430 |
Barbara Krakowska1, Ivana Stanimirova, Joanna Orzel, Michal Daszykowski, Ireneusz Grabowski, Grzegorz Zaleszczyk, Miroslaw Sznajder.
Abstract
In the countries of the European Community, diesel fuel samples are spiked with Solvent Yellow 124 and either Solvent Red 19 or Solvent Red 164. Their presence at a given concentration indicates the specific tax rate and determines the usage of fuel. The removal of these so-called excise duty components, which is known as fuel "laundering", is an illegal action that causes a substantial loss in a government's budget. The aim of our study was to prove that genuine diesel fuel samples and their counterfeit variants (obtained from a simulated sorption process) can be differentiated by using their gas chromatographic fingerprints that are registered with a flame ionization detector. To achieve this aim, a discriminant partial least squares analysis, PLS-DA, for the genuine and counterfeit oil fingerprints after a baseline correction and the alignment of peaks was constructed and validated. Uninformative variables elimination (UVE), variable importance in projection (VIP), and selectivity ratio (SR), which were coupled with a bootstrap procedure, were adapted in PLS-DA in order to limit the possibility of model overfitting. Several major chemical components within the regions that are relevant to the discriminant problem were suggested as being the most influential. We also found that the bootstrap variants of UVE-PLS-DA and SR-PLS-DA have excellent predictive abilities for a limited number of gas chromatographic features, 14 and 16, respectively. This conclusion was also supported by the unitary values that were obtained for the area under the receiver operating curve (AUC) independently for the model and test sets.Entities:
Year: 2014 PMID: 25407430 PMCID: PMC4305096 DOI: 10.1007/s00216-014-8332-4
Source DB: PubMed Journal: Anal Bioanal Chem ISSN: 1618-2642 Impact factor: 4.142
Fig. 1Exemplary GC-FID fingerprint: a before and b after baseline removal
Fig. 2Histograms of correlation coefficients calculated between each chromatographic fingerprint and a target signal: a before and b after alignment using COW
Fig. 3Projection of samples (scores) onto space defined by first two principal components: a samples denoted as plus sign are authentic and samples denoted as empty circle are after the laundering process and b illustration of corresponding pairs of samples authentic plus sign and counterfeit variant empty circle (i.e. after the laundering process). Loadings as a function of retention time for: c PC 1 and d PC 2
Performance of partial least squares discriminant models with and without the variable selection scheme
| Type of model | No. of variables |
| AUC model set | AUC test set | Sensitivity, specificity model set (%) | Sensitivity, specificity test set (%) |
|---|---|---|---|---|---|---|
| PLS-DA | 25,000 | 6 | 1.000 | 1.000 | 100.00 100.00 | 100.00 100.00 |
| UVE-PLS-DA | 14 | 9 | 1.000 | 1.000 | 90.00 100.00 | 100.00 100.00 |
| VIP-PLS-DA | 265 | 6 | 0.996 | 0.970 | 95.24 95.24 | 90.00 90.00 |
| SR-PLS-DA | 16 | 3 | 1.000 | 1.000 | 100.00 100.00 | 100.00 100.00 |
Fig. 4The number of relevant variables identified using the UVE-PLS-DA approach as a function of percentage of the variable selection frequency
Identification of chemical compounds found in mixtures eluted at retention times indicated as relevant using the UVE-PLS-DA approach
| Peak number | Retention time [min] | Possible compound |
|---|---|---|
| 1 | 23.937 23.940 23.944 | Benzene, 1-methyl-3-propyl formula: C10H14 |
| 2 | 24.786 24.789 24.793 24.796 | Benzene, 1-methyl-4-propyl formula: C10H14 |
| 3 | 25.398 25.402 | Benzene, 1-ethyl-2,4-dimethyl formula: C10H14 |
| 4 | 27.556 27.559 27.563 | Benzene, 1,2,3,5-tetramethyl formula: C10H14 |
| 5 | 40.881 40.884 |
|
Identification of chemical compounds found in mixtures eluted at retention times indicated as relevant using the SR-PLS-DA approach (NI—not identified)
| Peak number | Retention time [min] | Possible compound |
|---|---|---|
| 1 | 7.052 7.055 | NI |
| 2 | 23.982 23.985 23.989 23.991 23.996 23.999 | 4-Ethyloheptan formula: C9H20 or 1-octanol, 2-butyl formula: C12H26 |
| 3 | 32.292 32.296 32.299 | Phytol formula: C20H40O |
| 4 | 32.891 32.894 32.898 | Compounds containing oxygen, e.g., 1-propene, 2-nitro-3-(1-cyclooctenyl) formula: C11H17NO2 |
| 5 | 38.508 | NI |
| 6 | 47.058 | Pentadecane, 3-methyl formula: C16H34 |