| Literature DB >> 30061490 |
Andrzej Bieganowski1, Grzegorz Józefaciuk2, Lidia Bandura3, Łukasz Guz4, Grzegorz Łagód5, Wojciech Franus6.
Abstract
The possibility of detecting low levels of soil pollution by petroleum fuel using an electronic nose (e-nose) was studied. An attempt to distinguish between pollution caused by petrol and diesel oil, and its relation to the time elapsed since the pollution event was simultaneously performed. Ten arable soils, belonging to various soil groups from the World Reference Base (WRB), were investigated. The measurements were performed on soils that were moistened to field capacity, polluted separately with both hydrocarbons, and then allowed to dry slowly over a period of 180 days. The volatile fingerprints differed throughout the course of the experiment, and, by its end, they were similar to those of the unpolluted soils. Principal component analysis (PCA) and artificial neural network (ANN) analysis showed that the e-nose results could be used to detect soil contamination and distinguish between pollutants and contamination levels.Entities:
Keywords: e-nose; hydrocarbon; pollution; soil
Year: 2018 PMID: 30061490 PMCID: PMC6111446 DOI: 10.3390/s18082463
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Basic properties of the investigated soils.
| No. | WRB Soil Group | Corg (%) | Particle Size Group |
|---|---|---|---|
| 1 | Brunic Arenosol | 0.86 | Sand |
| 2 | Stagnic Luvisol | 1.19 | Sandy loam |
| 3 | Haplic Cambisol | 0.57 | Sandy loam |
| 4 | Leptic Cambisol | 1.08 | Silt loam |
| 5 | Mollic Stagnic Fluvisol | 1.14 | Silt loam |
| 6 | Stagnic Phaeozem (Siltic) | 1.97 | Silt |
| 7 | Haplic Chernozem (Siltic) | 1.11 | Silt loam |
| 8 | Haplic Luvisol (Siltic) | 1.06 | Silt |
| 9 | Leptic Skeletic Dystric Cambisol | 0.90 | Silt loam |
| 10 | Haplic Fluvisol (Clayic) | 1.86 | Silt |
Figure 1Schema of sampling method (dimensions in cm).
Figure 2Raw signal changes of electronic nose (e-nose) particular sensors during experiment (a) without contamination, (b) contamination with diesel, and (c) contamination with petrol. Error bars represent standard deviation.
Figure 3Two-dimensional principal component analysis (PCA) plots for (a) the volatile fingerprints of the 10 soils under examination, polluted with petrol and diesel. The numbers represent days after pollution. The points represent the averaged data for all ten soils. (b) Plot of variable loadings.
Classification errors for the networks used, using a single-step classification of all the soils studied.
| %E * | |||
|---|---|---|---|
| Network ID | Training | Validation | Testing |
| 1 | 17.44 | 15.98 | 17.40 |
| 2 | 18.83 | 18.95 | 19.82 |
| 3 | 17.20 | 17.09 | 17.50 |
| 4 | 17.32 | 16.98 | 18.64 |
| 5 | 18.11 | 19.47 | 20.44 |
| 6 | 18.00 | 18.33 | 17.57 |
| 7 | 17.85 | 17.78 | 17.43 |
| 8 | 17.22 | 17.99 | 18.09 |
| 9 | 17.92 | 16.71 | 18.26 |
| 10 | 17.17 | 17.18 | 17.67 |
|
|
|
|
|
* Percentage error (%E) indicates the fraction of samples which were misclassified. A value of 0 means no misclassifications, while a value of 100 indicates the maximum number of misclassifications. ID—identifier.
Classification errors for the networks used in the in two-step classification of all the soils studied.
| STEP1 | STEP2 | ||||||
|---|---|---|---|---|---|---|---|
| Network ID | %E * | Network ID | %E * | ||||
| Training | Validation | Testing | Training | Validation | Testing | ||
| 1 | 0.90 | 0.62 | 0.96 | 1 | 3.98 | 4.15 | 3.53 |
| 2 | 6.46 | 5.83 | 7.32 | 2 | 3.98 | 4.25 | 3.94 |
| 3 | 0.76 | 0.41 | 1.03 | 3 | 4.38 | 3.42 | 3.32 |
| 4 | 4.03 | 4.07 | 3.86 | 4 | 3.75 | 3.63 | 4.15 |
| 5 | 6.86 | 6.35 | 8.14 | 5 | 3.80 | 4.46 | 3.01 |
| 6 | 5.63 | 5.17 | 5.73 | 6 | 3.46 | 3.42 | 3.32 |
| 7 | 1.92 | 1.17 | 2.48 | 7 | 4.88 | 5.28 | 3.94 |
| 8 | 1.06 | 1.45 | 1.38 | 8 | 4.04 | 3.21 | 3.94 |
| 9 | 1.71 | 1.65 | 2.27 | 9 | 4.51 | 4.36 | 3.53 |
| 10 | 3.59 | 3.31 | 4.07 | 10 | 3.82 | 4.88 | 3.63 |
|
|
|
|
|
|
|
|
|
* Percentage error (%E) indicates the fraction of samples which were misclassified. A value of 0 means no misclassifications, while a value of 100 indicates the maximum number of misclassifications.
Classification errors for the artificial neural networks (ANNs) used in the prediction of one chosen type of soil contamination assessment (yes/no).
| ANN ID | Day Known | Day Unknown | ||||||
|---|---|---|---|---|---|---|---|---|
| 1 | 8 | 15 | 37 | 64 | 93 | 173 | 1–173 | |
| 1 | 7.2 | 8.9 | 10.8 | 12.0 | 10.6 | 13.6 | 15.6 | 21.7 |
| 2 | 6.1 | 11.1 | 9.9 | 8.4 | 13.8 | 19.1 | 17.4 | 20.7 |
| 3 | 8.3 | 14.4 | 14.8 | 14.4 | 14.2 | 14.0 | 13.9 | 23.7 |
| 4 | 7.6 | 11.9 | 12.2 | 13.3 | 13.3 | 13.3 | 12.8 | 15.4 |
| 5 | 6.1 | 9.8 | 14.1 | 9.2 | 14.5 | 19.4 | 15.0 | 19.1 |
| 6 | 6.3 | 8.8 | 10.7 | 12.0 | 13.8 | 18.6 | 17.2 | 24.7 |
| 7 | 8.3 | 10.7 | 12.2 | 14.6 | 12.9 | 17.9 | 15.0 | 24.4 |
| 8 | 7.4 | 8.8 | 14.7 | 8.4 | 14.3 | 15.5 | 12.2 | 18.0 |
| 9 | 7.7 | 8.7 | 8.2 | 8.0 | 13.3 | 17.7 | 12.2 | 23.3 |
| 10 | 9.7 | 8.2 | 9.2 | 13.3 | 11.6 | 16.4 | 14.8 | 21.5 |
|
|
|
|
|
|
|
|
|
|
Efficiency of the used networks in distinguishing the data samples collected at different pre-defined days after pollution.
|
| ||||||
|
|
|
|
| |||
|
|
|
|
|
|
| |
| 1 | 0.99 | 0.99 | 1.14 | 0.99 | 1.27 | 0.99 |
| 2 | 0.06 | 0.99 | 0.19 | 0.99 | 0.08 | 0.99 |
| 3 | 0.16 | 0.99 | 0.18 | 0.99 | 0.48 | 0.99 |
| 4 | 0.032 | 0.99 | 0.013 | 0.99 | 0.023 | 0.99 |
| 5 | 0.10 | 0.99 | 0.26 | 0.99 | 15.58 | 0.97 |
|
|
|
|
|
|
|
|
|
| ||||||
|
|
|
|
| |||
|
|
|
|
|
|
| |
| 1 | 0.11 | 0.99 | 0.69 | 0.99 | 0.40 | 0.99 |
| 2 | 0.29 | 0.99 | 0.56 | 0.99 | 0.71 | 0.99 |
| 3 | 0.10 | 0.99 | 0.21 | 0.99 | 0.90 | 0.99 |
| 4 | 0.29 | 0.99 | 0.30 | 0.99 | 0.93 | 0.99 |
| 5 | 1.33 | 0.99 | 1.27 | 0.99 | 1.65 | 0.99 |
|
|
|
|
|
|
|
|
* The mean squared error (MSE) is the average squared difference between outputs and targets. Lower values are better, whereby a value of 0 means no error. Regression (R) values measure the correlation between outputs and targets. An R value of 1 means a close relationship, while a value of 0 corresponds to a random relationship.
Figure 4Exemplary fits of artificial neural network (ANN) identifier (ID) 1 for the predicted and measured data for the estimation of the time lapse from the beginning of pollution with petrol (up) and diesel (down).