| Literature DB >> 30872800 |
S Cipullo1, S Nawar2, A M Mouazen2, P Campo-Moreno1, F Coulon3.
Abstract
A number of studies have shown that visible and near infrared spectroscopy (VIS-NIRS) offers a rapid on-site measurement tool for the determination of total contaminant concentration of petroleum hydrocarbons compounds (PHC), heavy metals and metalloids (HM) in soil. However none of them have yet assessed the feasibility of using VIS-NIRS coupled to random forest (RF) regression for determining both the total and bioavailable concentrations of complex chemical mixtures. Results showed that the predictions of the total concentrations of polycyclic aromatic hydrocarbons (PAH), PHC, and alkanes (ALK) were very good, good and fair, and in contrast, the predictions of the bioavailable concentrations of the PAH and PHC were only fair, and poor for ALK. A large number of trace elements, mainly lead (Pb), aluminium (Al), nickel (Ni), chromium (Cr), cadmium (Cd), iron (Fe) and zinc (Zn) were predicted with very good or good accuracy. The prediction results of the total HMs were also better than those of the bioavailable concentrations. Overall, the results demonstrate that VIS-NIR DRS coupled to RF is a promising rapid measurement tool to inform both the distribution and bioavailability of complex chemical mixtures without the need of collecting soil samples and lengthy extraction for further analysis.Entities:
Year: 2019 PMID: 30872800 PMCID: PMC6418180 DOI: 10.1038/s41598-019-41161-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Illustrative block diagram showing the different steps for the estimation of complex chemical mixtures of total and bioavailable concentrations in soils using chemical methods and VIS-NIR coupled with Random Forest (RF). DCM: dichloromethane; Hex: hexane; HP-β-CD: hydroxypropyl-β-cyclodextrin; PHC: Petroleum hydrocarbons; HM: heavy metals; PAH: polycyclic aromatic hydrocarbons; ALK: Alkanes; Al: aluminium; Cr: Chromium, Cd: Cadmium; Ni: Nickel, Zn: Zinc; Se: Selenium, Cu: Copper; Fe: Iron; As: Arsenic; Pb:Lead, ML: Machine Learning,; LOOCV: leave-one-out-cross-validation; R2:coefficient of determination; RMSEP: root mean square error of prediction; RPD: ratio of prediction deviation; RPIQ: ratio of the performance to interquartile distance.
Figure 2Box plot representing total and bioavailable concentrations (mg/kg) of heavy metals/metalloids (HM) (left) and petroleum hydrocarbons (PHC) (right) across the five soil types (n = 105). Black dots represent outlier samples.
Descriptive statistics of the calibration datasets of total and bioavailable contents of PHC, PAH, ALK and HM/metalloids used for the RF modelling.
| No | Min | 1st Q | Median | Mean | 3rd Q | Max | SD | |||
|---|---|---|---|---|---|---|---|---|---|---|
| Organics | Total (mg/kg) | PHC | 74 | 79 | 137 | 241 | 285 | 389 | 1049 | 188 |
| PAH | 73 | 0.3 | 2.1 | 102 | 145 | 267 | 553 | 160 | ||
| ALK | 73 | 49 | 109 | 126 | 146 | 163 | 496 | 74 | ||
| Bioavailable (mg/kg) | PHC | 73 | 14 | 48 | 109 | 127 | 159 | 548 | 107 | |
| PAH | 73 | 0.2 | 1.2 | 60 | 76 | 131 | 326 | 82 | ||
| ALK | 73 | 7.3 | 32 | 47 | 55 | 62 | 263 | 39 | ||
| Inorganics | Total (mg/kg) | Al | 74 | 2375 | 7289 | 12301 | 14409 | 18808 | 46195 | 9605 |
| Cr | 73 | 5 | 17 | 25 | 29 | 37 | 85 | 16 | ||
| Cd | 72 | 0.1 | 0.2 | 0.3 | 0.4 | 0.6 | 2 | 0.4 | ||
| Ni | 74 | 2 | 11 | 15 | 18 | 26 | 49 | 10 | ||
| Zn | 73 | 15 | 64 | 108 | 244 | 164 | 1964 | 393 | ||
| Se | 72 | 0.4 | 1 | 2 | 2 | 3 | 6 | 1 | ||
| Cu | 73 | 4 | 12 | 27 | 33 | 40 | 128 | 25 | ||
| Fe | 74 | 787 | 10857 | 15300 | 17969 | 20955 | 57669 | 10822 | ||
| As | 73 | 1 | 7 | 10 | 11 | 13 | 34 | 6 | ||
| Pb | 74 | 9 | 31 | 61 | 288 | 131 | 2864 | 600 | ||
| Bioavailable (mg/kg) | Al | 72 | 1 | 8 | 234 | 339 | 685 | 1037 | 355 | |
| Cr | 73 | 0.1 | 0.3 | 1 | 1 | 1 | 2 | 1 | ||
| Cd | 73 | 0.1 | 0.2 | 0.2 | 0.3 | 0.2 | 2 | 0.4 | ||
| Ni | 74 | 1 | 1 | 3 | 3 | 4 | 12 | 2 | ||
| Zn | 72 | 4 | 9 | 15 | 314 | 26 | 1911 | 624 | ||
| Se | 72 | 0.1 | 0.5 | 1 | 1 | 1 | 2 | 0.4 | ||
| Cu | 72 | 0.2 | 2 | 6 | 7 | 12 | 18 | 6 | ||
| Fe | 73 | 5 | 8 | 98 | 171 | 159 | 928 | 244 | ||
| As | 72 | 0.3 | 0.5 | 1 | 1 | 1 | 1 | 0.2 | ||
| Pb | 74 | 0.1 | 0.3 | 5 | 295 | 54 | 2463 | 690 | ||
RF outputs for the calibration of the total and bioavailable concentrations of PHC, PAH, ALK and HM/metalloids in the contaminated soil samples.
| Compound | N° | R2 | RMSE (mg/kg) | RPD | RPIQ | ||
|---|---|---|---|---|---|---|---|
| Organics | Total (mg/kg) | PHC | 74 | 0.83 | 78.2 | 2.4 | 3.2 |
| PAH | 73 | 0.88 | 52.5 | 2.8 | 5.1 | ||
| ALK | 74 | 0.82 | 30.7 | 2.4 | 1.8 | ||
| Bioavailable (mg/kg) | PHC | 74 | 0.80 | 48.5 | 2.3 | 2.5 | |
| PAH | 73 | 0.82 | 33.6 | 2.4 | 3.6 | ||
| ALK | 74 | 0.77 | 18.7 | 2.1 | 1.6 | ||
| Inorganics | Total (mg/kg) | Al | 73 | 0.93 | 2195 | 4.1 | 5.2 |
| Cr | 73 | 0.93 | 4 | 3.7 | 4.8 | ||
| Cd | 72 | 0.92 | 0.1 | 3.5 | 5.2 | ||
| Ni | 74 | 0.92 | 3 | 3.6 | 5.6 | ||
| Zn | 73 | 0.9 | 121 | 3.3 | 1.8 | ||
| Se | 72 | 0.88 | 0.4 | 3 | 4.2 | ||
| Cu | 73 | 0.9 | 8 | 3.3 | 3.5 | ||
| Fe | 74 | 0.92 | 2967 | 3.6 | 3.4 | ||
| As | 73 | 0.89 | 2 | 3.1 | 3.2 | ||
| Pb | 74 | 0.88 | 198 | 3 | 2.6 | ||
| Bioavailable (mg/kg) | Al | 72 | 0.92 | 97 | 3.8 | 5 | |
| Cr | 73 | 0.92 | 0.1 | 3.7 | 5.3 | ||
| Cd | 73 | 0.91 | 0.1 | 3.3 | 3.4 | ||
| Ni | 74 | 0.77 | 0.9 | 3.1 | 3.6 | ||
| Zn | 72 | 0.82 | 258 | 2.4 | 1.3 | ||
| Se | 72 | 0.86 | 0.1 | 2.7 | 3.2 | ||
| Cu | 72 | 0.89 | 1.5 | 3.7 | 6.5 | ||
| Fe | 73 | 0.89 | 78 | 3.1 | 1.9 | ||
| As | 72 | 0.86 | 0.07 | 2.8 | 3.1 | ||
| Pb | 74 | 0.86 | 199 | 2.8 | 2.8 |
Descriptive statistics of the prediction datasets of total and bioavailable PHC, PAH, ALK and HM/metalloids used for the RF modelling.
| Compound | N° | Min | 1st Q | Median | Mean | 3rd Q | Max | SD | ||
|---|---|---|---|---|---|---|---|---|---|---|
| Organics | Total (mg/kg) | PHC | 31 | 92 | 127 | 285 | 308 | 411 | 890 | 210 |
| PAH | 31 | 0.6 | 2.9 | 190 | 172 | 285 | 522 | 160 | ||
| ALK | 31 | 58 | 102 | 120 | 149 | 150 | 477 | 88 | ||
| Bioavailable (mg/kg) | PHC | 31 | 42 | 58 | 130 | 131 | 177 | 374 | 80 | |
| PAH | 31 | 0.3 | 3.8 | 70 | 76 | 102 | 292 | 81 | ||
| ALK | 31 | 35 | 13 | 54 | 65 | 82 | 206 | 38 | ||
| Inorganics | Total (mg/kg) | Al | 31 | 1543 | 5222 | 8920 | 12677 | 18772 | 33055 | 9113 |
| Cr | 31 | 3 | 11 | 18 | 23 | 31 | 59 | 15 | ||
| Cd | 31 | 0.1 | 0.1 | 0.2 | 0.4 | 0.4 | 1 | 0.4 | ||
| Ni | 31 | 4 | 9 | 14 | 16 | 22 | 36 | 10 | ||
| Zn | 31 | 30 | 66 | 105 | 303 | 320 | 1827 | 446 | ||
| Se | 31 | 1 | 1 | 2 | 2 | 3 | 4 | 1 | ||
| Cu | 31 | 6 | 12 | 21 | 27 | 25 | 103 | 23 | ||
| Fe | 31 | 1109 | 5647 | 11825 | 15774 | 21352 | 40529 | 11672 | ||
| As | 31 | 3 | 8 | 10 | 13 | 17 | 25 | 6 | ||
| Pb | 31 | 11 | 40 | 106 | 314 | 291 | 2349 | 519 | ||
| Bioavailable (mg/kg) | Al | 31 | 1 | 2 | 263 | 344 | 603 | 906 | 329 | |
| Cr | 31 | 0.1 | 0.4 | 1 | 1 | 1 | 1 | 0.4 | ||
| Cd | 31 | 0.1 | 0.2 | 0.2 | 1 | 1 | 2 | 1 | ||
| Ni | 31 | 1 | 2 | 3 | 4 | 5 | 8 | 2 | ||
| Zn | 31 | 5 | 12 | 18 | 147 | 24 | 1176 | 343 | ||
| Se | 31 | 0.2 | 1 | 1 | 1 | 1 | 1 | 0.4 | ||
| Cu | 31 | 0.3 | 3 | 5 | 7 | 13 | 18 | 5 | ||
| Fe | 31 | 6 | 18 | 142 | 252 | 426 | 816 | 290 | ||
| As | 31 | 0.4 | 1 | 1 | 1 | 1 | 1 | 0 | ||
| Pb | 31 | 0.1 | 0.3 | 5 | 577 | 1511 | 2408 | 888 | ||
Figure 3Scatter plots of the prediction datasets of total (A) and bioavailable (B) total petroleum hydrocarbons (PHC), aromatic (PAH) and alkanes (ALK), respectively.
Figure 4Scatter plots of the prediction datasets of total (A) and bioavailable (B) contents of HM/metalloids
RF outputs for the prediction for total and bioavailable concentrations of PHC, PAH, ALK and HM in contaminated soils.
| Compound | N° | R2 | RMSE (mg/kg) | RPD | RPIQ | ||
|---|---|---|---|---|---|---|---|
| Organics | Total (mg/kg) | PHC | 31 | 0.69 | 117.8 | 1.8 | 2.4 |
| PAH | 31 | 0.75 | 79.8 | 2.0 | 3.5 | ||
| ALK | 31 | 0.57 | 56.6 | 1.6 | 1.6 | ||
| Bioavailable (mg/kg) | PHC | 31 | 0.62 | 63.9 | 1.7 | 1.9 | |
| PAH | 31 | 0.65 | 51.9 | 1.7 | 2.1 | ||
| ALK | 31 | 0.40 | 35.4 | 1.3 | 1.1 | ||
| Inorganics | Total (mg/kg) | Al | 31 | 0.79 | 4101 | 2.2 | 3.3 |
| Cr | 31 | 0.76 | 7 | 2.1 | 2.6 | ||
| Cd | 31 | 0.76 | 0.2 | 2.1 | 2.3 | ||
| Ni | 31 | 0.77 | 5 | 2.1 | 2.8 | ||
| Zn | 31 | 0.71 | 235 | 1.9 | 1.7 | ||
| Se | 31 | 0.67 | 0.6 | 1.8 | 2.9 | ||
| Cu | 31 | 0.6 | 15 | 1.6 | 1.9 | ||
| Fe | 31 | 0.72 | 5997 | 1.9 | 2.6 | ||
| As | 31 | 0.72 | 3 | 1.9 | 2.8 | ||
| Pb | 31 | 0.81 | 217 | 2.4 | 2.3 | ||
| Bioavailable (mg/kg) | Al | 31 | 0.77 | 154 | 2.1 | 3.9 | |
| Cr | 31 | 0.75 | 0.2 | 2.0 | 3.4 | ||
| Cd | 31 | 0.76 | 0.2 | 2.0 | 2.2 | ||
| Ni | 31 | 0.65 | 1.3 | 1.7 | 2.3 | ||
| Zn | 31 | 0.56 | 222 | 1.6 | 1.2 | ||
| Se | 31 | 0.5 | 0.2 | 1.4 | 1.6 | ||
| Cu | 31 | 0.6 | 3 | 1.6 | 3 | ||
| Fe | 31 | 0.58 | 183 | 1.6 | 2.2 | ||
| As | 31 | 0.45 | 0.2 | 1.4 | 1.7 | ||
| Pb | 31 | 0.75 | 343 | 2.1 | 2.1 | ||