| Literature DB >> 31374862 |
Jun Hyeong Kim1, Mi Lim Lee2, Chuljin Park3.
Abstract
This study addresses the problem of identifying the source location of a contaminant spill in a river system when a sensor network returns observations containing random measurement errors. To solve this problem, we suggest a new framework comprising three main steps: (i) spill detection, (ii) data preprocessing, and (iii) source identification. Specifically, we applied a statistical process control chart to detect a contaminant spill with measurement errors while keeping the false alarm rate at less than or equal to a user-specified value. After detecting a spill, we generated a nonlinear regression model to estimate a breakthrough curve of the observations and derive a characteristic vector of the estimated curve. Using the characteristic vector as an input, a random forest model was constructed with the sensor raising the first alarm. The model provides output values between 0 and 1 to represent the possibility of each candidate location being the true spill source. These possibility values allow users to identify strong candidate locations for the spill. The accuracy of our framework was tested on part of the Altamaha River system in Georgia, USA.Entities:
Keywords: random forest; river system; sensor network; source identification; statistical process control; water quality monitoring
Year: 2019 PMID: 31374862 PMCID: PMC6696032 DOI: 10.3390/s19153378
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Overall description of the proposed source identification framework.
Figure 2An example plot for .
Figure 3Examples of when sensors are installed at locations 9 and 19.
Figure 4An example of part of a tree classifier.
Figure 5The overall structure of a random forest model for source identification.
Figure 6Targeted study area with six sensors.
Random forest models with their OOB errors.
| Sensor Location | Random Forest Model | Set of Candidate Spill Locations | % of OOB Error (L, L) | % of OOB Error (H, H) |
|---|---|---|---|---|
| 9 |
|
| 29.34 | 33.84 |
| 19 |
|
| 21.08 | 26.68 |
| 26 |
|
| 4.49 | 7.43 |
| 33 |
|
| 4.46 | 9.83 |
| 46 |
|
| 30.82 | 35.17 |
| 53 |
|
| 4.4 | 10.11 |
Figure 7Identification accuracy regarding the top three locations under (L, L) configuration.
Figure 8Identification accuracy regarding the top three locations under (H, H) configuration.
Figure 9Examples of values for some related pairs of spill locations.