| Literature DB >> 31194054 |
Nick Lucius1, Kevin Rose2, Callin Osborn3, Matt E Sweeney2, Renel Chesak4, Scott Beslow2, Tom Schenk1.
Abstract
Culture-based methods to measure Escherichia coli (E. coli) are used by beach administrators to inform whether bacteria levels represent an elevated risk to swimmers. Since results take up to 12 h, statistical models are used to forecast bacteria levels in lieu of test results; however they underestimate days with elevated fecal indicator bacteria levels. Quantitative polymerase chain reaction (qPCR) tests return results within 3 h but are 2-5 times more expensive than culture-based methods. This paper presents a prediction model which uses limited deployments of qPCR tested sites with inter-beach correlation to predict when bacteria will exceed acceptable thresholds. The model can be used to inform management decisions on when to warn residents or close beaches due to exposure to the bacteria. Using data from Chicago collected between 2006 and 2016, the model proposed in this paper increased sensitivity from 3.4 percent to 11.2 percent-a 230 percent increase. We find that the correlation between beaches are substantial enough to provide higher levels of precision and sensitivity to predictive models. Thus, limited deployments of qPCR testing can be used to deliver better predictions for beach administrators at lower cost and less complexity.Entities:
Keywords: Chicago; Escherichia coli; Fecal indicator bacteria; Random forest; Recreational water quality
Year: 2018 PMID: 31194054 PMCID: PMC6549907 DOI: 10.1016/j.wroa.2018.100016
Source DB: PubMed Journal: Water Res X ISSN: 2589-9147
Fig. 1Map of Lake Michigan beaches in Chicago with results of k-means clustering.
Fig. 2Pearson correlation coefficient heat map of daily E. coli levels at Chicago beaches between 2006 and 2017. Each level of tree shows the nearest-neighbor correlation.
Final results of K-means clustering. The ˆ denotes feature beaches used to predict remaining beaches in the cluster.
| Cluster 1 | Cluster 2 | Cluster 3 | Cluster 4 | Cluster 5 | |
|---|---|---|---|---|---|
| Fosterˆ | North Avenueˆ | Leoneˆ | 31stˆ | South Shoreˆ | |
| Osterman | Oak Street | Juneway | 12th | 57th | |
| Albion | Howard | 39th | |||
| Rogers | |||||
| Jarvis |
Variables used for multivariate Hybrid model.
| Variables |
|---|
| Precipitation |
| Forecast for today |
| Yesterday total rainfall |
| 3 day total rainfall |
| Total rainfall total until 8am |
| Sunlight |
| Daily cloud cover forecast |
| Prior-day cloudiness |
| 3-day total cloudiness |
| Length of daylight time in a day |
| Wind |
| Wind Direction |
| Wind Speed |
| 1-day average wind speed |
| 3-day average wind speed |
| Wind speed at 8am |
| Tidal Levels |
| Lunar phase |
| Lake Level |
| 1-day lake level |
| 3-day average lake level |
| Visitor Density |
| Indicator for weekday |
| Julian date |
Fig. 3Variable importance of each factor when added to random forest model as measured by mean squared error.
Fig. 4Plot of the log of raw fitted values versus residuals from the random forest model.
Fig. 5Model performance measured by mean squared error (MSR) as the number of trees grow within the random forest model.
Fig. 6Receiver Operating Characteristic (ROC) for Hybrid Model and Prior-day Model. Dashed line shows historical false positive rate for prior-day model in Chicago.
Comparing Specificity, Sensitivity, Accuracy, Area Under Curve (AUC), and Matthews Correlation Coefficient (MCC) between Hybrid, Multivariate, and Prior-day Nowcast models for the 15 pilot beaches.
| Model | Specificity | Sensitivity | Precision | Accuracy | AUC | MCC |
|---|---|---|---|---|---|---|
| 2017 Hybrid | 0.980 | 0.112 | 0.273 | 0.926 | 0.753 | 0.142 |
| 2017 Multivariate | 0.971 | 0.060 | 0.125 | 0.912 | 0.738 | 0.044 |
| 2016 Prior-day | 0.988 | 0.037 | 0.167 | 0.930 | 0.644 | 0.052 |
| 2015 Prior-day | 0.984 | 0.032 | 0.176 | 0.892 | 0.655 | 0.036 |
Fig. 7Comparison of the 2017 hybrid pilot to existing prior-day model for “true positive rates” (sensitivity) and “false positive rates” (type I error).