Literature DB >> 33552026

Tracking Major Sources of Water Contamination Using Machine Learning.

Jianyong Wu1, Conghe Song2, Eric A Dubinsky3, Jill R Stewart1.   

Abstract

Current microbial source tracking techniques that rely on grab samples analyzed by individual endpoint assays are inadequate to explain microbial sources across space and time. Modeling and predicting host sources of microbial contamination could add a useful tool for watershed management. In this study, we tested and evaluated machine learning models to predict the major sources of microbial contamination in a watershed. We examined the relationship between microbial sources, land cover, weather, and hydrologic variables in a watershed in Northern California, United States. Six models, including K-nearest neighbors (KNN), Naïve Bayes, Support vector machine (SVM), simple neural network (NN), Random Forest, and XGBoost, were built to predict major microbial sources using land cover, weather and hydrologic variables. The results showed that these models successfully predicted microbial sources classified into two categories (human and non-human), with the average accuracy ranging from 69% (Naïve Bayes) to 88% (XGBoost). The area under curve (AUC) of the receiver operating characteristic (ROC) illustrated XGBoost had the best performance (average AUC = 0.88), followed by Random Forest (average AUC = 0.84), and KNN (average AUC = 0.74). The importance index obtained from Random Forest indicated that precipitation and temperature were the two most important factors to predict the dominant microbial source. These results suggest that machine learning models, particularly XGBoost, can predict the dominant sources of microbial contamination based on the relationship of microbial contaminants with daily weather and land cover, providing a powerful tool to understand microbial sources in water.
Copyright © 2021 Wu, Song, Dubinsky and Stewart.

Entities:  

Keywords:  XGBoost; fecal contamination; land use; machine learning; microbial source tracking; rainfall

Year:  2021        PMID: 33552026      PMCID: PMC7854693          DOI: 10.3389/fmicb.2020.616692

Source DB:  PubMed          Journal:  Front Microbiol        ISSN: 1664-302X            Impact factor:   5.640


  24 in total

1.  The influence of rainfall on the incidence of microbial faecal indicators and the dominant sources of faecal pollution in a Florida river.

Authors:  S D Shehane; V J Harwood; J E Whitlock; J B Rose
Journal:  J Appl Microbiol       Date:  2005       Impact factor: 3.772

2.  Microbial water quality in freshwater lakes with different land use.

Authors:  Z R Staley; E Chase; C Mitraki; T L Crisman; V J Harwood
Journal:  J Appl Microbiol       Date:  2013-08-20       Impact factor: 3.772

3.  Linking landscape patterns to sources of water contamination: Implications for tracking fecal contaminants with geospatial and Bayesian approaches.

Authors:  Jianyong Wu
Journal:  Sci Total Environ       Date:  2018-09-08       Impact factor: 7.963

4.  Storm loads of culturable and molecular fecal indicators in an inland urban stream.

Authors:  Hehuan Liao; Leigh-Anne H Krometis; W Cully Hession; Romina Benitez; Richard Sawyer; Erin Schaberg; Emily von Wagoner; Brian D Badgley
Journal:  Sci Total Environ       Date:  2015-06-05       Impact factor: 7.963

5.  Performance of forty-one microbial source tracking methods: a twenty-seven lab evaluation study.

Authors:  Alexandria B Boehm; Laurie C Van De Werfhorst; John F Griffith; Patricia A Holden; Jenny A Jay; Orin C Shanks; Dan Wang; Stephen B Weisberg
Journal:  Water Res       Date:  2013-07-05       Impact factor: 11.236

6.  Evaluation of molecular community analysis methods for discerning fecal sources and human waste.

Authors:  Yiping Cao; Laurie C Van De Werfhorst; Eric A Dubinsky; Brian D Badgley; Michael J Sadowsky; Gary L Andersen; John F Griffith; Patricia A Holden
Journal:  Water Res       Date:  2013-07-05       Impact factor: 11.236

7.  Geographic setting influences Great Lakes beach microbiological water quality.

Authors:  Sheridan K Haack; Lisa R Fogarty; Erin A Stelzer; Lori M Fuller; Angela K Brennan; Natasha M Isaacs; Heather E Johnson
Journal:  Environ Sci Technol       Date:  2013-09-27       Impact factor: 9.028

8.  Variability of E. coli density and sources in an urban watershed.

Authors:  J Wu; P Rees; S Dorner
Journal:  J Water Health       Date:  2011-03       Impact factor: 1.744

9.  Impact of mixed land-use practices on the microbial water quality in a subtropical coastal watershed.

Authors:  Zhanbei Liang; Zhenli He; Xuxia Zhou; Charles A Powell; Yuangen Yang; Li Ming He; Peter J Stoffella
Journal:  Sci Total Environ       Date:  2013-02-26       Impact factor: 7.963

10.  Linking fecal bacteria in rivers to landscape, geochemical, and hydrologic factors and sources at the basin scale.

Authors:  Marc P Verhougstraete; Sherry L Martin; Anthony D Kendall; David W Hyndman; Joan B Rose
Journal:  Proc Natl Acad Sci U S A       Date:  2015-08-03       Impact factor: 11.205

View more
  1 in total

1.  Ontology-aware deep learning enables ultrafast and interpretable source tracking among sub-million microbial community samples from hundreds of niches.

Authors:  Yuguo Zha; Hui Chong; Hao Qiu; Kai Kang; Yuzheng Dun; Zhixue Chen; Xuefeng Cui; Kang Ning
Journal:  Genome Med       Date:  2022-04-26       Impact factor: 15.266

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.