Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Comparative analysis of surface water quality prediction performance and identification of key water parameters using different machine learning models based on big data.

Literature DB >> 31918388

Comparative analysis of surface water quality prediction performance and identification of key water parameters using different machine learning models based on big data.

Kangyang Chen¹, Hexia Chen², Chuanlong Zhou², Yichao Huang², Xiangyang Qi¹, Ruqin Shen³, Fengrui Liu⁴, Min Zuo⁵, Xinyi Zou¹, Jinfeng Wang⁶, Yan Zhang⁶, Da Chen², Xingguo Chen⁷, Yongfeng Deng⁸, Hongqiang Ren⁶.

Abstract

The water quality prediction performance of machine learning models may be not only dependent on the models, but also dependent on the parameters in data set chosen for training the learning models. Moreover, the key water parameters should also be identified by the learning models, in order to further reduce prediction costs and improve prediction efficiency. Here we endeavored for the first time to compare the water quality prediction performance of 10 learning models (7 traditional and 3 ensemble models) using big data (33,612 observations) from the major rivers and lakes in China from 2012 to 2018, based on the precision, recall, F1-score, weighted F1-score, and explore the potential key water parameters for future model prediction. Our results showed that the bigger data could improve the performance of learning models in prediction of water quality. Compared to other 7 models, decision tree (DT), random forest (RF) and deep cascade forest (DCF) trained by data sets of pH, DO, CODMn, and NH3-N had significantly better performance in prediction of all 6 Levels of water quality recommended by Chinese government. Moreover, two key water parameter sets (DO, CODMn, and NH3-N; CODMn, and NH3-N) were identified and validated by DT, RF and DCF to be high specificities for perdition water quality. Therefore, DT, RF and DCF with selected key water parameters could be prioritized for future water quality monitoring and providing timely water quality warning.

Entities: Chemical

Keywords: Deep cascade forest; Ensemble methods; Machine learning models; The key water parameters; Water quality prediction

Year: 2019 PMID： 31918388 DOI： 10.1016/j.watres.2019.115454

Source DB: PubMed Journal: Water Res ISSN： 0043-1354 Impact factor: 11.236

Keyword Cloud
Cited

6 in total

1. AI-big data analytics for building automation and management systems: a survey, actual challenges and future perspectives.

Authors: Yassine Himeur; Mariam Elnour; Fodil Fadli; Nader Meskin; Ioan Petri; Yacine Rezgui; Faycal Bensaali; Abbes Amira
Journal: Artif Intell Rev Date: 2022-10-15 Impact factor: 9.588

2. Prediction of E. coli Concentrations in Agricultural Pond Waters: Application and Comparison of Machine Learning Algorithms.

Authors: Matthew D Stocker; Yakov A Pachepsky; Robert L Hill
Journal: Front Artif Intell Date: 2022-01-11

3. Regulation-based probabilistic substance quality index and automated geo-spatial modeling for water quality assessment.

Authors: Artyom Nikitin; Polina Tregubova; Dmitrii Shadrin; Sergey Matveev; Ivan Oseledets; Maria Pukalchik
Journal: Sci Rep Date: 2021-12-10 Impact factor: 4.379

4. Use of Artificial Neural Networks as a Predictive Tool of Dissolved Oxygen Present in Surface Water Discharged in the Coastal Lagoon of the Mar Menor (Murcia, Spain).

Authors: Eva M García Del Toro; Luis Francisco Mateo; Sara García-Salgado; M Isabel Más-López; Maria Ángeles Quijano
Journal: Int J Environ Res Public Health Date: 2022-04-09 Impact factor: 4.614

5. Water Quality Prediction Based on Multi-Task Learning.

Authors: Huan Wu; Shuiping Cheng; Kunlun Xin; Nian Ma; Jie Chen; Liang Tao; Min Gao
Journal: Int J Environ Res Public Health Date: 2022-08-06 Impact factor: 4.614

6. Machine learning-based estimation of riverine nutrient concentrations and associated uncertainties caused by sampling frequencies.

Authors: Shengyue Chen; Zhenyu Zhang; Juanjuan Lin; Jinliang Huang
Journal: PLoS One Date: 2022-07-13 Impact factor: 3.752

6 in total