Literature DB >> 32078849

Hybrid decision tree-based machine learning models for short-term water quality prediction.

Hongfang Lu1, Xin Ma2.   

Abstract

Water resources are the foundation of people's life and economic development, and are closely related to health and the environment. Accurate prediction of water quality is the key to improving water management and pollution control. In this paper, two novel hybrid decision tree-based machine learning models are proposed to obtain more accurate short-term water quality prediction results. The basic models of the two hybrid models are extreme gradient boosting (XGBoost) and random forest (RF), which respectively introduce an advanced data denoising technique - complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN). Taking the water resources of Gales Creek site in Tualatin River (one of the most polluted rivers in the world) Basin as an example, a total of 1875 data (hourly data) from May 1, 2019 to July 20, 2019 are collected. Two hybrid models are used to predict six water quality indicators, including water temperature, dissolved oxygen, pH value, specific conductance, turbidity, and fluorescent dissolved organic matter. Six error metrics are introduced as the basis of performance evaluation, and the results of the two models are compared with the other four conventional models. The results reveal that: (1) CEEMDAN-RF performs best in the prediction of temperature, dissolved oxygen and specific conductance, the mean absolute percentage errors (MAPEs) are 0.69%, 1.05%, and 0.90%, respectively. CEEMDAN-XGBoost performs best in the prediction of pH value, turbidity, and fluorescent dissolved organic matter, the MAPEs are 0.27%, 14.94%, and 1.59%, respectively. (2) The average MAPEs of CEEMDAN-RF and CEEMMDAN-XGBoost models are the smallest, which are 3.90% and 3.71% respectively, indicating that their overall prediction performance is the best. In addition, the stability of the prediction model is also discussed in this paper. The analysis shows that the prediction stability of CEEMDAN-RF and CEEMDAN-XGBoost is higher than other benchmark models.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Data denoising; Decision tree-based model; Extreme gradient boosting; Random forest; Short-term; Water quality prediction

Year:  2020        PMID: 32078849     DOI: 10.1016/j.chemosphere.2020.126169

Source DB:  PubMed          Journal:  Chemosphere        ISSN: 0045-6535            Impact factor:   7.086


  9 in total

1.  Design of Online Ideological and Political Teaching of Building Architecture from the Perspective of Machine Learning.

Authors:  Xuhui Li
Journal:  Comput Intell Neurosci       Date:  2022-05-18

2.  Predictive water virology using regularized regression analyses for projecting virus inactivation efficiency in ozone disinfection.

Authors:  Syun-Suke Kadoya; Osamu Nishimura; Hiroyuki Kato; Daisuke Sano
Journal:  Water Res X       Date:  2021-02-12

3.  Predicting suspended sediment load in Peninsular Malaysia using support vector machine and deep learning algorithms.

Authors:  Yusuf Essam; Yuk Feng Huang; Ahmed H Birima; Ali Najah Ahmed; Ahmed El-Shafie
Journal:  Sci Rep       Date:  2022-01-07       Impact factor: 4.379

4.  Regulation-based probabilistic substance quality index and automated geo-spatial modeling for water quality assessment.

Authors:  Artyom Nikitin; Polina Tregubova; Dmitrii Shadrin; Sergey Matveev; Ivan Oseledets; Maria Pukalchik
Journal:  Sci Rep       Date:  2021-12-10       Impact factor: 4.379

5.  Water quality prediction in sea cucumber farming based on a GRU neural network optimized by an improved whale optimization algorithm.

Authors:  Huanhai Yang; Shue Liu
Journal:  PeerJ Comput Sci       Date:  2022-05-31

6.  Machine learning-based estimation of riverine nutrient concentrations and associated uncertainties caused by sampling frequencies.

Authors:  Shengyue Chen; Zhenyu Zhang; Juanjuan Lin; Jinliang Huang
Journal:  PLoS One       Date:  2022-07-13       Impact factor: 3.752

7.  Groundwater Quality: The Application of Artificial Intelligence.

Authors:  Mosleh Hmoud Al-Adhaileh; Theyazn H H Aldhyani; Fawaz Waselallah Alsaade; Mohammed Al-Yaari; Ali Khalaf Ahmed Albaggar
Journal:  J Environ Public Health       Date:  2022-08-24

8.  Water Quality Indicator Interval Prediction in Wastewater Treatment Process Based on the Improved BES-LSSVM Algorithm.

Authors:  Meng Zhou; Yinyue Zhang; Jing Wang; Yuntao Shi; Vicenç Puig
Journal:  Sensors (Basel)       Date:  2022-01-06       Impact factor: 3.576

9.  An improved adaptive neuro fuzzy inference system model using conjoined metaheuristic algorithms for electrical conductivity prediction.

Authors:  Iman Ahmadianfar; Seyedehelham Shirvani-Hosseini; Jianxun He; Arvin Samadi-Koucheksaraee; Zaher Mundher Yaseen
Journal:  Sci Rep       Date:  2022-03-23       Impact factor: 4.996

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.