Literature DB >> 19211132

Determination of the optimal parameters in regression models for the prediction of chlorophyll-a: a case study of the Yeongsan Reservoir, Korea.

Kyung Hwa Cho1, Joo-Hyon Kang, Seo Jin Ki, Yongeun Park, Sung Min Cha, Joon Ha Kim.   

Abstract

Statistical regression models involve linear equations, which often lead to significant prediction errors due to poor statistical stability and accuracy. This concern arises from multicollinearity in the models, which may drastically affect model performance in terms of a trade-off scenario for effective water resource management logistics. In this paper, we propose a new methodology for improving the statistical stability and accuracy of regression models, and then show how to cope with pitfalls in the models and determine optimal parameters with a decreased number of predictive variables. Here, a comparison of the predictive performance was made using four types of multiple linear regression (MLR) and principal component regression (PCR) models in the prediction of chlorophyll-a (chl-a) concentration in the Yeongsan (YS) Reservoir, Korea, an estuarine reservoir that historically suffers from high levels of nutrient input. During a 3-year water quality monitoring period, results showed that PCRs could be a compact solution for improving the accuracy of the models, as in each case MLR could not accurately produce reliable predictions due to a persistent collinearity problem. Furthermore, based on R(2) (goodness of fit) and F-overall number (confidence of regression), and the number of explanatory variables (R-F-N) curve, it was revealed that PCR-F(7) was the best model among the four regression models in predicting chl-a, having the fewest explanatory variables (seven) and the lowest uncertainty. Seven PCs were identified as significant variables, related to eight water quality parameters: pH, 5-day biochemical oxygen demand, total coliform, fecal indicator bacteria, chemical oxygen demand, ammonia-nitrogen, total nitrogen, and dissolved oxygen. Overall, the results not only demonstrated that the models employed successfully simulated chl-a in a reservoir in both the test and validation periods, but also suggested that the optimal parameters should cautiously be considered in the design of regression models.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19211132     DOI: 10.1016/j.scitotenv.2009.01.017

Source DB:  PubMed          Journal:  Sci Total Environ        ISSN: 0048-9697            Impact factor:   7.963


  5 in total

1.  Modelling geosmin concentrations in three sources of raw water in Quebec, Canada.

Authors:  Julien Parinet; Manuel J Rodriguez; Jean-Baptiste Sérodes
Journal:  Environ Monit Assess       Date:  2012-02-09       Impact factor: 2.513

2.  Identification of key factors influencing primary productivity in two river-type reservoirs by using principal component regression analysis.

Authors:  Yeonjung Lee; Sun-Yong Ha; Hae-Kyung Park; Myung-Soo Han; Kyung-Hoon Shin
Journal:  Environ Monit Assess       Date:  2015-03-27       Impact factor: 2.513

3.  Comparing artificial intelligence techniques for chlorophyll-a prediction in US lakes.

Authors:  Wenguang Luo; Senlin Zhu; Shiqiang Wu; Jiangyu Dai
Journal:  Environ Sci Pollut Res Int       Date:  2019-09-03       Impact factor: 4.223

4.  Advancing analysis of spatio-temporal variations of soil nutrients in the water level fluctuation zone of China's Three Gorges Reservoir using self-organizing map.

Authors:  Chen Ye; Siyue Li; Yuyi Yang; Xiao Shu; Jiaquan Zhang; Quanfa Zhang
Journal:  PLoS One       Date:  2015-03-19       Impact factor: 3.240

5.  Determination of the optimal training principle and input variables in artificial neural network model for the biweekly chlorophyll-a prediction: a case study of the Yuqiao Reservoir, China.

Authors:  Yu Liu; Du-Gang Xi; Zhao-Liang Li
Journal:  PLoS One       Date:  2015-03-13       Impact factor: 3.240

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.