| Literature DB >> 25768650 |
Yu Liu1, Du-Gang Xi2, Zhao-Liang Li3.
Abstract
Predicting the levels of chlorophyll-a (Chl-a) is a vital component of water quality management, which ensures that urban drinking water is safe from harmful algal blooms. This study developed a model to predict Chl-a levels in the Yuqiao Reservoir (Tianjin, China) biweekly using water quality and meteorological data from 1999-2012. First, six artificial neural networks (ANNs) and two non-ANN methods (principal component analysis and the support vector regression model) were compared to determine the appropriate training principle. Subsequently, three predictors with different input variables were developed to examine the feasibility of incorporating meteorological factors into Chl-a prediction, which usually only uses water quality data. Finally, a sensitivity analysis was performed to examine how the Chl-a predictor reacts to changes in input variables. The results were as follows: first, ANN is a powerful predictive alternative to the traditional modeling techniques used for Chl-a prediction. The back program (BP) model yields slightly better results than all other ANNs, with the normalized mean square error (NMSE), the correlation coefficient (Corr), and the Nash-Sutcliffe coefficient of efficiency (NSE) at 0.003 mg/l, 0.880 and 0.754, respectively, in the testing period. Second, the incorporation of meteorological data greatly improved Chl-a prediction compared to models solely using water quality factors or meteorological data; the correlation coefficient increased from 0.574-0.686 to 0.880 when meteorological data were included. Finally, the Chl-a predictor is more sensitive to air pressure and pH compared to other water quality and meteorological variables.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25768650 PMCID: PMC4359150 DOI: 10.1371/journal.pone.0119082
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1A map of the Yuqiao Reservoir in Tianjin, China.
Correlation coefficients of Chlorophyll-a (Chl-a) with water quality and Chl-a with meteorological variables.
| Variables | Day0
| Day15
| Day30 | Day45 | Day60 | Day75 | Day90 | Day105 | Day120 | Day135 | Day150 | Day165 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Tw | 0.28 | 0.33 | 0.58 | 0.58 | 0.59 | 0.57 | 0.55 | 0.51 | 0.47 | 0.42 | 0.36 | 0.3 |
| pH | 0.18 | 0.2 | 0.23 | 0.24 | 0.26 | 0.28 | 0.29 | 0.3 | 0.29 | 0.27 | 0.24 | 0.19 |
| Cond | 0.00 | 0.02 | -0.05 | -0.1 | -0.15 | -0.18 | -0.2 | -0.21 | -0.22 | -0.22 | -0.23 | -0.25 |
| Tran | 0.46 | 0.49 | 0.49 | 0.48 | 0.48 | 0.45 | 0.43 | 0.4 | 0.32 | 0.26 | 0.2 | 0.17 |
| CL | 0.14 | 0.19 | 0.17 | 0.15 | 0.13 | 0.1 | 0.09 | 0.06 | 0.04 | 0.01 | -0.02 | -0.06 |
| Hard | -0.32 | -0.33 | -0.42 | -0.44 | -0.48 | -0.5 | -0.49 | -0.47 | -0.45 | -0.41 | -0.38 | -0.34 |
| NH4-N | 0.05 | 0.09 | 0.02 | -0.03 | -0.1 | -0.18 | -0.23 | -0.29 | -0.35 | -0.39 | -0.42 | -0.44 |
| NO3-N | -0.55 | -0.56 | -0.55 | -0.52 | -0.49 | -0.46 | -0.43 | -0.39 | -0.36 | -0.32 | -0.28 | -0.24 |
| NO2-N | -0.07 | -0.14 | -0.16 | -0.16 | -0.17 | -0.2 | -0.24 | -0.3 | -0.36 | -0.41 | -0.46 | -0.49 |
| TN | -0.45 | -0.42 | -0.45 | -0.44 | -0.43 | -0.42 | -0.4 | -0.38 | -0.36 | -0.32 | -0.29 | -0.26 |
| DO | -0.4 | -0.47 | -0.51 | -0.52 | -0.51 | -0.49 | -0.45 | -0.39 | -0.31 | -0.22 | -0.14 | -0.06 |
| PI | 0.59 | 0.57 | 0.57 | 0.59 | 0.57 | 0.53 | 0.48 | 0.42 | 0.35 | 0.28 | 0.21 | 0.13 |
| BOD | 0.11 | 0.03 | 0.01 | 0.02 | 0 | -0.03 | -0.06 | -0.05 | -0.06 | -0.05 | -0.04 | -0.03 |
| TP | 0.54 | 0.46 | 0.53 | 0.56 | 0.56 | 0.54 | 0.52 | 0.46 | 0.39 | 0.33 | 0.27 | 0.19 |
| PS | -0.04 | 0 | -0.08 | -0.11 | -0.11 | -0.13 | -0.14 | -0.15 | -0.15 | -0.16 | -0.17 | -0.17 |
| TS | -0.07 | -0.15 | -0.15 | -0.16 | -0.19 | -0.19 | -0.17 | -0.17 | -0.17 | -0.16 | -0.18 | -0.22 |
| SPS | 0.47 | 0.4 | 0.45 | 0.41 | 0.39 | 0.39 | 0.36 | 0.29 | 0.24 | 0.19 | 0.14 | 0.09 |
| SLS | -0.15 | -0.15 | -0.15 | -0.17 | -0.18 | -0.18 | -0.18 | -0.2 | -0.2 | -0.21 | -0.22 | -0.22 |
| SAL | -0.24 | -0.27 | -0.28 | -0.29 | -0.3 | -0.3 | -0.28 | -0.27 | -0.26 | -0.25 | -0.25 | -0.26 |
| Chl- | 1.00 | 0.71 | 0.69 | 0.69 | 0.66 | 0.6 | 0.54 | 0.48 | 0.41 | 0.35 | 0.29 | 0.23 |
| P | 0.31 | 0.36 | 0.41 | 0.46 | 0.51 | 0.55 | 0.53 | 0.5 | 0.48 | 0.46 | 0.45 | 0.43 |
| Pmax | 0.28 | 0.34 | 0.4 | 0.45 | 0.5 | 0.55 | 0.52 | 0.51 | 0.48 | 0.47 | 0.47 | 0.46 |
| Pmin | 0.29 | 0.35 | 0.41 | 0.46 | 0.51 | 0.55 | 0.53 | 0.5 | 0.48 | 0.48 | 0.47 | 0.47 |
| Ta | 0.45 | 0.46 | 0.47 | 0.47 | 0.48 | 0.49 | 0.51 | 0.52 | 0.51 | 0.51 | 0.49 | 0.48 |
| Tamax | 0.41 | 0.42 | 0.43 | 0.43 | 0.45 | 0.46 | 0.48 | 0.49 | 0.48 | 0.48 | 0.46 | 0.45 |
| Tamin | 0.42 | 0.43 | 0.44 | 0.44 | 0.45 | 0.46 | 0.48 | 0.49 | 0.48 | 0.48 | 0.46 | 0.45 |
| PCP | 0.42 | 0.44 | 0.46 | 0.48 | 0.48 | 0.47 | 0.45 | 0.44 | 0.42 | 0.41 | 0.39 | 0.38 |
| WS | 0.29 | 0.36 | 0.36 | 0.38 | 0.37 | 0.29 | 0.28 | 0.27 | 0.26 | 0.26 | 0.25 | 0.25 |
| WSmax | 0.28 | 0.35 | 0.36 | 0.38 | 0.37 | 0.29 | 0.28 | 0.26 | 0.25 | 0.25 | 0.24 | 0.24 |
| SD | 0.23 | 0.25 | 0.3 | 0.31 | 0.3 | 0.28 | 0.26 | 0.24 | 0.24 | 0.23 | 0.23 | 0.22 |
| R | 0.21 | 0.25 | 0.31 | 0.34 | 0.32 | 0.29 | 0.26 | 0.23 | 0.22 | 0.22 | 0.21 | 0.21 |
Note:
* Tw, water temperature; Cond, conductivity; Tran, transparency; CL, chloride; Hard, hardness; NH4-N, ammonia nitrogen; NO3-N, nitrate-nitrogen; NO2-N, nitrite-nitrogen; TN, total nitrogen; DO, dissolved oxygen; PI, permanganate index; BOD, biochemical oxygen demand; TP, total phosphorus; PS, phosphate; TS, total solids; SPS, suspended solids; SLS, soluble solids; SAL, salinity; P, daily mean air pressure; Pmax, maximum air pressure; Pmin, minimum air pressure; Ta, average air temperature; Tamax, maximum air temperature; Tamin, minimum air temperature; PCP, precipitation; WS, average wind speed; WSmax, maximum wind speed; SD, sunshine duration; R, total radiation.
** Day0, data of the predicted day;
*** Day15, Day30…Day165, the average data of the previous 15, 30…165 days.
Features and variables of the Chl-a prediction model.
| Type | Features | No. of variables | Variables |
|---|---|---|---|
| Water quality | Tw | 6 | Tw30, Tw45, Tw60, Tw75, Tw90, Tw105 |
| pH | 1 | pH105 | |
| DO | 3 | DO30, DO45, DO60 | |
| PI | 3 | PI30, PI45, PI60 | |
| TP | 5 | TP30, TP45, TP60, TP75, TP90 | |
| NO3-N | 5 | Nia0, Nia15, Nia30, Nia45, Nia60 | |
| Chl- | 4 | Chla15, Chla30, Chla45, Chla60 | |
| Meteorology | P | 4 | P60, P75, P90, P105 |
| Ta | 4 | Ta90, Ta105, Ta120, Ta135 | |
| WS | 5 | WS0, WS15, WS30, WS45, WS60 | |
| SD | 5 | SD0, SD15, SD30, SD45, SD60 | |
| R | 4 | R30, R45, R60, R75 |
Note:
* Features are the same as listed in Table 1.
** Average value of the variables over the indicated number of preceding days. For example, Tw30 represents the average water temperature of the preceding 30 days.
Fig 2Configuration of the Chlorophyll-a (Chl-a) predictor using artificial neural networks (ANN).
Left: input water quality and meteorological variables extracted by the correlation coefficient threshold method; middle: configuration of the predictor; right: predicted Chl-a. White circles on the left and right represent input and output neurons, respectively. Black circles represent neurons in the hidden layer. Lines around the circles indicate the data flow. A total of 49 variables of 12 features (27 variables of 7 water quality features and 22 variables of 5 meteorological features) were used.
Parameters of the eight Chl- predictors.
| Type of network | Output layer | Hidden layer | |||
|---|---|---|---|---|---|
| Learning step | Number of hidden layers | Number of Neurons | Learning step | ||
| ANN | BP | 0.1 | 2 | 40,30 | 0.1 |
| MNN | 0.1 | 1 | 45,30 | 0.1 | |
| Jordan-Elman | 0.1 | 1 | 40 | 0.1 | |
| PNN | 1 | 1 | 60 | 1 | |
| SOM | 0.1 | 1 | 60 | 1 | |
| CANFIS | 0.1 | 50 | 0.1 | ||
| Non-ANN | SVM | 0.01 | |||
| PCA) | 0.1 | ||||
Note:
* BP, Back Propagation; MNN, Modular Neural Network; Jordan-Elman, Jordan-Elman network; PNN, Probabilistic Neural Network; SOM, Self-Organizing Map network; CANFIS, Co-Active Neuro-Fuzzy Inference System; SVM, the Support Vector Machine; PCA, Principal Component Analysis.
** Number of neurons: (1): Hidden layer 1: 40; Hidden layer 2: 30; (2): Upper processing elements (PEs) = 45; Low PEs = 30.
*** Some structural parameters of the models are (1) Time: 0.4; Integrator axon; (2): Cluster: 40; Competitive: conscience; Metric: Euclidean; (3) Rows: 4; columns: 10; Starting: 4; Final radius: 0; Neighborhood shape: Square Kononen Full; (4) Gamma axon memory; Depth in: 10; Trajectory: 50; (5) Learning rule: Sanger full; Principal 4. Further information on the parameters is available at http://www.neurosolutions.com/downloads/documentation.html.
Results of the eight Chl-a predictors with the same inputs.
| Method | Time (S) | Training | Validation | Testing | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| NMSE (mg/l) | Corr | NSE | NMSE (mg/l) | Corr | NSE | NMSE (mg/l) | Corr | NSE | ||
| BP | 12.00 | 0.004 | 0.897 | 0.772 | 0.003 | 0.895 | 0.808 | 0.003 | 0.88 | 0.754 |
| MNN | 7.00 | 0.005 | 0.686 | 0.714 | 0.005 | 0.6 | 0.739 | 0.006 | 0.553 | 0.637 |
| Jordan/El man | 14.67 | 0.004 | 0.648 | 0.736 | 0.005 | 0.53 | 0.703 | 0.004 | 0.489 | 0.629 |
| PNN | 42.00 | 0.003 | 0.766 | 0.711 | 0.003 | 0.684 | 0.634 | 0.004 | 0.631 | 0.668 |
| SOM | 14.00 | 0.003 | 0.748 | 0.749 | 0.003 | 0.654 | 0.633 | 0.006 | 0.603 | 0.604 |
| CANFIS | 29.33 | 0.004 | 0.742 | 0.761 | 0.005 | 0.676 | 0.706 | 0.005 | 0.624 | 0.702 |
| PCA | 26.30 | 0.003 | 0.604 | 0.619 | 0.005 | 0.576 | 0.583 | 0.005 | 0.524 | 0.540 |
| SVM | 14.87 | 0.227 | -0.050 | 0.49 | 0.341 | -0.062 | 0.314 | 0.258 | -0.057 | 0.491 |
Note:
* Methods are the same as shown in Table 2.
Fig 3Scatter plots of the observed data vs. the model predictions using different inputs.
(A) Chl-a prediction with different variables used as inputs for training; (B) is the same as (A) but for validation; (C) is the same as (A) but for testing.
Results of three Chl-a predictors with different inputs
| Evaluation indices | Training | Validation | Testing | ||||||
|---|---|---|---|---|---|---|---|---|---|
| WQ* | MF** | WF*** | WQ | MF | WF | WQ | MF | WF | |
| RMSE (mg/l) | 0.005 | 0.008 | 0.005 | 0.003 | 0.003 | 0.003 | 0.004 | 0.005 | 0.003 |
| Corr | 0.710 | 0.844 | 0.897 | 0.735 | 0.884 | 0.895 | 0.574 | 0.686 | 0.880 |
| NSE | 0.395 | 0.679 | 0.772 | 0.361 | 0.674 | 0.808 | 0.225 | 0.662 | 0.754 |
Note:
*, **, and *** indicate models using only water quality factors, only meteorological factors, and both water quality and meteorological factors as inputs, respectively.
Fig 4Sensitivity of the predictor to water quality variables.
Bars indicate changes in Chl-a values caused by changes in the input variables, which were altered by 5%, 10% and 20%. Black, slash-filled, and cross line-filled bars indicate the change in Chl-a values caused by 5%, 10%, and 20% changes in input variables, respectively. Tw, water temperature; DO, dissolved oxygen; PI, permanganate index; TP, total phosphorus; NO3-N, nitrate-nitrogen.
Fig 5Sensitivity of the predictor to meteorological variables.
Bars indicate changes as described for Fig. 4. P, daily average air pressure; Ta, average air temperature; WS, wind speed; SD, sunshine duration; R, total radiation.