| Literature DB >> 29892503 |
Senlin Zhu1, Emmanuel Karlo Nyarko2, Marijana Hadzima-Nyarko3.
Abstract
The bio-chemical and physical characteristics of a river are directly affected by water temperature, which thereby affects the overall health of aquatic ecosystems. It is a complex problem to accurately estimate water temperature. Modelling of river water temperature is usually based on a suitable mathematical model and field measurements of various atmospheric factors. In this article, the air-water temperature relationship of the Missouri River is investigated by developing three different machine learning models (Artificial Neural Network (ANN), Gaussian Process Regression (GPR), and Bootstrap Aggregated Decision Trees (BA-DT)). Standard models (linear regression, non-linear regression, and stochastic models) are also developed and compared to machine learning models. Analyzing the three standard models, the stochastic model clearly outperforms the standard linear model and nonlinear model. All the three machine learning models have comparable results and outperform the stochastic model, with GPR having slightly better results for stations No. 2 and 3, while BA-DT has slightly better results for station No. 1. The machine learning models are very effective tools which can be used for the prediction of daily river temperature.Entities:
Keywords: Air temperature; Machine learning models; Missouri river; Standard regression models; Water temperature
Year: 2018 PMID: 29892503 PMCID: PMC5994338 DOI: 10.7717/peerj.4894
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Geographic locations of the three river stations and meteorological stations.
Detailed information about the three stations used in this study.
| Station No. | Station name | Long. | Lat. | Water temperature period | Meteorological station |
|---|---|---|---|---|---|
| 1 | Apple Creek Nr Menoken, ND | −100°39′25″ | 46°47′40″ | 2010∕1∕1–2013∕12∕31 | ND320819 |
| 2 | Missouri River at Yankton, SD | −97°23′38″ | 42°51′58″ | 2010∕10∕1–2013∕7∕16 | SD726525 |
| 3 | Missouri River at Kansas City, MO | −94°35′17″ | 39°06′43″ | 2007∕5∕9–2013∕12∕30 | MO724458 |
Figure 2Time series of air temperatures, water temperatures and flow discharges for the studied three stations: (A) time series of daily averaged air temperatures, corresponding water temperatures and flow discharges for Station No. 1; (B) time series of daily averaged air temperatures, and corresponding water temperatures for Station No. 2 (no available flow data for station No. 2); (C) time series of daily averaged air temperatures, corresponding water temperatures and flow discharges for Station No. 3.
Figure 3Optimal number of hidden neurons for: (A) station 1, (B) station 2, (C) station 3.
Regression expressions of standard models for the three stations in this study.
| Station No. | Standard models | Expressions |
|---|---|---|
| 1 | Linear model | |
| Non-linear model | ||
| Stochastic model | ||
| 2 | Linear model | |
| Non-linear model | ||
| Stochastic model | ||
| 3 | Linear model | |
| Non-linear model | ||
| Stochastic model |
Performance coefficients of the developed models by the testing datasets.
| Station No. | Models | ||||
|---|---|---|---|---|---|
| 1 | Linear model | 0.62 | 3.41 | 0.37 | – |
| Non-linear model | 0.64 | 3.34 | 0.29 | – | |
| Stochastic model | 0.94 | 2.14 | 0.72 | – | |
| ANN | 0.9665 | 1.9471 | 0.7601 | 0.6432 | |
| GPR | 0.9664 | 1.9784 | 0.7523 | 0.6317 | |
| BA-DT | 0.9508 | 1.8777 | 0.7769 | 0.6682 | |
| 2 | Linear model | 0.91 | 3.53 | 0.82 | – |
| Non-linear model | 0.94 | 2.99 | 0.87 | – | |
| Stochastic model | 0.98 | 1.86 | 0.95 | – | |
| ANN | 0.9833 | 1.5561 | 0.9657 | 0.7852 | |
| GPR | 0.9840 | 1.5524 | 0.9658 | 0.7862 | |
| BA-DT | 0.9823 | 1.5826 | 0.9645 | 0.7778 | |
| 3 | Linear model | 0.92 | 3.94 | 0.84 | – |
| Non-linear model | 0.93 | 3.62 | 0.86 | – | |
| Stochastic model | 0.98 | 1.72 | 0.97 | – | |
| ANN | 0.9993 | 1.5649 | 0.9741 | 0.8594 | |
| GPR | 0.9897 | 1.4950 | 0.9764 | 0.8717 | |
| BA-DT | 0.9849 | 1.8072 | 0.9655 | 0.8125 |
Figure 4Performance of BA-DT model for Station No. 1: (A) training dataset and (B) testing dataset.
Figure 5Performance of GPR model for Station No. 2: (A) training dataset and (B) testing dataset.
Figure 6Performance of GPR model for Station No. 3: (A) training dataset and (B) testing dataset.