| Literature DB >> 36142084 |
Guoliang Guan1, Yonggui Wang1, Ling Yang1, Jinzhao Yue1, Qiang Li1, Jianyun Lin2, Qiang Liu3.
Abstract
The openly released and measured data from automatic hydrological and water quality stations in China provide strong data support for water environmental protection management and scientific research. However, current public data on hydrology and water quality only provide real-time data through data tables in a shared page. To excavate the supporting effect of these data on water environmental protection, this paper designs a water-quality-prediction and pollution-risk early-warning system. In this system, crawler technology was used for data collection from public real-time data. Additionally, a modified long short-term memory (LSTM) was adopted to predict the water quality and provide an early warning for pollution risks. According to geographic information technology, this system can show the process of spatial and temporal variations of hydrology and water quality in China. At the same time, the current and future water quality of important monitoring sites can be quickly evaluated and predicted, together with the pollution-risk early warning. The data collected and the water-quality-prediction technique in the system can be shared and used for supporting hydrology and in water quality research and management.Entities:
Keywords: LSTM; machine learning; pollution risk; water quality evaluation; water-quality early-warning system; web crawler
Mesh:
Year: 2022 PMID: 36142084 PMCID: PMC9517095 DOI: 10.3390/ijerph191811818
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Figure 1The philosophy of water-quality-prediction and early-warning system.
The functions of tables used in the database.
| ID | Tables in the Database | Functions |
|---|---|---|
| 1 | Hydrology and water quality station information table | Store the names and geographical coordinates of hydrology and water quality stations in China |
| 2 | Hydrology station table | Store the names of the hydrology sites crawled |
| 3 | Water quality site table | Store the names of the water quality sites crawled |
| 4 | Hydrological data table | Store the hydrological data crawled |
| 5 | Water quality data table | Store the water quality data crawled |
Figure 2Process of data collecting by web crawler used in the system.
Figure 3Model construction process of water quality prediction based on LSTM.
Figure 4Structure of the LSTM model.
Figure 5The submodules and functions designed in the client.
Figure 6Distribution of hydrological monitoring stations (a) and water-quality-monitoring stations (b).
Features of the obtained data in the system.
| Category | Variable | Indexes Name | Time Range | Units |
|---|---|---|---|---|
| Hydrological monitoring data | WL | Water level | Since 2005 | m |
| WWL | Warning water level | m | ||
| Q | Flow | m³/s | ||
| Water-quality-monitoring data | Tub | Turbidity of water | Since 2000 | NTU |
| CODMN | Permanganate index | mg/L | ||
|
| Ammonia nitrogen index | mg/L | ||
| TP | Total phosphorus index | mg/L | ||
| TN | Total nitrogen index | mg/L | ||
| PH | Pondus Hydrogenii | - | ||
| CHL | Chlorophyll α content | mg/L | ||
| CA | Algal density of water | cells/L | ||
| BOD5 | Biochemical oxygen demand | mg/L | ||
| DO | Dissolved oxygen | mg/L | ||
| WT | Water temperature | °C | ||
| WQC | Water quality classification | - |
Figure 7Part of the hydrological monitoring data at Taihu station (a) and water quality of North Moat station (b).
Figure 8Distribution of cross-correlation (CC) values between predicted results and measured values for each water quality indicator predicted results in the the validation set (a) and test set (b). The lower boundary of the box represents the minimum value. The upper boundary of the box represents the maximum value. The red dotted lines represent the threshold of acceptable model performance (CC = 0.65).
Figure 9Functions of the water environment data sharing and analysis system ((a): overview of water-environment-monitoring data, (b): water quality analysis). Level VI of water quality means the water quality is worse than level V.
Figure 10The water quality of pollution waring of prediction results.