| Literature DB >> 35990916 |
Abstract
This dataset was collected as part of the InWaterSense project with a wireless sensor network (WSN) installed in a site in river Sitnica in Kosovo, as a case study for monitoring remotely, continuously and in real-time the surface water quality in Kosovo and how it can be extended to all surface waters in the country for quality assurance. Values of four water quality parameters are provided in the dataset, i.e., temperature, electrical conductivity, pH, and dissolved oxygen measured by respective static sensors of WSN in the time frame between May 2015 to beginning of January 2016 and every 10 min, counting to slightly over 100k measurement records in total. The dataset is hosted at the Mendeley Data repository (Ahmedi and Ahmedi 2021), and is related to the research article entitled "InWaterSense: An Intelligent Wireless Sensor Network for Monitoring Surface Water Quality to a River in Kosovo" (Ahmedi et al., 2018). The reuse potential of the dataset to the scientific community is widespread, from environmental engineering to artificial intelligence to the health sector just to mention few. Moreover, practitioners might benefit from this dataset in driving forth the pollution prevention policies and techniques. Data were acquired measuring water quality using static sensors installed as part of a wireless sensor network in Sitnica river in the Plemetin village near Prishtina, then transmitted to the gateway device also in Plemetin via the ZigBee protocol, and finally transmitted remotely via GPRS to the server machine in the premises of the University of Prishtina. The data received from sensors are in real-time stored in the MS SQL server.Entities:
Keywords: Data engineering and data analysis; Surface water; Water quality monitoring; Wireless sensor networks
Year: 2022 PMID: 35990916 PMCID: PMC9382137 DOI: 10.1016/j.dib.2022.108486
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Coordinates of the sensing nodes 1 and 2 deployed in the Sitnica river bank.
| Node Id | Longitude | Latitude |
|---|---|---|
| 1 | 21.03843117 | 42.70670319 |
| 2 | 21.03802872 | 42.70727921 |
An excerpt of raw measurement data.
| Id | Node Id | Timestamp | Parameter | Value |
| 1744 | 1 | 201505311809110000 | pH | 7.92 |
| 1745 | 1 | 201505311809110000 | DissolvedOxygen | 45.4 |
| 1746 | 1 | 201505311809110000 | Conductivity | 336.9 |
| 1747 | 1 | 201505311809110000 | Temperature | 16.43 |
| 80518 | 2 | 201505311858100000 | pH | 8.11 |
| 80519 | 2 | 201505311858100000 | DissolvedOxygen | 13.1 |
| 80520 | 2 | 201505311858100000 | Conductivity | 313.6 |
| 80521 | 2 | 201505311858100000 | Temperature | 16.61 |
Basic statistics of raw measurement data.
| #measurements | #measurements per node | #measurements per parameter |
|---|---|---|
| 105453 | Node 1 (housing): 78378 | Conductivity: 26365 |
| Node 2 (manhole): 27075 | DissolvedOxygen: 26364 | |
| Temperature: 26364 | ||
| pH: 26360 |
An excerpt of measurement data grouped by Timestamp and Node Id, and sorted by Node Id and Timestamp.
| Id | Node Id | Timestamp | Timestamp as DateTime | Temperature | Conductivity | pH | DissolvedOxygen |
|---|---|---|---|---|---|---|---|
| 17667 | 1 | 201512080246360000 | 2015/12/08 02:46:36 0000 | 6.29 | 274.8 | 6.11 | 92.3 |
| 17668 | 1 | 201512080256480000 | 2015/12/08 02:56:48 0000 | 6.13 | 265.7 | 6.11 | 93 |
| 17669 | 1 | 201512080306580000 | 2015/12/08 03:06:58 0000 | 6.24 | 276.2 | 6.12 | 92.4 |
| 17670 | 1 | 201512080317080000 | 2015/12/08 03:17:08 0000 | 6.29 | 275.8 | 6.13 | 91.5 |
| 25799 | 2 | 201506170132450000 | 2015/06/17 01:32:45 0000 | 20.08 | 392.1 | 6.94 | 39.2 |
| 25800 | 2 | 201506170142550000 | 2015/06/17 01:42:55 0000 | 20.04 | 390.8 | 6.71 | 39.8 |
| 25801 | 2 | 201506170153060000 | 2015/06/17 01:53:06 0000 | 20.03 | 392.1 | 6.96 | 38.8 |
| 25802 | 2 | 201506170203170000 | 2015/06/17 02:03:17 0000 | 20.05 | 393.3 | 6.96 | 40 |
Basic statistics of measurement data.
| Id | Node Id | Timestamp | Temperature | Conductivity | pH | DissolvedOxygen | |
|---|---|---|---|---|---|---|---|
| 29842 | 29842 | 29842 | 26364 | 26365 | 26360 | 26364 | |
| - | - | - | 14.45 | 295.76 | 9.66 | 35.26 | |
| - | - | - | 8.26 | 1867.65 | 16.94 | 36.17 | |
| - | - | - | -0.35 | -145212.5 | -28 | 0 | |
| - | - | - | 9.3 | 275 | 5.48 | 9.5 | |
| - | - | - | 15.6 | 324.7 | 6.31 | 19.3 | |
| - | - | - | 18.53 | 388.9 | 7.63 | 49.6 | |
| - | - | - | 115.42 | 26378.7 | 94.58 | 154.6 |
Fig. 1Histograms of individual water quality parameters.
Missing data across columns.
| column name | #missing values |
|---|---|
| Node Id | 0 |
| Timestamp | 0 |
| Timestamp as DateTime | 0 |
| Temperature | 3478 |
| Conductivity | 3477 |
| pH | 3482 |
| DissolvedOxygen | 3478 |
Fig. 2Missing values’ matrix sorted by Timestamp.
Missing data summary statistics.
| total #rows | all 4 parameters are not null | any of 4 parameters is null | all 4 parameters are null |
|---|---|---|---|
| 29842 | 25200 | 4642 | 0 |
An excerpt of measurement data with null values.
| 19165 | 1 | 201512222016418000 | 2015/12/22 20:16:41 8000 | NULL | NULL | 6.25 | NULL |
| 19166 | 1 | 201512222016431000 | 2015/12/22 20:16:43 1000 | NULL | NULL | NULL | 4.4 |
| 19167 | 1 | 201512222016440000 | 2015/12/22 20:16:44 0000 | NULL | 278.3 | NULL | NULL |
| 19168 | 1 | 201512222016481000 | 2015/12/22 20:16:48 1000 | 5.3 | NULL | NULL | NULL |
An excerpt of measurement data with null values and the Timestamp rounded to minutes.
| 19165 | 1 | 201512222016 | 2015/12/22 20:16 | NULL | NULL | 6.25 | NULL |
| 19166 | 1 | 201512222016 | 2015/12/22 20:16 | NULL | NULL | NULL | 4.4 |
| 19167 | 1 | 201512222016 | 2015/12/22 20:16 | NULL | 278.3 | NULL | NULL |
| 19168 | 1 | 201512222016 | 2015/12/22 20:16 | 5.3 | NULL | NULL | NULL |
An excerpt of measurement data with no null values once the Timestamp rounded to minutes.
| 19165 | 1 | 201512222016 | 2015/12/22 20:16 | 5.3 | 278.3 | 6.25 | 4.4 |
Fig. 3A static sensing node in Sitnica river comprised of wireless sensors (middle in the underwater). Adapted from [2].
| Subject | Environmental Engineering |
| Specific subject area | Water Quality Monitoring |
| Type of data | Tables in csv and Excel files. Code in SQL script |
| How data were acquired | Data were acquired measuring water quality using static sensors installed as part of a wireless sensor network in Sitnica river in the Plemetin village near Prishtina, then transmitted to the gateway device also in Plemetin via the ZigBee protocol, and finally transmitted remotely via GPRS to the server machine in the premises of the University of Prishtina. The data received from sensors are in real-time stored in the MS SQL server. The WSN technology used is commercial by Libelium vendor and based on the Waspmote Plug and Sense model, an open source wireless sensor network platform: |
| Data format | Raw and filtered data from sensors |
| Description of data collection | The river water quality parameters measured remotely by the WSN static sensors are: temperature, electrical conductivity, pH, and dissolved oxygen. |
| Data source location | SItnica river bank, Plemetin village, Municipality of Obiliq, close to capital city Prishtina, Republic of Kosova. Measurements were conducted in two points in the river: sensing node 1 (housing) and sensing node 2 (manhole) in a distance of around 100 m from each other. The GPS coordinates (longitude, latitude) of each of the two sensing nodes are: Node 1 coordinates (21.03843117, 42.70670319), Node 2 coordinates (21.03802872, 42.70727921). |
| Data accessibility | Repository name: Mendeley Data. Data identification number: |
| Related research article | F. Ahmedi, L. Ahmedi, B. O'Flynn, A. Kurti, S. Tahirsylaj, E. Bytyçi, B. Sejdiu, A. Salihu, InWaterSense: An Intelligent Wireless Sensor Network for Monitoring Surface Water Quality to a River in Kosovo, Int. J. of Agric. and Environ. Inf. Syst. 9(1) (2018) 39-61. |