| Literature DB >> 34069927 |
Hrvoje Kalinić1, Zvonimir Bilokapić1, Frano Matić2.
Abstract
The experiments conducted on the wind data provided by the European Centre for Medium-range Weather Forecasts show that 1% of the data is sufficient to reconstruct the other 99% with an average amplitude error of less than 0.5 m/s and an average angular error of less than 5 degrees. In a nutshell, our method provides an approach where a portion of the data is used as a proxy to estimate the measurements over the entire domain based only on a few measurements. In our study, we compare several machine learning techniques, namely: linear regression, K-nearest neighbours, decision trees and a neural network, and investigate the impact of sensor placement on the quality of the reconstruction. While methods provide comparable results the results show that sensor placement plays an important role. Thus, we propose that intelligent location selection for sensor placement can be done using k-means, and show that this indeed leads to increase in accuracy as compared to random sensor placement.Entities:
Keywords: data reconstruction; interpolation; machine learning; missing data; neural networks; reanalisys; spatio/temporal resolution
Year: 2021 PMID: 34069927 PMCID: PMC8157581 DOI: 10.3390/s21103507
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Adriatic sea—the geographic area covered in the study with marked selected zones: (A) North Adriatic, (B) Middle Adriatic and (C) Ionian Sea. Color bar shows wet point index defined by the ERAInterim land-sea mask.
Figure 2Adriatic sea—Panels show: (A) Amplitude average. (B) Angle average. (C) Amplitude variance. (D) Angle variance.
Figure 3Sampling scenarios used in experiments (a–g): (a) 50% samples selected in successive order. (b) 50% samples selected randomly. (c) 10% samples selected randomly. (d) 5% samples selected randomly. (e) 1% samples selected randomly. (f) 10 samples selected by the k-means algorithm. (g) 5 samples selected by the k-means algorithm.
Sampling scenarios used in experiments (a) to (g).
| Experiment | Sampling Model | # of Sensors |
|---|---|---|
| (a) | pre order | 50% (1105 sensors) |
| (b) | random | 50% (1105 sensors) |
| (c) | random | 10% (110 sensors) |
| (d) | random | 5% (55 sensors) |
| (e) | random | 1% (10 sensors) |
| (f) | k-means | - 10 sensors |
| (g) | k-means | - 5 sensors |
Table containing mean reconstruction error () and its variance () for each of the machine learning model used in each experiment. The error is divided in amplitude and variance error.
| Experiment | Linear | KNN | Extra | Neural | |||||
|---|---|---|---|---|---|---|---|---|---|
| Regression | Trees | Network | |||||||
|
|
|
|
|
|
|
|
| ||
| (a) | Amplitude | 0.52 | 1.00 | 0.91 | 1.42 | 0.76 | 1.23 | 0.73 | 1.17 |
| Angle | 6.28 | 17.03 | 11.43 | 24.40 | 9.49 | 21.34 | 8.79 | 19.88 | |
| (b) | Amplitude | 0.00 | 0.00 | 0.40 | 0.58 | 0.48 | 0.71 | 0.17 | 0.25 |
| Angle | 0.00 | 0.00 | 5.16 | 12.33 | 6.24 | 14.63 | 2.00 | 5.12 | |
| (c) | Amplitude | 0.00 | 0.03 | 0.74 | 0.64 | 0.87 | 0.77 | 0.20 | 0.16 |
| Angle | 0.09 | 0.89 | 9.50 | 15.80 | 11.34 | 18.29 | 2.63 | 5.08 | |
| (d) | Amplitude | 0.03 | 0.14 | 0.79 | 0.66 | 0.92 | 0.76 | 0.27 | 0.24 |
| Angle | 0.40 | 2.72 | 10.20 | 16.47 | 11.89 | 18.55 | 3.17 | 6.32 | |
| (e) | Amplitude | 0.52 | 0.68 | 0.94 | 0.81 | 1.02 | 0.84 | 0.55 | 0.59 |
| Angle | 6.61 | 13.86 | 12.08 | 18.93 | 13.47 | 20.25 | 6.88 | 13.07 | |
| (f) | Amplitude | 0.30 | 0.30 | 0.80 | 0.59 | 0.94 | 0.73 | 0.32 | 0.28 |
| Angle | 4.20 | 9.30 | 10.44 | 16.29 | 12.37 | 18.76 | 4.31 | 8.95 | |
| (g) | Amplitude | 0.72 | 0.60 | 0.88 | 0.66 | 0.97 | 0.73 | 0.61 | 0.52 |
| Angle | 10.42 | 17.44 | 11.89 | 18.61 | 13.07 | 19.74 | 8.75 | 15.65 | |
Figure 4The average amplitude and angular error (subfigures a,c) and their variances (subfigures b,d) for different number of sensors. The gray line indicates experiments (f) and (g).
Figure 5Spatial distribution of amplitude error for randomly distributed 110 sensors—experiment (c)—and for intelligent selection of the 10 sensors-experiment (f).
Figure 6Spatial distribution of amplitude variance for randomly distributed 110 sensors—experiment (c)—and for intelligent selection of the 10 sensors experiment (f).