| Literature DB >> 31295891 |
Jaewon Moon1, Seungwoo Kum1, Sangwon Lee2.
Abstract
The edge platform has evolved to become a part of a distributed computing environment. While typical edges do not have enough processing power to train machine learning models in real time, it is common to generate models in the cloud for use on the edge. The pattern of heterogeneous Internet of Things (IoT) data is dependent on individual circumstances. It is not easy to guarantee prediction performance when a monolithic model is used without considering the spatial characteristics of the space generating those data. In this paper, we propose a collaborative framework using a new method to select the best model for the edge from candidate models of cloud based on sample data correlation. This method lets the edge use the most suitable model without any training tasks on the edge side, and it also minimizes privacy issues. We apply the proposed method to predict future fine particulate matter concentration in an individual space. The results suggest that our method can provide better performance than the previous method.Entities:
Keywords: collaborative analysis; decentralized analysis architecture; edge computing; heterogeneous IoT data analysis
Year: 2019 PMID: 31295891 PMCID: PMC6678631 DOI: 10.3390/s19143038
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Advantage vs. disadvantage of cloud and edge platforms in terms of data analysis service.
| Advantage | Disadvantage | |
|---|---|---|
|
|
Powerful processing resources Computational efficiency Easy to use Cost savings Efficient management Massive storage Wide-area coverage |
Privacy concerns High bandwidth costs Network connection dependent Latency problems Service provider dependent |
|
|
Real-time data handling User-centric process High mobility Privacy protection Network connection independence High QoS |
Low processing power Insufficient storage Different specification Only local-area coverage |
Figure 1Generalized system model of collaborative edge-cloud processing.
Figure 2Proposed framework structure.
Figure 3Simple model selection flow to find the most proper model.
Figure 4The dataflow of the use case.
Figure 5The diagram for an LSTM cell at time T.
Specifications of six sensor modules used for air quality measurement.
| Factor | Unit | Data Range | Error Range |
|---|---|---|---|
| CO₂ | ppm | 0–10,000 ppm | ±20 ppm ±3% |
| VOCs | ppb | 125–3500 ppb | ±10–30% (123–3500) |
| PM10 | μg/m3 | 0–500 | ±10% |
| PM2.5 | μg/m3 | 0–500 | ±10% |
| Temperature | °C | −40–125 | ±0.3 °C (20–40 °C) |
| Humidity | % | 0–100 | ±2.0% (20–80%) |
| Noise | dB | 30–90 | ±5 dB |
Data distribution information of indoor and outdoor environmental factors.
| Indoor | Outdoor | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| CO₂ | VOCs | Noise | PM10 | PM2.5 | Temperature | Humidity | PM10 | PM2.5 | Temperature | Humidity | |
|
| 735.89 | 322.19 | 55.64 | 41.12 | 29.34 | 22.14 | 36.70 | 53.92 | 31.89 | 10.28 | 59.66 |
|
| 458.15 | 266.56 | 12.25 | 36.23 | 24.79 | 2.51 | 8.93 | 42.81 | 28.31 | 5.41 | 22.02 |
|
| 1.95 | 125.00 | 32.00 | 5.00 | 3.00 | 11.09 | 10.80 | 3.00 | 1.00 | 0.10 | 12.31 |
|
| 491.56 | 171.41 | 44.09 | 15.78 | 11.27 | 20.75 | 31.00 | 22.00 | 10.00 | 6.30 | 42.50 |
|
| 617.64 | 247.06 | 58.00 | 23.30 | 16.77 | 22.50 | 37.00 | 50.00 | 25.00 | 9.80 | 58.42 |
|
| 866.24 | 360.90 | 66.09 | 60.13 | 44.00 | 23.75 | 43.00 | 75.00 | 44.00 | 14.60 | 78.76 |
|
| 5000.0 | 2000.0 | 81.27 | 500.0 | 352.7 | 32.31 | 74.20 | 380.00 | 138.00 | 23.70 | 97.90 |
Test data results using each model with the first scenario.
| PM10 | PM2.5 | |||
|---|---|---|---|---|
| ID | Grade Accuracy | RMSE | Grade Accuracy | RMSE |
| id_000 | 0.937 | 9.147 | 0.888 | 6.626 |
| id_001 | 0.993 | 5.587 | 0.988 | 3.649 |
| id_002 | 0.967 | 10.103 | 0.927 | 6.172 |
|
|
|
|
|
|
| id_004 | 1.000 | 2.267 | 0.995 | 1.973 |
| id_005 | 0.974 | 9.801 | 0.965 | 6.491 |
| id_006 | 0.934 | 12.676 | 0.902 | 8.620 |
| id_007 | 0.995 | 5.760 | 0.991 | 3.541 |
| id_008 | 0.988 | 9.129 | 0.967 | 5.829 |
| id_009 | 0.953 | 12.793 | 0.829 | 9.802 |
| id_010 | 0.895 | 9.546 | 0.918 | 6.993 |
| id_011 | 0.965 | 12.342 | 0.953 | 8.461 |
| id_012 | 0.972 | 9.467 | 0.895 | 6.033 |
| id_013 | 1.000 | 2.100 | 0.993 | 1.733 |
| id_014 | 0.974 | 8.168 | 0.977 | 5.851 |
| id_015 | 0.974 | 9.726 | 0.906 | 6.846 |
| id_016 | 0.986 | 8.465 | 0.958 | 6.602 |
| id_017 | 0.953 | 9.142 | 0.951 | 6.579 |
| id_018 | 1.000 | 2.389 | 1.000 | 1.825 |
| id_019 | 0.965 | 7.658 | 0.930 | 5.259 |
| id_020 | 0.965 | 10.935 | 0.871 | 6.947 |
| id_021 | 1.000 | 2.526 | 1.000 | 1.893 |
| id_022 | 0.956 | 15.950 | 0.831 | 10.282 |
| id_023 | 0.995 | 3.651 | 0.991 | 2.492 |
| id_024 | 0.981 | 8.558 | 0.899 | 6.708 |
| id_025 | 0.974 | 11.595 | 0.946 | 7.740 |
| id_026 | 0.958 | 8.216 | 0.951 | 6.257 |
| id_027 | 0.911 | 10.295 | 0.881 | 7.781 |
| id_028 | 0.981 | 6.788 | 0.918 | 5.046 |
Prediction results of three scenarios.
| Scenario 1 | Scenario 2 | Scenario 3 | |||||
|---|---|---|---|---|---|---|---|
| PM10 | PM2.5 | PM10 | PM2.5 | PM10 | PM2.5 | ||
|
| Average RMSE | 27.11 | 18.93 |
|
| 10.15 | 7.20 |
| Average grade accuracy | 0.80 | 0.73 |
|
| 70.94 | 0.78 | |
|
| Average RMSE | 8.60 | 6.01 |
|
| 23.42 | 16.65 |
| Average grade accuracy | 0.94 | 0.94 |
|
| 0.82 | 0.92 | |
PM10 and PM2.5 prediction results based on the correlation ranking.
| Scenario1 | Scenario 2 | Scenario 3 | ||||||
|---|---|---|---|---|---|---|---|---|
| Average Correlation | Average RMSE | Average Grade Accuracy | Average RMSE | Average Accuracy | Average RMSE | Average Grade Accuracy | ||
|
| Low correlation group |
| 19.77 | 0.86 | 18.46 | 0.89 | 19.52 | 0.87 |
| Middle correlation group |
| 10.61 | 0.93 | 9.13 | 0.94 | 12.68 | 0.92 | |
| High correlation group |
|
|
|
|
|
|
| |
|
| Low correlation group |
| 14.39 | 0.79 | 12.27 | 0.83 | 13.36 | 0.82 |
| Middle correlation group |
| 7.50 | 0.92 | 6.50 | 0.92 | 9.02 | 0.90 | |
| High correlation group |
|
|
|
|
|
|
| |
Figure 6The average RMSE results of the high, middle, and low correlation groups for scenario three. (a) the RMSE averages in the low, middle, and high correlation groups for each space; (b) the average grade accuracies of each group in each space.