| Literature DB >> 30939722 |
Beom-Su Kim1, Ki-Il Kim2, Babar Shah3, Francis Chow4, Kyong Hoon Kim5.
Abstract
Before discovering meaningful knowledge from big data systems, it is first necessary to build a data-gathering infrastructure. Among many feasible data sources, wireless sensor networks (WSNs) are rich big data sources: a large amount of data is generated by various sensor nodes in large-scale networks. However, unlike typical wireless networks, WSNs have serious deficiencies in terms of data reliability and communication owing to the limited capabilities of the nodes. Moreover, a considerable amount of sensed data are of no interest, meaningless, and redundant when a large number of sensor nodes is densely deployed. Many studies address the existing problems and propose methods to overcome the limitations when constructing big data systems with WSN. However, a published paper that provides deep insight into this research area remains lacking. To address this gap in the literature, we present a comprehensive survey that investigates state-of-the-art research work on introducing WSN in big data systems. Potential applications and technical challenges of networks and infrastructure are presented and explained in accordance with the research areas and objectives. Finally, open issues are presented to discuss promising directions for further research.Entities:
Keywords: big data; data processing; infrastructure; wireless sensor networks
Year: 2019 PMID: 30939722 PMCID: PMC6480280 DOI: 10.3390/s19071565
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Big data system based on a WSN.
Figure 2Protocol layering of wireless big data systems.
Figure 3Typical activity recognition process for inferring activities from WBAN sensors.
Figure 4Classification of related work.
Figure 5Architecture of SVC4WSN.
Comparison of advantages and disadvantages for infrastructure.
| Method | Key Feature | Advantage | Disadvantage |
|---|---|---|---|
| SWV4WSN [ | Multi-layer model | Good lifecycle | High management cost |
| [ | K-medoid clustering | High energy efficiency | High maintenance cost |
| [ | Mobile sinks | Good for network partition | High energy consumption in few clusters |
| [ | QoS provisioning | Balanced energy consumption | High complexity |
| [ | Name and in-network caching | High energy efficiency | Low energy consumption in many clusters |
| [ | Spatial correlation | High energy efficiency | No consideration for temporal correlation |
Comparison of advantages and disadvantages of data systems.
| Method | Key Feature | Advantage | Disadvantage |
|---|---|---|---|
| [ | Framework based on Hadoop and Storm | Open-source based implementation | Low reliability |
| [ | Distributed data-centric storage | High energy efficiency | Deployment issues in real-world scenarios |
| [ | Three-tier data mining | Urgent response | Experiments in few scenarios |
Comparison of advantages and disadvantage for data collection.
| Method | Key Feature | Advantage | Disadvantage |
|---|---|---|---|
| CADAMULE [ | Situation and event awareness | Cost-efficient data collection | Simple weight based computation |
| [ | M-mobile collector | High energy efficiency | Few scenarios for simulation |
| [ | MDCP | High energy efficiency | Assumption of infinite storage memory |
| [ | Mobile collector | High energy efficiency | Flooding based operation |
| [ | Local data collector | Reduced data-gathering latency | Too much dependency on threshold value |
Figure 6Overview of fog computing.
Summary of open issues.
| Research Area | Current Research Challenges | Further Trends |
|---|---|---|
| Networks Architecture | Static/mobile sink, aggregation | Energy efficiency, Specific protocols for applications, 5G Communications |
| Framework | Single platform for WSN | Integrated platform for M2M, P2P, and IoT |
| Real-time | QoS protocol | Fog/Edge computing, Middleware |