| Literature DB >> 34366573 |
Ana I Torre-Bastida1, Josu Díaz-de-Arcaya1, Eneko Osaba1, Khan Muhammad2, David Camacho3, Javier Del Ser4.
Abstract
This overview gravitates on research achievements that have recently emerged from the confluence between Big Data technologies and bio-inspired computation. A manifold of reasons can be identified for the profitable synergy between these two paradigms, all rooted on the adaptability, intelligence and robustness that biologically inspired principles can provide to technologies aimed to manage, retrieve, fuse and process Big Data efficiently. We delve into this research field by first analyzing in depth the existing literature, with a focus on advances reported in the last few years. This prior literature analysis is complemented by an identification of the new trends and open challenges in Big Data that remain unsolved to date, and that can be effectively addressed by bio-inspired algorithms. As a second contribution, this work elaborates on how bio-inspired algorithms need to be adapted for their use in a Big Data context, in which data fusion becomes crucial as a previous step to allow processing and mining several and potentially heterogeneous data sources. This analysis allows exploring and comparing the scope and efficiency of existing approaches across different problems and domains, with the purpose of identifying new potential applications and research niches. Finally, this survey highlights open issues that remain unsolved to date in this research avenue, alongside a prescription of recommendations for future research.Entities:
Keywords: Big data; Bio-inspired computation; Data fusion; Evolutionary computation; Fuzzy logic; Neural networks; Swarm intelligence
Year: 2021 PMID: 34366573 PMCID: PMC8329000 DOI: 10.1007/s00521-021-06332-9
Source DB: PubMed Journal: Neural Comput Appl ISSN: 0941-0643 Impact factor: 5.606
Fig. 1Big Data dimensions associated with typical problems liable to be solved by bio-inspired computation. Nodes colored in blue correspond to computational tasks, whereas those colored in light brown indicate specific applications where the Big Data dimension indicated in their parent nodes are particularly relevant. Computational requirements enabled by bio-inspired computation are indicated in the gray box set on the background
Fig. 2Phases of the big data life cycle
Fig. 3Conceptual diagram showing the three tasks that can be tackled with Computational Intelligence
Relationship between the four families of bio-inspired computation approaches, typical problems and dimensions in the Big Data context
| Family | Task | Big data problems | Dimensions |
|---|---|---|---|
| Neural network | Modeling Simulation (clustering) | Server load forecasting Intrusion detection Data compression Predictive maintenance Manifold learning | Volume Variety |
EC SI | Optimization | Task scheduling Design of data filters Resource allocation Server placement Database sizing | Variability Velocity |
| Fuzzy systems | Modeling Simulation (clustering) | Predictive control Multi-criteria decision making Fuzzy preprocessing rules | Volume Veracity |
Fig. 4Diagram depicting the differences between a EC and b SI
Recent overviews on Big Data connected to bio-inspired computation, and their comparison to this work
| Survey | Period | # Reviewed works | Literature taxonomy (classification criterion) | Big data dimension and lifecycle | Families of bio-inspired computing approaches | Critical analysis, trends and challenges |
|---|---|---|---|---|---|---|
| [ | 2002–2018 | 83 | Yes (uncertainty modeling/mitigation approach) | Big data collection, fusion and processing | Evolutionary algorithms, artificial neural networks and Fuzzy Logic | Real-time machine learning and meta-heuristic algorithms for mitigating uncertainty |
| [ | 2001–2019 | 96 | No | Big data processing and learning | Fuzzy logic, Evolutionary algorithms and artificial neural networks | Development of modern smart cities |
| [ | 2014–2019 | 35 | Yes (family of bio-inspired algorithms) | Big data Processing and Learning | Evolutionary, swarm-based and ecological algorithms | Containers, serverless computing, blockchain, software-defined clouds and quantum computing |
| [ | 2014–2019 | 116 | Yes (Big data analytics for Industrial IoT) | Big data collection, fusion and processing | No specific focus on algorithmic solutions | Security, privacy and concentric computing |
| This work | 2010–2020 | 231 | Yes (Infrastructure management, Big data technologies, Big data phases) | Big data fusion, storage, processing, learning and visualization | Evolutionary computation, swarm intelligence, neural networks and fuzzy systems | Lack of reference problems, unrealistic use cases, algorithmic novelty, operationalization, real-time Big data, XAI and security (Sect. |
Fig. 5Taxonomy of works related to the application of bio-inspired computation to the big data domain, classified as per the different application areas under consideration
Fig. 6Application areas of bio-inspired computation for each Big Data life cycle phase
Fig. 7Challenges envisioned in the crossroads between bio-inspired computation and Big Data
Fig. 8Yearly publications retrieved from Scopus by submitting the queries indicated in the legend (as per June 1, 2021). The vertical axis is in logarithmic scale