| Literature DB >> 35408347 |
David Sarramia1, Alexandre Claude1, Francis Ogereau2, Jérémy Mezhoud2, Gilles Mailhot3.
Abstract
This article presents a platform for environmental data named "Environmental Cloud for the Benefit of Agriculture" (CEBA). The CEBA should fill the gap of a regional institutional platform to share, search, store and visualize heterogeneous scientific data related to the environment and agricultural researches. One of the main features of this tool is its ease of use and the accessibility of all types of data. To answer the question of data description, a scientific consensus has been established around the qualification of data with at least the information "when" (time), "where" (geographical coordinates) and "what" (metadata). The development of an on-premise solution using the data lake concept to provide a cloud service for end-users with institutional authentication and for open data access has been completed. Compared to other platforms, CEBA fully supports the management of geographic coordinates at every stage of data management. A comprehensive JavaScript Objet Notation (JSON) architecture has been designed, among other things, to facilitate multi-stage data enrichment. Data from the wireless network are queried and accessed in near real-time, using a distributed JSON-based search engine.Entities:
Keywords: data lake; data management; data visualization; environmental sensors; indexes; internet of things
Mesh:
Year: 2022 PMID: 35408347 PMCID: PMC9003009 DOI: 10.3390/s22072733
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Geographical area and sites covered by the CEBA.
Figure 2Global architecture of CEBA.
Figure 3Global functionalities of CEBA (in blue IoT features, in orange web features and in green common features).
Figure 4Schematic representation of generic ingestion pipeline.
Figure 5Dataflow for sensors database ingestion.
Figure 6Schema of the database.
Figure 7Converting and enrichment flow.
Figure 8Stored query example.
Figure 9Flows between website and data catalog.
Figure 10Extract of the dashboard of weather station at Mt. Etna (courtesy of LPC and LMV).
Figure 11Extract of the dashboard of one system—courtesy of Aydat Observatory.
Figure 12Extract of the dashboard of multiple independent sources—courtesy of Aydat Observatory.
Table describing the number of records and the size in the database.
| Project | Records | Size (in GB) |
|---|---|---|
| ConnecSens | 3.0 M | 3.8 |
| Aydat | 0.4 M | 0.6 |
| Etna | 0.7 M | 0.8 |
| ZATU | 0.3 M | 0.3 |
| Total | >5 M | >6 |
Table describing the number of records and the size in elastic cluster.
| Project | Records | Indexe Size (in GB) | Indexes |
|---|---|---|---|
| ConnecSens | 2.4 M | 2.2 | 302 |
| Aydat | 0.7 M | 0.7 | 62 |
| Etna | 0.6 M | 0.56 | 78 |
| ZATU | 0.4 M | 0.4 | 37 |
| Total | >4.5 M | 3 | >500 |