| Literature DB >> 30747919 |
Patrick A Curry1, Nils Moosdorf1.
Abstract
The number of online data repositories is growing and they are becoming increasingly difficult to navigate. Data are scattered among different repositories, or hidden on personal or institutional servers. To access data, users must search extensively and rely on site-specific tools. These hurdles substantially inhibit data findability and accessibility; in particular, those in the long tail of data. We developed an open source web application, Spatial Data Hub, that is a geospatial data index, connected to remote Internet sources. It allows simultaneous display and comparison of disparate datasets on a single map. It aims to promote all data equally and provide the flexibility to connect to any storage system, effectively making long-tail datasets as visible as those in large, established repositories. Its low barrier of entry allows scientists and organizations to easily add data throughout the research process; enhancing transparency, openness and reproducibility. This flexibility and functionality makes Spatial Data Hub a novel platform for researchers to promote their work, develop new hypotheses and create new collaborations.Entities:
Year: 2019 PMID: 30747919 PMCID: PMC6371891 DOI: 10.1038/sdata.2019.14
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1Expected effect of the presented application to dataset findability.
Graphic is not to scale.
Comparison of the merits and limitations of Spatial Data Hub along with five other systems.
| Name | Description | Merits | Limitations |
|---|---|---|---|
| Antmaps | A map based front end and visualisation tool for the Global Ant Biodiversity Informatics database | • Open source code base• Impressive user interface• Easy to use | • Front end for a single database• Functionality purely focused on visualisation• Thematic niche focus |
| DataOne | A website that allows member repositories to share data and metadata links on a map background | • Access to a massive amount of data and metadata from many organizations• Provides a lot of dataset metadata including number of citations, views, and downloads• It is a large, well funded organization providing education and support | • Proprietary code base• Required to install and implement specific software• Limited to groups or organizations• Individual data points not visible on the map – links to metadata of datasets |
| Geojson.io | A GIS website providing dataset creation, display, editing and format conversion functionality | • Open source code base• Attractive and intuitive user interface• Highly featured | • Built to work with data, not focused on sharing• Limited support compared to larger programs |
| Nanoos | An interagency data visualisation and access portal focused on Northwestern United States | • Attractive user interface• High quantity data• Many apps that cover a wide variety of topics• Integrates space and time information | • Proprietary code base• Data visualisation for limited number of sources• Limited regional coverage |
| Pangaea | A data publishing repository that puts datasets through quality control process, gives them DOIs and makes them publically available | • Permanent data storage, incl. DOI• Powerful data search function• Quality controlled• Hundreds of thousands of datasets available | • Proprietary code base• Data are manually curated before storage• Focused on single repository |
| Spatialdatahub | An open source web application that simultaneously displays geospatial datasets from disparate Internet sources on a single map | • Open source code base• Easily extensible• Can add and view multiple datasets from different sources | • No centralized repository• Front end developed for a specialized audience• Does not have the support system of the larger programs |
Figure 2Appearance of the application’s demonstrator prototype available at www.spatialdatahub.org.