| Literature DB >> 31343683 |
Jian Liu1, Mo Yang1, Lei Zhang2, Weijun Zhou3.
Abstract
Resource Description Framework (RDF) is widely used for representing biomedical data in practical applications. With the increases of RDF-based applications, there is an emerging requirement of novel architectures to provide effective supports for the future RDF data explosion. Inspired by the success of the new designs in National Center for Biotechnology Information dbSNP (The Single Nucleotide Polymorphism Database) for managing the increasing data volumes using JSON (JavaScript Object Notation), in this paper we present an effective mapping tool that allows data migrations from RDF to JSON for supporting future massive data explosions and releases. We firstly introduce a set of mapping rules, which transform an RDF format into the JSON format, and then present the corresponding transformation algorithm. On this basis, we develop an effective and user-friendly tool called RDF2JSON, which enables automating the process of RDF data extractions and the corresponding JSON data generations.Entities:
Mesh:
Year: 2019 PMID: 31343683 PMCID: PMC6657663 DOI: 10.1093/database/baz088
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1A fragment of an RDF schema.
Figure 2The partial graph model based on the given instance of RDFS.
Figure 3A mapping example of the hierarchy relation of classes.
Figure 4A mapping example of the hierarchy relation of properties.
Figure 5A mapping example of the domain and the range of properties.
Figure 6A mapping instance from RDF to JSON.
Figure 7The mapping algorithm.
Figure 8The framework of RDF2JSON.
Experimental results running on UniprotKB
| File name | The numbers of subjects | RDF size (MB) | JSON size (MB) | |
|---|---|---|---|---|
| R1 | uniprotkb_eukaryota_oxymonadida_66288.rdf | 11 662 | 8.75 | 4.64 |
| R2 | uniprotkb_eukaryota_glaucocystophyceae_38254.rdf | 35 767 | 23.41 | 13.71 |
| R3 | uniprotkb_eukaryota_alveolata_33630_1000000.rdf | 763 169 | 541.61 | 310.40 |
| R4 | uniprotkb_eukaryota_rhizaria_543769.rdf | 1 973 509 | 1307.53 | 785.36 |
| R5 | uniprotkb_eukaryota_parabasalia_5719.rdf | 2 424 903 | 1571.67 | 950.61 |
| R6 | uniprotkb_eukaryota_rhodophyta_2763.rdf | 4 031 384 | 2678.03 | 1583.91 |
| R7 | uniprotkb_eukaryota_opisthokonta_fungi_4751_8000000.rdf | 4 448 979 | 3173.19 | 1815.40 |
Figure 9Occupied storage comparisons.
Figure 10Running times by varying the input size.