| Literature DB >> 29809172 |
Amir Aryani1, Marta Poblet2, Kathryn Unsworth3, Jingbo Wang4, Ben Evans4, Anusuriya Devaraju5, Brigitte Hausstein6, Claus-Peter Klas6, Benjamin Zapilko6, Samuele Kaplun7.
Abstract
This paper describes the open access graph dataset that shows the connections between Dryad, CERN, ANDS and other international data repositories to publications and grants across multiple research data infrastructures. The graph dataset was created using the Research Graph data model and the Research Data Switchboard (RD-Switchboard), a collaborative project by the Research Data Alliance DDRI Working Group (DDRI WG) with the aim to discover and connect the related research datasets based on publication co-authorship or jointly funded grants. The graph dataset allows researchers to trace and follow the paths to understanding a body of work. By mapping the links between research datasets and related resources, the graph dataset improves both their discovery and visibility, while avoiding duplicate efforts in data creation. Ultimately, the linked datasets may spur novel ideas, facilitate reproducibility and re-use in new applications, stimulate combinatorial creativity, and foster collaborations across institutions.Entities:
Year: 2018 PMID: 29809172 PMCID: PMC5972674 DOI: 10.1038/sdata.2018.99
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1Research Graph meta model.
The graph consists of connections between research datasets, publications, grants and researchers.
Figure 2Folder structure.
The schematic diagram of data files.
Node types.
| Number of nodes for each type. | ||
|---|---|---|
| Dataset | 144,354 | 66,371 with DOI |
| Publication | 2,795,585 | 1,974,812 with DOI |
| Researcher | 1,084,094 | 1,049,424 with ORCID |
| Grants | 55,173 | 55,173 with PURL |
Figure 3Publication trend.
The logarithm of the number of publication and dataset accumulated each year.
Figure 4Data sources and data types.
(a) Publication, (b) Researcher, (c) Dataset, and (d) Grant.
Connections between node types.
| This is an undirected graph, i.e. relations are bidirectional. | ||||
|---|---|---|---|---|
| Dataset | 49,660 | 12,999 | 50,772 | 11,427 |
| Publication | 12,999 | 3,946 | 2,609,510 | 1 |
| Researcher | 50,772 | 2,609,510 | 213 | 58,804 |
| Grant | 11,427 | 1 | 58,804 | 397 |
Figure 5ANDS graph visualisation in Gephi.
The bright blue cluster contains the grants and related records from National Health and Medical Research Council, the datasets from GeoScience Australia are highlighted by light blue, and Australian Ocean Data Network datasets are the cluster in bright orange.