| Literature DB >> 30395209 |
YoSon Park1, Casey S Greene1,2.
Abstract
Data generation is expensive in terms of both time and money. Sharing data enables the rapid replication, validation, and application of discoveries, increasing the pace and accuracy of research. As research parasites, or users of other people's data, we recognize that a strong science ecosystem requires that those who share best be recognized. We find that widely accessible benchmark datasets have provided outsized benefits, and we hope that the benefits of sharing will also begin to accrue to individual investigators who share well. Funders can enhance progress by adjusting incentives to better support data sharers, which will make their programmatic investments more effective. We note some funders who are making such efforts.Entities:
Mesh:
Year: 2018 PMID: 30395209 PMCID: PMC6258825 DOI: 10.1093/gigascience/giy129
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:Citations for publicly shared data between 2008 and the present. For four main public data repositories, the Cancer Genome Atlas (TCGA), Genotype-Tissue Expression Project (GTEx), Encyclopedia of DNA Elements (ENCODE), and UK Biobank, we estimated the number of publications citing each dataset on PubMed [8]. For ENCODE and ENCODE_Community publications, the ENCODE official website [9] was used in place of PubMed for searches up to its finalization in 2016. For the past decade, we estimated that approximately 6,516 publications benefited from these public resources.