| Literature DB >> 26886581 |
Steven Van Tuyl1, Amanda L Whitmire2.
Abstract
Sharing of research data has begun to gain traction in many areas of the sciences in the past few years because of changing expectations from the scientific community, funding agencies, and academic journals. National Science Foundation (NSF) requirements for a data management plan (DMP) went into effect in 2011, with the intent of facilitating the dissemination and sharing of research results. Many projects that were funded during 2011 and 2012 should now have implemented the elements of the data management plans required for their grant proposals. In this paper we define 'data sharing' and present a protocol for assessing whether data have been shared and how effective the sharing was. We then evaluate the data sharing practices of researchers funded by the NSF at Oregon State University in two ways: by attempting to discover project-level research data using the associated DMP as a starting point, and by examining data sharing associated with journal articles that acknowledge NSF support. Sharing at both the project level and the journal article level was not carried out in the majority of cases, and when sharing was accomplished, the shared data were often of questionable usability due to access, documentation, and formatting issues. We close the article by offering recommendations for how data producers, journal publishers, data repositories, and funding agencies can facilitate the process of sharing data in a meaningful way.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26886581 PMCID: PMC4757565 DOI: 10.1371/journal.pone.0147942
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Scoring criteria for the effectiveness of data sharing.
| Discoverable | ||
| 0 | No link or indication of data source in the data from the paper OR non-actionable mention of data location (e.g. broken links, mention of source without link) | Data cannot be found OR non-actionable mention of data location (e.g. broken links, mention of database but data in database cannot be directly linked to project) |
| 1 | A reference to the location or source of the data but no specific indication of the data used (e.g. a link to an external database) OR data shared are not all of the data used in the paper | The data can be found at the researcher's home page, a research group page or a project web page OR data shared are not all of the data used in the project |
| 2 | A direct link and/or persistent identifier for the dataset AND data are complete | The dataset can be found via a search at a subject repository or regional or national network, Google, or the DataCite Metadata Search AND data are complete |
| Accessible | ||
| 0 | Data shared through a closed or subscription access platform or accessed by request | |
| 1 | Data shared through a platform that requires some barrier to access such as a requirement to obtain permission to use the data OR data is shared through a closed-access source (e.g. journal, repository) | |
| 2 | Data shared in an open repository or platform or source (e.g. OA Journal, open repository, etc.) | |
| Transparent | ||
| 0 | No documentation provided for the data | |
| 1 | Some documentation provided for the data but lacks clear description of details such as how data were collected, analyzed or processed; description of units or headers; description of blanks; etc. Documentation may include a reference to the methods section of the paper | |
| 2 | Readme file, data dictionary, or other metadata shared with the dataset that provide clear details about the nature and content of the data | |
| Actionable | ||
| 0 | Data are not in a format that is usable in an analysis application (e.g. shared in a PDF or as a Fig) | |
| 1 | Data are in a format usable in an analysis application but are formatted in a way that makes use difficult (e.g. spreadsheets not in regular row-column form). OR data are shared in a proprietary or non-open format (e.g.xls,.doc,.mat,.sas,.shx) | |
| 2 | Data are in an open or non proprietary format (e.g.csv,.xlsx,.txt, etc.) with usable formatting | |
Fig 1Total DATA scores from 25 NSF-funded projects, as located via data management plans
Non-zero DATA scores from 25 NSF funded projects, with element scores shown.
| DATA Score | Discoverable | Accessible | Transparent | Actionable |
|---|---|---|---|---|
| 2 | 2 | 0 | 0 | 0 |
| 5 | 2 | 2 | 1 | 0 |
| 6 | 1 | 2 | 2 | 1 |
| 7 | 1 | 2 | 2 | 2 |
| 7 | 1 | 2 | 2 | 2 |
| 8 | 2 | 2 | 2 | 2 |
Fig 2Element-wise DATA scores from 25 NSF-funded projects, as located via data management plans.
Fig 3Total DATA scores from 104 NSF-funded projects, as located via journal articles.
Fig 4Element-wise DATA scores from 104 NSF-funded projects, as located via journal articles.