| Literature DB >> 29979709 |
Jessica L Couture1,2, Rachael E Blake2,3, Gavin McDonald1,4, Colette L Ward2,5.
Abstract
Growth of the open science movement has drawn significant attention to data sharing and availability across the scientific community. In this study, we tested the ability to recover data collected under a particular funder-imposed requirement of public availability. We assessed overall data recovery success, tested whether characteristics of the data or data creator were indicators of recovery success, and identified hurdles to data recovery. Overall the majority of data were not recovered (26% recovery of 315 data projects), a similar result to journal-driven efforts to recover data. Field of research was the most important indicator of recovery success, but neither home agency sector nor age of data were determinants of recovery. While we did not find a relationship between recovery of data and age of data, age did predict whether we could find contact information for the grantee. The main hurdles to data recovery included those associated with communication with the researcher; loss of contact with the data creator accounted for half (50%) of unrecoverable datasets, and unavailability of contact information accounted for 35% of unrecoverable datasets. Overall, our results suggest that funding agencies and journals face similar challenges to enforcement of data requirements. We advocate that funding agencies could improve the availability of the data they fund by dedicating more resources to enforcing compliance with data requirements, providing data-sharing tools and technical support to awardees, and administering stricter consequences for those who ignore data sharing preconditions.Entities:
Mesh:
Year: 2018 PMID: 29979709 PMCID: PMC6034829 DOI: 10.1371/journal.pone.0199789
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Data recovery status definitions.
| Data status | Definition |
|---|---|
| Outreach attempted | Contact information found and outreach attempted via email and/or phone |
| Contact established | At least one reply was received by the archiving team from the target researcher. Confirms contact information was correct |
| Data received | Data was received from the researcher, regardless of data quality or level of documentation |
| Published | Data were received and documented well enough to be archived by the data team or the researcher cooperated in data clean up and documentation |
| Unrecoverable | Data were unrecoverable either due to inability to find contact information or for reasons confirmed by the researcher/data owner |
Hurdles to recovery category definitions.
| Hurdle | Definition |
|---|---|
| No contact info | No contact information found or contact information was never confirmed because outreach attempts received no response |
| Communication lost | At least one reply was received by the archiving team from the target researcher but communication was lost before any data were sent |
| Data lost | PI of other data manager confirmed that the data no longer exist |
| Non digital data | Data exist in a non-digital format (excludes digital PDFs, includes hand-written or typed data) |
| Unwilling to share | PI or other data manager refused to share data |
| Requested funding | PI or other data manager agreed to share data only if additional funding was provided |
Fig 1Final status of all data projects.
Black bars are projects for which data were successfully acquired, grey bars represent projects for which no data were acquired.
Fig 2Data recovery grouped by research field.
Total successful (black) and unsuccessful (grey) data requests by research field. Asterisks indicate significant research fields.
Fig 3Reasons given for not providing data.
Communication loss and lack of contact information were the main reasons data were not obtained.