| Literature DB >> 29557977 |
Rohan Hazra1, Susan Tenney2, Alexandra Shlionskaya2, Rajni Samavedam2, Kristin Baxter2, John Ilekis3, Jennifer Weck4, Marian Willinger3, Gilman Grave5, Katerina Tsilou6, David Songco7.
Abstract
The benefits of data sharing are well-established and an increasing number of policies require that data be shared upon publication of the main study findings. As data sharing becomes the new norm, there is a heightened need for additional resources to drive efficient data reuse. This article describes the development and implementation of the Data and Specimen Hub (DASH) by the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) to promote data sharing from NICHD-funded studies and enable researchers to comply with NIH data sharing policies. DASH's flexible architecture is designed to archive diverse data types and formats from NICHD's broad scientific portfolio in a manner that promotes FAIR data sharing principles. Performance of DASH over two years since launch is promising: the number of available studies and data requests are growing; three manuscripts have been published from data reanalysis, all within two years of access. Critical success factors included NICHD leadership commitment, stakeholder engagement and close coordination between the governance body and technical team.Entities:
Mesh:
Year: 2018 PMID: 29557977 PMCID: PMC5859878 DOI: 10.1038/sdata.2018.46
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Major DASH Goals Translated to Success Criteria.
| DASH Goals | Success Criteria (Select) |
|---|---|
| Quality Data | •Cleaned, participant level data•Essential study documentations (such as study protocol, data dictionary, data collection instruments) for meaningful data use•Data collection consistent with 45 C.F.R. Part 46 and other federal clinical research regulations•Data with accurate attributions (provenance) without any errors introduced by the system (integrity), linkage to study/investigators (pedigree) and protected from loss or unauthorized access (security) |
| Standardization | •Standard descriptors and metadata for data and documentations•Standard templates and workflows for submissions and requests |
| Task Driven Access | •Query, integrate and analyze data•Answer specific research questions•Support administrative activities and reporting |
| Efficiency | •Efficient browsing and searching, data submissions and requests•User friendly and dynamic format and page layout•Robust workflows for data submission, access, and use•Mechanism for user feedback and adjustments for improving efficiency and performance |
| Compliance | •Data sharing consistent with informed consents of the participants, relying upon guidance provided by an appropriate Institutional Review Board (IRB) when the terms and conditions of data sharing are unclear or not addressed in the informed consent•Data de-identified of all 18 HIPAA identifiers•Compliance to applicable federal and NIH policies as it relates to system design, data storage, access, sharing, and use |
| Transparency | •Easy visibility to study portfolio and data within DASH•Study and data file descriptions for meaningful interpretation of data•‘One-stop-shop’ for NICHD supported studies stored in DASH and other archives•Linkage of NICHD funded studies stored in external archives |
How DASH Addressed Top Data Sharing Challenges.
| Challenges | How DASH Addressed Challenges |
|---|---|
| •Per NIH data sharing policies, investigators with annual funding >$500 K must share data within one year of publishing the main findings; DASH will also accept data from investigators receiving annual NIH funding of less than $500 K•Per NICHD, all NICHD funded investigators must share data in DASH or another publicly accessible repository in alignment with NIH data sharing policies•As a condition of the Data Use Agreement that all data requesters must sign to receive data access, all studies that reuse data from DASH are required to acknowledge the original study in their publication, ensuring the original study principal investigator receives credit for their contribution | |
| •DASH provides guidance, tools and expertise to help investigator prepare data for sharing•NICHD has issued an R03 (small grant) funding opportunity announcement to assist investigators with archiving data (current information on the grants awarded through this funding opportunity are publicly available through the NIH Research Portfolio Online Reporting Tools (RePORT) by searching for FOA PAR-16-149) | |
| The study IRB or an equivalent Privacy Board must determine that data can be shared in DASH and is consistent with the informed consent of the participant. In some cases, the informed consent may not explicitly state broad data sharing, particularly for studies completed many years ago, before broad data sharing was feasible. | |
| •DASH acts as a data lake; accepting all types of clinical research data (clinical, laboratory, pathology, genomic, images, etc.) due to its flexible metadata model•Accepts data in a variety of structures and formats but is annotated with standard descriptors and metadata for easy discovery | |
| DASH stores only de-identified data and provides guidance and support to de-identify data of all 18 HIPAA identifiers | |
| DASH has multiple safeguards in place to assure data privacy and security:•Data recipient along with their institution must execute a Data Use Agreement with NICHD for a specific research plan and agree to data privacy and recipient’s institutional information security policies, thereby holding not just the recipient but the recipient’s institution liable for any violation•Additional study-specific approvals, such as IRB approval or study-specific steering committee or PI approvals, if required by the data submitter•Data are encrypted both at rest and in transition•System is compliant with FISMA standards | |
| DASH offers tools and guidance for researchers to prepare and annotate their data and documentation for sharing such as the Data Preparation Tool (DPT), guidance for de-identification and coding and the DASH tutorial |
Figure 1Number of Studies, Requests and Registrations in DASH (August 2015 – Feb 2018).
Key Lessons Learned from DASH Implementation at NICHD.
| Provide data sharing language for investigators to include in the informed consents prior to enrolling participants and initiating studies funded by NICHD |
| Provide guidance and tools to effectively manage data from the onset of the study and through the study life cycle so that at the end of the study, the investigator can ‘click to share’ in DASH |
| Provide guidance and standard templates to prepare study documentations at the outset of the study, as: •Essential study documentations varied in quality, content and structure •Significant effort is required to retroactively prepare documentations for meaningful use |
| Encourage use of a universal participant identifier such as Global Unique Identifier (GUID) to enhance data integration and meta-analysis |
| Ensure appropriate metadata standards are selected for broad searches and discovery |
| Coordinate with data submitters on options for de-identification of certain variables such as low frequency or rare conditions |
| Provide options for IRB approval for sharing data from older studies where data sharing is not explicitly stated in the informed consent or the study IRB is no longer active or for multi-site studies |
| Engage investigator stakeholders during design and implementation and collect feedback from broader user community. This is critical for long term success and sustenance of DASH |
| Identify and engage early users to become champions and advocates of data sharing and DASH |
| Maintain data linkages to studies available internally in DASH and/or externally to enable data users to optimize their data search efforts |
| Maintain data provenance by ensuring that any data changes or edits made by data owners or authorized users is recorded by the system and kept for review and auditing purposes |
| Provide URL links of NICHD supported studies deposited in publicly accessible repositories other than DASH to enhance data use across NICHD supported studies |