| Literature DB >> 33717315 |
Vicki Xafis1, Markus K Labude1.
Abstract
There is a growing expectation, or even requirement, for researchers to deposit a variety of research data in data repositories as a condition of funding or publication. This expectation recognizes the enormous benefits of data collected and created for research purposes being made available for secondary uses, as open science gains increasing support. This is particularly so in the context of big data, especially where health data is involved. There are, however, also challenges relating to the collection, storage, and re-use of research data. This paper gives a brief overview of the landscape of data sharing via data repositories and discusses some of the key ethical issues raised by the sharing of health-related research data, including expectations of privacy and confidentiality, the transparency of repository governance structures, access restrictions, as well as data ownership and the fair attribution of credit. To consider these issues and the values that are pertinent, the paper applies the deliberative balancing approach articulated in the Ethics Framework for Big Data in Health and Research (Xafis et al. 2019) to the domain of Openness in Big Data and Data Repositories. Please refer to that article for more information on how this framework is to be used, including a full explanation of the key values involved and the balancing approach used in the case study at the end.Entities:
Keywords: Big data; Data repository; Decision-making framework; Health data; Open data; Open science
Year: 2019 PMID: 33717315 PMCID: PMC7747413 DOI: 10.1007/s41649-019-00097-z
Source DB: PubMed Journal: Asian Bioeth Rev ISSN: 1793-9453
Benefits of data sharing
| Research integrity and the promotion of scientific rigour | |
• Reproducibility and replicability for independent verification of research results • Detection of research errors and fraudulent research | |
| Public benefit and harm minimisation | |
• Increase in the utility of existing datasets • Responsible use of public funds • Safer and better informed clinical practice and policy implementation • Reduction of the research burden on frequently studied individuals/groups • Data preservation | |
| Personal and professional benefits | |
• Opportunities for cross-domain, collaborative research • Reduction of duplication of effort and cost considerations • Validation of findings and building on published work |
The sharing of genomic data generates particularly difficult issues around privacy and confidentiality. Genomic data has been shared for a number of years backed by healthcare professionals and citizens who have advocated for its open use and re-use (Topol Some of the considerations in genomic research include the following: 1. When genomic data is shared openly, it can never be withdrawn from the public sphere, and it is impossible to know the uses to which it might be put (Heeney et al. 2. Re-identification of participants from their genomic data is increasingly possible (Erlich et al. 3. It is not possible for researchers to foresee the kinds of uses to which data will be put in the future with some uses potentially leading to re-identification which could result in harms. Likewise, public benefits/harms are not entirely anticipatable, which may pose a challenge to meeting participants’ expectations of social benefits that motivates their willingness to make available their genomic data. 4. Although data is often shared globally, legislative protections are not uniform across jurisdictions (e.g. anti-discrimination legislation). The difference in legal protections afforded can impact significantly not only on individuals but also on families and entire ethnic groups. Researchers need to bear such broader considerations in mind when engaging participants. 5. Concerns have also been raised about the discrepancy between the privacy protections afforded to participants and those afforded to biological relatives whose data is inevitably also shared but whose consent is not sought (Takashima et al. In view of the specific ethical issues raised in genomic and other kinds of research where sensitive health data is used, familiarity with the licence agreements and the conditions of access is essential, as is an understanding that researchers depositing data can impose conditions on access and re-use to protect participants’ confidentiality and privacy (Mauthner and Parry |