| Literature DB >> 36091999 |
Margherita Martorana1, Tobias Kuhn1, Ronald Siebes1, Jacco van Ossenbruggen1.
Abstract
Understanding the complexity of restricted research data is vitally important in the current new era of Open Science. While the FAIR Guiding Principles have been introduced to help researchers to make data Findable, Accessible, Interoperable and Reusable, it is still unclear how the notions of FAIR and Openness can be applied in the context of restricted data. Many methods have been proposed in support of the implementation of the principles, but there is yet no consensus among the scientific community as to the suitable mechanisms of making restricted data FAIR. We present here a systematic literature review to identify the methods applied by scientists when researching restricted data in a FAIR-compliant manner in the context of the FAIR principles. Through the employment of a descriptive and iterative study design, we aim to answer the following three questions: (1) What methods have been proposed to apply the FAIR principles to restricted data?, (2) How can the relevant aspects of the methods proposed be categorized?, (3) What is the maturity of the methods proposed in applying the FAIR principles to restricted data?. After analysis of the 40 included publications, we noticed that the methods found, reflect the stages of the Data Life Cycle, and can be divided into the following Classes: Data Collection, Metadata Representation, Data Processing, Anonymization, Data Publication, Data Usage and Post Data Usage. We observed that a large number of publications used 'Access Control' and 'Usage and License Terms' methods, while others such as 'Embargo on Data Release' and the use of 'Synthetic Data' were used in fewer instances. In conclusion, we are presenting the first extensive literature review on the methods applied to confidential data in the context of FAIR, providing a comprehensive conceptual framework for future research on restricted access data.Entities:
Keywords: Confidential data; FAIR guiding principles; FAIR implementation; Linked data; Restricted access data
Year: 2022 PMID: 36091999 PMCID: PMC9454861 DOI: 10.7717/peerj-cs.1038
Source DB: PubMed Journal: PeerJ Comput Sci ISSN: 2376-5992
The table shows the grouping of the technology readiness levels (TRLs) based on the work by (European Commission, 2017), and it provides a definition on how each TRL group was assigned to theincluded publications.
| Technology readiness levels based on ( | Definition |
|---|---|
| TRL 1 & 2 | This class was assigned to publications where the technology proposed was only conceptually formulated but not implemented. |
| TRL 3 & 4 | This class, instead, was assigned to publications where the technology proposed has gone through some testing but only in limited environments. |
| TRL 4 & 5 | This class was assigned to publications that clearly showed testing and expected performance. |
| TRL 7, 8 & 9 | Lastly, this class was assigned to publications that showed full technical capabilities and that were also available to users. |
Figure 1PRISMA flow diagram illustrating the results from the search and selection process, performed on the Google Scholar database.
List with short authors, year and title references of the final 40 publications included in the systematic review.
| Authors | Year | Title |
|---|---|---|
| Dyke et al. | 2016 | Consent codes: upholding standard data use conditions |
| Lakerveld et al. | 2017 | Identifying and sharing data for secondary data analysis of physical activity, sedentary behaviour and their determinants across the life course in Europe: general principles and an example from DEDIPAC |
| Bertocco et al. | 2018 | Cloud access to interoperable IVOA-compliant VOSpace storage |
| Kleemola et al. | 2019 | A FAIR guide for data providers to maximise sharing of human genomic data |
| Sun et al. | 2019 | A Privacy-Preserving Infrastructure for Analyzing Personal Health Data in a Vertically Partitioned Scenario. |
| Rockhold et al. | 2019 | Open science: The open clinical trials data journey |
| Demotes-Mainard et al. | 2019 | How the new European data protection regulation affects clinical research and recommendations? |
| Van Atteveldt et al. | 2019 | Computational communication science| toward open computational communication science: A practical road map for reusable data and code |
| Dimper et al. | 2019 | ESRF Data Policy, Storage, and Services |
| Lahti et al. | 2019 | ’As Open as Possible, as Closed as Necessary’-Managing legal and owner-defined restrictions to openness of biodiversity data. |
| Becker et al. | 2019 | DAISY: A Data Information System for accountability under the General Data Protection Regulation |
| Kephalopoulos et al. | 2020 | Indoor air monitoring: sharing and accessing data |
| Hoffmann et al. | 2020 | Guiding principles for the use of knowledge bases and real-world data in clinical decision support systems: report by an international expert workshop at Karolinska Institutet |
| Cullinan et al. | 2020 | Unlocking the potential of patient data through responsible sharing–has anyone seen my keys? |
| Nicholson et al. | 2020 | Interoperability of population-based patient registries |
| Paprica et al. | 2020 | Essential requirements for establishing and operating data trusts: practical guidance co-developed by representatives from fifteen Canadian organizations and initiatives |
| Jaddoe et al. | 2020 | The LifeCycle Project-EU Child Cohort Network: a federated analysis infrastructure and harmonized data of more than 250,000 children and parents |
| Bader et al. | 2020 | The International Data Spaces Information Model–An Ontology for Sovereign Exchange of Digital Content |
| Aarestrup et al. | 2020 | Towards a European health research and innovation cloud (HRIC) |
| Suver et al. | 2020 | Bringing Code to Data: Do Not Forget Governance |
| Roche et al. | 2020 | Open government data and environmental science: a federal Canadian perspective |
| Beyan et al. | 2020 | Distributed analytics on sensitive medical data: The Personal Health Train |
| Choudhury et al. | 2020 | Personal health train on FHIR: A privacy preserving federated approach for analyzing FAIR data in healthcare |
| Arefolov et al. | 2021 | Implementation of The FAIR Data Principles for Exploratory Biomarker Data from Clinical Trials |
| Ofili et al. | 2021 | The Research Centers in Minority Institutions (RCMI) Consortium: A Blueprint for Inclusive Excellence |
| Haendel et al. | 2021 | The National COVID Cohort Collaborative (N3C): rationale, design, infrastructure, and deployment |
| Kumar et al. | 2021 | Federated Learning Systems for Healthcare: Perspective and Recent Progress |
| Abuja et al. | 2021 | Public–Private Partnership in Biobanking: The Model of the BBMRI-ERIC Expert Centre |
| Schulman et al. | 2021 | The Finnish Biodiversity Information Facility as a best-practice model for biodiversity data infrastructures |
| Cooper et al. | 2021 | Perspective: The Power (Dynamics) of Open Data in Citizen Science |
| Øvrelid et al. | 2021 | TSD: A Research Platform for Sensitive Data |
| Hanisch et al. | 2021 | Research Data Framework (RDaF): Motivation, Development, and A Preliminary Framework Core |
| Read et al. | 2021 | Embracing the value of research data: introducing the JCHLA/JABSC Data Sharing Policy |
| Hanke et al. | 2021 | In defense of decentralized research data management |
| Zegers et al. | 2021 | Mind Your Data: Privacy and Legal Matters in eHealth |
| Groenen et al. | 2021 | The |
| Delgado Mercè et al. | 2021 | Approaches to the integration of TRUST and FAIR principles |
| Jeliazkova et al. | 2021 | Towards FAIR nanosafety data |
| Demchenko et al. | 2021 | Future Scientific Data Infrastructure: Towards Platform Research Infrastructure as a Service (PRIaaS) |
Figure 2Visual representations of the methods classes found during data analysis.
Below each method a short description can be found. Note the ‘is subclass of’ relations.
Visual representation of the frequency of each method found in the included publications.
|
|
Note:
If a publication presented the method assigned to the column, then it would show an ‘X’ coloured cell. The colour of the cell is in correspondence to the Technology Readiness Level (TRL) assigned to the given publication: from lightest (light blue - TRL 1 & 2) to darkest (dark blue - TRL 7, 8 & 9). At the bottom of the figure, there is a row showing the number of articles each method has been found in, also colour graded from dark (many instances) to light grey (few instances). Overall, this table shows that there are wide variations in the frequency of the methods, but also that the vast majority of methods present TRL scores of five and above. We can also see that the coverage of the methods is rather broad and they are approximately evenly distributed among each class.