| Literature DB >> 32031623 |
George Alter1, Alejandra Gonzalez-Beltran2, Lucila Ohno-Machado3, Philippe Rocca-Serra4.
Abstract
BACKGROUND: Data reuse is often controlled to protect the privacy of subjects and patients. Data discovery tools need ways to inform researchers about restrictions on data access and re-use.Entities:
Keywords: confidential data; data access; data discovery; data use; metadata
Year: 2020 PMID: 32031623 PMCID: PMC7006671 DOI: 10.1093/gigascience/giz165
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Descriptors for Authorization
| Authorization type | Description |
|---|---|
| None | Not covered by a DUA |
| “Click through” online license | Users must agree to an online agreement without providing additional identification |
| Registration | Users must register before access is allowed and agree to conditions of use. Registration information may be verified |
| DUA signed by an individual | An agreement signed by the investigator is required. DUAs may require additional information, such as a research plan and an IRB review (see discussion of licenses below) |
| DUA signed by an institution | An agreement signed by the investigator's institution is required. DUAs require additional information, such as a research plan and an IRB review (see discussion of licenses below) |
DUA: data use agreement; IRB: institutional review board.
Descriptors for Authorization
| Authentication type | Description |
|---|---|
| None | No authentication required |
| Simple login | Single-factor login or the use of an authentication key or registered IP address is required |
| Multi-factor login | Multiple-factor login using a combination of IP address, password protection, authentication key, or other forms of authentication |
Descriptors for Access
| Access method | Description |
|---|---|
| Download | The data are available for download. A license may be required |
| API | Interaction with the data may be automated via defined communication protocols, i.e., APIs |
| Remote access | Users may access the data in a secure remote environment (“virtual data enclave”). Individual-level data may not be downloaded, only approved results |
| Remote service | A user may submit program code or the script for a software package to be executed in a secure data center. The remote site returns outputs. It may perform a review before releasing the results |
| Enclave | Access is provided to approved users within a secure facility without remote access. Results may remain at the enclave or be released after review |
Figure 1:The network of agreements from data collection to data sharing. Solid lines connect parties in legal documents; dashed lines show agreements that are implicated in later documents. Documents are shown in white. Colors show roles and organizations.
Figure 2:Graphical representation of relevant constructs allowing consent, license, and terms of use information to be made available as information payload in DATS messages. The new "ConsentInformation" schema allows for annotation (semantic markup) with resources such as the Data Use Ontology (DUO; produced by the Global Alliance for Genomic Health) or the Information Consent Ontology (ICO).
Examples of dbGAP Consent Groups
| Consent group | Consent information |
|---|---|
| Health/Medical/Biomedical (IRB) | Use of this data is limited to health/medical/biomedical purposes, does not include the study of population origins or ancestry. Requestor must provide documentation of local IRB approval. Use of the MGH AF Study data deposited in dbGaP is restricted to research on associations between phenotypes and genotypes. MGH AF Study data may not be used to investigate individual subject genotypes, individual pedigree structures, perceptions of racial/ethnic identity, non-maternity/paternity, and of variables that could be considered as stigmatizing an individual or group. All research must be related to the etiology and prevention of morbidity and mortality of the U.S. population consistent with the demographic distribution in the MGH AF Study. Data users will be required to obtain IRB approval for their projects from their respective institutions (please note that only full or expedited approvals will be accepted). |
| Disease-Specific (Atrial Fibrillation, IRB, RD) | Use of the data must be related to Atrial Fibrillation and related disorders. Requestor must provide documentation of local IRB approval. Data use is limited to research related to atrial fibrillation and cardiovascular disease. |
Source: [61]. MGH AF: Massachusetts General Hospital Atrial Fibrillation.