| Literature DB >> 29435448 |
Stefanie Broes1,2, Denis Lacombe1, Michiel Verlinden1,2, Isabelle Huys2.
Abstract
The recent revolution in science and technology applied to medical research has left in its wake a trial of biomedical data and human samples; however, its opportunities remain largely unfulfilled due to a number of legal, ethical, financial, strategic, and technical barriers. Precision oncology has been at the vanguard to leverage this potential of "Big data" and samples into meaningful solutions for patients, considering the need for new drug development approaches in this area (due to high costs, late-stage failures, and the molecular diversity of cancer). To harness the potential of the vast quantities of data and samples currently fragmented across databases and biobanks, it is critical to engage all stakeholders and share data and samples across research institutes. Here, we identified two general types of sharing strategies. First, open access models, characterized by the absence of any review panel or decision maker, and second controlled access model where some form of control is exercised by either the donor (i.e., patient), the data provider (i.e., initial organization), or an independent party. Further, we theoretically describe and provide examples of nine different strategies focused on greater sharing of patient data and material. These models provide varying levels of control, access to various data and/or samples, and different types of relationship between the donor, data provider, and data requester. We propose a tiered model to share clinical data and samples that takes into account privacy issues and respects sponsors' legitimate interests. Its implementation would contribute to maximize the value of existing datasets, enabling unraveling the complexity of tumor biology, identify novel biomarkers, and re-direct treatment strategies better, ultimately to help patients with cancer.Entities:
Keywords: biobanking; clinical research; data sharing; oncology; precision medicine
Year: 2018 PMID: 29435448 PMCID: PMC5797296 DOI: 10.3389/fmed.2018.00006
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Examples of different data sharing models with respective benefits and drawbacks.
| Model | Advantage | Disadvantage | Reference |
|---|---|---|---|
| Open access | No selective access, enables research without barriers; data sharing at relatively low costs and little administrative burden | No benefit-risk balancing; magnified risks in terms of misuse of data (no assurance that sound scientific methods are used); requires tools and resources for freely downloadable large, heterogeneous and complex datasets; no direct contact between data provider and requester impeding to provide information on the dataset; less suitable for datasets with high privacy risks | ( |
| Provider | Pre-specified set of criteria should ensure a transparent system; possibility to appeal to an independent board | Lack of full transparency or assurance of impartiality; difficult to identify data holders | ( |
| Catalog | Clear overview of types of data held by different study teams; allows data generators to maintain autonomy | Datasets obtained on different consent forms complicated reuse | ( |
| Partnership | Conduct of research in accordance with requirements of both parties; benefit-sharing strategies | Complex negotiations; increased timelines before project start | ( |
| Gatekeeper | Data provider cannot veto a request; transparent procedure; full assessment of scientific request and requester; apply benefit-risk balance test data sharing and share minimum data necessary for the request; communication portal between data provider and data requester | Costly (infrastructure, administration, maintenance; curation costs; human resources; opportunity costs); potentially time-consuming procedure | ( |
| Database query | No direct data sharing, thus can be applied for (personal or commercially) sensitive data; analyses are conducted by original study team who are most familiar with the nuances of the dataset; not limited by particular formats | Little control and transparency on executed queries; resource-intensive for data holders; potentially considerable wait times for requesters. | ( |
| Donor controlled | Patient engagement and empowerment; effective reuse of data with explicit consent of the donor | Additional burden (increased resources for health literacy; infrastructures to manage patient preferences…) | ( |
Figure 1Chain of stakeholders involved in the process of sharing of clinical patient data and samples.
Figure 2Schematic overviews of nine different data sharing models identified. PDS, project data sphere, PGP, Personal Genome Project, EGA European Genome-phenome Archive, YODA Yale University Open Data Access, CSDR.com ClinicalStudyDataRequest.com, SOAR, Supporting Open Access Research, ARCAD Aide et Recherche en Cancérologie Diggestive, HDC health data cooperatives. Some of these models are also applicable to share samples.
Categorization of data and material according to (A) level of identifiability or encryption to safeguard the protection of an individuals’ identity, and (B) nature of the data and material.
| Category | Explanation | |
|---|---|---|
| A | Identifiable data | Data that can be attributed to a specific data subject without the use of additional information |
| Coded/pseudonymized data | Data processed in such a manner that the personal data can no longer be attributed to a specific data subject without the use of additional information | |
| De-identified/anonymized data | Data that cannot be attributed to a specific data subject | |
| B | Material | Blood, saliva, tumor tissue… |
| Primary patient data | The raw data underlying the results that enable reproducing the research | |
| Inferred, derived patient data | Data created by an (intellectual or financial) investment on the part of the primary research team | |
| Report of results | Summary of research data | |
Non-exhaustive overview of prospective, collaborative -omics screening platforms to facilitate clinical research in precision oncology.
| Platform | Organization(s) | Location | Omics analysis | Tumor | Reference |
|---|---|---|---|---|---|
| AURORA | BIG | Belgium | NGS for a panel of 411 cancer-related genes | Breast | ( |
| Exactis | PMT | Canada | No information publicly available | Breast, lung, colorectal, ovarian, melanoma, prostate | ( |
| ORIEN | Moffitt Cancer Center, The Ohio State University Comprehensive Cancer Center, Arthur G. James Cancer Hospital, Richard J. Solove Research Institute in Columbus | US | No information publicly available | All malignancies | ( |
| NCI-MATCH | NCI | US | NGS | Solid tumors | ( |
| PMT initiative | Exactis Innovation | Canada | -omics platforms | Colorectal, lung, melanoma, breast | ( |
| SPECTA | EORTC | Europe | -omics platforms | Colorectal, lung, brain, melanoma, rare, prostate | ( |
| Stratified Medicine Platform 2 | Cancer Research UK | The UK | No information publicly available | NCSLC | ( |
| The CPCT | Nederlands Kanker Instituut-Antoni van Leeuwenhoek Ziekenhuis, Erasmus MC Kanker Instituut, UMC Utrecht | The Netherlands | HiSeq Xten Illumina (WGS) ( | All malignancies | ( |
| U-can | Uppsala University | Sweden | WGS, SNP analyses, RNA Seq | Colorectal, leukemia, lymphoma, prostate, brain, gynecological, neuroendocrine, breast | ( |
CGH, comparative genomic hybridization; EORTC, European Organisation for Research and Treatment of Cancer; NCI-MATCH, National Cancer Institute-Molecular Analysis for Therapy Choice; NGS, next generation sequencing; NCSLC, non-small cell lung cancer; ORIEN, oncology research information exchange network; PMT, personalize my treatment; RNA, ribonucleic acid, SNP, single nucleotide polymorphism, SPECTA, screening patients for efficient clinical trial access; The CPCT, the center for personalized cancer treatment; WGS, whole genome sequencing.
Figure 3Proposed tiered-layered model to share and access clinical data and samples.