| Literature DB >> 34863191 |
Rebecca Asiimwe1,2, Stephanie Lam3,4, Samuel Leung1,5,4, Shanzhao Wang5,4, Rachel Wan5,6, Anna Tinker7,6,4, Jessica N McAlpine3,4, Michelle M M Woo3,4, David G Huntsman1,5,3,4, Aline Talhouk8,9.
Abstract
BACKGROUND: To drive translational medicine, modern day biobanks need to integrate with other sources of data (clinical, genomics) to support novel data-intensive research. Currently, vast amounts of research and clinical data remain in silos, held and managed by individual researchers, operating under different standards and governance structures; a framework that impedes sharing and effective use of data. In this article, we describe the journey of British Columbia's Gynecological Cancer Research Program (OVCARE) in moving a traditional tumour biobank, outcomes unit, and a collection of data silos, into an integrated data commons to support data standardization and resource sharing under collaborative governance, as a means of providing the gynecologic cancer research community in British Columbia access to tissue samples and associated clinical and molecular data from thousands of patients.Entities:
Keywords: Biobank-technologies; Biobanks; Biospecimens; Data commons; Data governance; Federated systems; Laboratory Information Management Systems (LIMS); Precision medicine
Mesh:
Year: 2021 PMID: 34863191 PMCID: PMC8645144 DOI: 10.1186/s12967-021-03147-z
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Summary of fundamental research and infrastructural needs of OVCARE’s research community
| 1 | Generate efficiencies in data collection, storage and analysis to maximize utility of collected data |
| 2 | Limit errors in data handling and ensure reproducibility of research findings |
| 3 | Protect patients’ privacy and honor their consent |
| 4 | Optimize secondary and continuous use of data generated from research and clinical care |
| 5 | Facilitate the recruitment of patients in various clinical studies |
| 6 | Identify specimen from patients with specific clinical, molecular and genomic characteristics |
| 7 | Integration of medical and clinical data with molecular information to enable the discovery and testing of new associations and hypotheses towards translational research |
| 8 | Organize data towards a learning healthcare system where translation is bi-directional: Evidence-based research is used to inform practice, and the data generated during clinical care is in turn used to inform guidelines, generate hypotheses and trigger pragmatic trials |
| 1 | Allow batch data imports and exports |
| 2 | Facilitate validation of data entered to minimize errors (e.g. returning an error message when text is entered instead of a numeric value) |
| 3 | Easy-to-use and customizable user interfaces |
| 4 | Support both prospective and retrospective data collection mechanisms |
| 5 | Adapt to changing needs between studies and projects, as well as over time |
| 6 | Track biospecimen locations, usage and shipment to both local and offsite storage locations |
| 7 | Support multi-tenancy for the banking of biospecimens from distributed and diverse studies lead by different investigators interested in sharing resources |
| 8 | Adherence to best practices in privacy and security, such as, support for data encryption, audit trails on all user actions and data changes for regulatory compliance, configurable user privileges, role-based access control and adherence to federal regulations with respect to deidentification of specimen and tracking of consent |
| 9 | Support interoperability and integration with other institutions, systems, and data sources to facilitate data sharing |
| 10 | Potential to scale-up biospecimen and user capacity at no added cost |
| 11 | Stable and mature vendor and community support |
Fig. 1Needs-to-biobank mapping and the number of requirements fulfilled by each LIMS. a Tiled plot of the mapping of each biospecimen research need to the biobank solution meeting that need. Surveyed biobanks are plotted on the y-axis and research needs (desired biobank features) are plotted on the x-axis, grouped and colored by feature class. b Barplot on the overall number of features provided by a specific LIMS. The LIMS solutions are plotted on the y-axis and the number of features provided are plotted on the x-axis
Fig. 2OVCARE’s data commons infrastructure and software stack. The overall data commons infrastructure comprises of five main components: (1) A clinical database (REDCap) that consolidates and manages clinical data collections from the BC Cancer Registry and the Cheryl Brown Gynecological Cancers Outcomes Unit, (2) a Library Information Management System (OpenSpecimen) that stores and manages biospecimens collected from consented participants at different hospital sites (i.e. Vancouver General Hospital, the University of British Columbia Hospital, BC Cancer Vancouver, and now a few more centers in BC, (3) the cBioPortal that supports the exploration, analysis and visualization of clinical attributes and molecular profiles from patient tumor samples, (4) the OVCARE Resource Portal (ORP) that governs data and resource sharing based on stipulated protocols, standard operating procedures and research ethics, and (5) the Research Community (this includes the OVCARE internal research and informatics team, and the broader research community that OVCARE serves). Each of the components (REDCap, OpenSpecimen, cBioPortal, ORP) identified to meet our research needs are separately hosted in our hospital’s computing environment and programmatically interlinked through API calls. The data from the different domains are interlinked using system-wide unique identifiers that link patients to their biospecimen collections and molecular/genomics data. To access the amassed clinical and biospecimen collections, authenticated researchers in the OVCARE research community send data and sample acquisition requests to the ORP through which those requests are met by informatics staff, if all stipulated requirements including ethics approval are met. Upon successful data and sample acquisition, researchers conduct their respective studies, and the data generated (raw or processed, and/ biospecimen derivatives) from their research are retuned to OVCARE making it available for re-purposing/secondary use. Furthermore, molecular data returned to the data commons are linked back to the available and stored patient biospecimens. Together with clinical outcomes, these molecular profiles are further explored, analyzed and visualized using the cBioPortal
Fig. 3Implementation timeline of OVCARE’s data commons
Fig. 4Clinical and outcome data on all gynecological cancer patients diagnosed in British Columbia. In the tiled plot, data elements (demographic, medical history, pathology, chemotherapy, radiation, surgery and quality of life data) were plotted on the y-axis against gynecological cancer patients (patient 1 to n) on the x-axis. Darker tiles indicate availability of data on a patient per data element. Clinical studies (study 1 to n) are interested in certain patients with available data on specific data elements. Subsets of patients overlap between clinical studies