| Literature DB >> 35289369 |
Craig Barnes1, Binam Bajracharya1, Matthew Cannalte1, Zakir Gowani1, Will Haley1, Taha Kass-Hout2, Kyle Hernandez1, Michael Ingram1, Hara Prasad Juvvala1, Gina Kuffel1, Plamen Martinov3, J Montgomery Maxwell1, John McCann1, Ankit Malhotra2, Noah Metoki-Shlubsky1, Chris Meyer1, Andre Paredes1, Jawad Qureshi1, Xenia Ritter1, Philip Schumm4, Mingfei Shao1, Urvi Sheth3, Trevar Simmons1, Alexander VanTol1, Zhenyu Zhang1, Robert L Grossman1,5.
Abstract
OBJECTIVE: The objective was to develop and operate a cloud-based federated system for managing, analyzing, and sharing patient data for research purposes, while allowing each resource sharing patient data to operate their component based upon their own governance rules. The federated system is called the Biomedical Research Hub (BRH).Entities:
Keywords: clinical research data warehouse; data commons; data ecosystem; patient data repository; virtual data warehouse
Mesh:
Year: 2022 PMID: 35289369 PMCID: PMC8922179 DOI: 10.1093/jamia/ocab247
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1.A high-level architectural overview of the cloud-based software services, data resources, and applications/workspaces in the Biomedical Research Hub.
Selected data resources available in the Biomedical Research Hub
| Data resource | Organization | Number of research participants | Size (TB) | Number of data elements |
|---|---|---|---|---|
| Genomic Data Commons | NIH/NCI | 83 000 | 3710 | 622 |
| Inflammatory Bowel Disease Genetics Consortium Data Commons | NIH/NIDDK | 107 418 | 4.6 | 762 |
| Kids First Data Resource Center | NIH Common Fund | 18 085 | 6010 | 622 |
| Medical Imaging and Data Resource Center | NIH/NIBIB | 13 439 | 0.5 | 510 |
Figure 2.The Biomedical Research Hub Discovery Portal (https://brh.data-commons.org).
Some of the similarities and differences between the Biomedical Research Hub and other distributed systems for managing research participant data
| Centralized or distributed data | Number of system operators | Data harmonization level | Data access level | |
|---|---|---|---|---|
| Biomedical Research Hub (BRH) | Data distributed across multiple systems (one per organization) | Multiple system operators, with systems managed by separate organizations | Multiple data models, with data elements linking to Common Data Elements and controlled vocabularies | Dataset level |
| Several commercial systems | Data distributed across multiple systems (one per organization) | One system operator | One data model, with local adapters | Research participant level |
| HMO Research Network (HMORN) Virtual Data Warehouse | Data distributed across multiple systems (one per organization) | Multiple system operators with shared governance model | Single data model | Research participant level |
| NCI Genomic Data Commons (GDC) | Centralized data in one system | One system operator | Harmonized data with single data model | Research participant level |
| NCI Cancer Research Data Commons (CRDC) | Data is distributed across multiple systems (one per data type) | Multiple system operators managed by a single organization | Multiple data models (one per system), with harmonized data model (across systems) | At dataset and research participant level |