| Literature DB >> 35468846 |
Núria Queralt-Rosinach1, Rajaram Kaliyaperumal1, César H Bernabé1, Qinqin Long1, Simone A Joosten2, Henk Jan van der Wijk3, Erik L A Flikkenschild4, Kees Burger1, Annika Jacobsen1, Barend Mons1,5,6, Marco Roos7.
Abstract
BACKGROUND: The COVID-19 pandemic has challenged healthcare systems and research worldwide. Data is collected all over the world and needs to be integrated and made available to other researchers quickly. However, the various heterogeneous information systems that are used in hospitals can result in fragmentation of health data over multiple data 'silos' that are not interoperable for analysis. Consequently, clinical observations in hospitalised patients are not prepared to be reused efficiently and timely. There is a need to adapt the research data management in hospitals to make COVID-19 observational patient data machine actionable, i.e. more Findable, Accessible, Interoperable and Reusable (FAIR) for humans and machines. We therefore applied the FAIR principles in the hospital to make patient data more FAIR.Entities:
Keywords: FAIR; Hospital; Ontologies; Open science; Patient data; Research data management
Mesh:
Year: 2022 PMID: 35468846 PMCID: PMC9036506 DOI: 10.1186/s13326-022-00263-7
Source DB: PubMed Journal: J Biomed Semantics
Fig. 1Illustration of the central concepts of the envisioned FAIR based architecture: the central star represents the data linking model for interoperability that the sources refer to (data and metadata), the small stars next to each source represent what is used of the central model to describe the source (thereby becoming ‘self-describing’), the arrows represent workflows or scripts: for the source systems to map or convert source data and metadata to the central data linking model, for retrieving data from across the sources through the central data linking model, and for analysis. FAIR Data Points provide access to the ‘ontologised’ metadata and data (not shown)
FAIR assessment of existing systems containing cytokine data
| Existing system | FAIR assessment |
|---|---|
| Original cytokine dataset (Excel) | Structured, but custom-built, thus not in a uniform, globally machine readable way. |
| Dataset in Castor EDC | Structured in a uniform way, but no standards were applied to create a FAIR dataset. |
| Dataset in Opal | Structured, findable through the central LUMC warehouse and accessible through an API, but no global machine readable standards were applied to represent the data and metadata for machine processing. |
Fig. 2Ontological data model for the cytokine measurements patient dataset
Fig. 3Semantic module to represent disease severity score phenotypes calculated in the hospital
Fig. 4Ontological metadata model instantiated as an RDF graph. The four lower edges are the four additional metadata elements for COVID-19 data resource description
Fig. 5Integration of our ontological approach with existing systems
Example queries using external LOD resources
| Question | Result |
|---|---|
| Count number of patients | LUMCquery |
| Retrieve measured cytokines in the LUMC with protein annotation from the UniProt knowledgebase | Federatedquery |
Fig. 6Federated SPARQL query crossing FAIR patient data with the UniProt knowledgebase
Fig. 7BEAT-COVID FAIRification workflow to make the data management and infrastructure in the hospital more FAIR. Collaborators and results are described in every step where applicable