| Literature DB >> 36010273 |
Tobias Ueli Blatter1, Harald Witte1, Christos Theodoros Nakas1,2, Alexander Benedikt Leichtle1,3.
Abstract
Laboratory medicine is a digital science. Every large hospital produces a wealth of data each day-from simple numerical results from, e.g., sodium measurements to highly complex output of "-omics" analyses, as well as quality control results and metadata. Processing, connecting, storing, and ordering extensive parts of these individual data requires Big Data techniques. Whereas novel technologies such as artificial intelligence and machine learning have exciting application for the augmentation of laboratory medicine, the Big Data concept remains fundamental for any sophisticated data analysis in large databases. To make laboratory medicine data optimally usable for clinical and research purposes, they need to be FAIR: findable, accessible, interoperable, and reusable. This can be achieved, for example, by automated recording, connection of devices, efficient ETL (Extract, Transform, Load) processes, careful data governance, and modern data security solutions. Enriched with clinical data, laboratory medicine data allow a gain in pathophysiological insights, can improve patient care, or can be used to develop reference intervals for diagnostic purposes. Nevertheless, Big Data in laboratory medicine do not come without challenges: the growing number of analyses and data derived from them is a demanding task to be taken care of. Laboratory medicine experts are and will be needed to drive this development, take an active role in the ongoing digitalization, and provide guidance for their clinical colleagues engaging with the laboratory data in research.Entities:
Keywords: FAIRification; artificial intelligence; clinical chemistry; digitalization; interoperability
Year: 2022 PMID: 36010273 PMCID: PMC9406962 DOI: 10.3390/diagnostics12081923
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Patients’ data is entered into the patient data management system (PDMS), predominantly manually, while information about samples collected as well as about analyses conducted is entered into the laboratory information system (LIS), either manually or automatically. PDMS and LIS are connected and exchange parts of their stored data. Both systems feed a “data lake” comprising various types of data, which can be provided to researchers for Big Data applications.
Recommendations for the FAIRification (FAIR+) of laboratory data.
| Requirements | Implementation |
|---|---|
| Findability |
Assign PIDs meaningfully. Each PID should uniquely identify a single patient, which needs to be consistent between branch laboratories with parallel systems. Develop solutions for unknown emergency patients, which allow correct assignment of test results when personal data is identified later on. Develop solutions for analyses conducted for research purposes. Avoid cumulative PIDs. Record actual sampling time instead of planned sampling time. Connect all analytical devices to the lab IT system to avoid manual entries. Connect the lab IT system to the hospital’s central IT system to enable searches by clinicians and researchers. |
| Accessibility |
Protect lab data adequately with: secure data storage solutions. careful data governance. Design ETL processes efficiently. Consider the general consent status of patients and allow access to data accordingly. Employ modern technical solutions such as multiparty computing and homomorphic encryption for merging data from different sites. |
| Interoperability |
Code analyses in a standardized manner, e.g., with LOINC codes. Additionally, code the device manufacturer and kit version in a standardized way. Code newly developed analyses in a homogenous way, even if no standardized codes are available yet. Enable consolidation of data from different labs. |
| Reusability |
Provide detailed metadata to maximize reproducibility, including: LOINC codes. batch numbers. quality management data. SPREC codes. |
| + |
Offer your laboratory medicine expertise to clinicians and researchers, as no one knows the intricacies of your laboratory data better than you. |
Abbreviations: ETL: extract—transform—load; lab: laboratory; LOINC: Logical Observation Identifiers Names and Code; PID: patient identifier; SPREC: Standard Preanalytical Code. + signifies the additional human resource (laboratory expertise).