Martin Dugas, Alexandra Meidt, Philipp Neuhaus, Michael Storck, Julian Varghese.
Abstract
BACKGROUND: The volume and complexity of patient data, especially in personalised medicine, are steadily increasing, both in clinical data and in genomic profiles. Typically, more than 1,000 items (e.g., laboratory values, vital signs, diagnostic tests) are collected per patient in clinical trials. In oncology, hundreds of mutations can potentially be detected for each patient by genomic profiling. Data integration from multiple sources therefore constitutes a key challenge for medical research and healthcare.
Keywords: Data integration; ODM; Personalised medicine; Semantic annotation
Year: 2016 PMID: 27245222 PMCID: PMC4888420 DOI: 10.1186/s12874-016-0164-9
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1 Data flow for clinical studies. Patient data is collected both at doctors’ offices and in hospitals and needs to be transferred to a dedicated study database (DB) for each study. Each study has a unique set of participating doctors and hospitals, so many different types of computer systems need to be connected.
Fig. 2 Workflow of ODMedit. To achieve uniform semantic annotations, codes from a metadata repository are re-used and the repository is updated continuously during the annotation process.
Fig. 3 a: Summary of data item “height” within item group “Vital Sign”. b: Items with similar names like “Height” in the metadata repository. c: Form 5518 is presented, which is referenced by b. “Height” refers to “Body Height” in this example.
Fig. 4 CDASH form “Vital Signs” with semantic annotations (UMLS codes) for all patient data items. Codelists, for example regarding Blood Pressure (BP) location, were also semantically annotated. Column one corresponds to UMLS terms, column two (similar, but not identical) to text labels on this form.
Semantic annotations of five randomly selected data elements in the MDM portal. For "Body weight", uniform annotation with C0005910 was achieved; for the other data elements, domain experts selected 2-3 coding variants.
| Data element | #UMLS codes | #Matching UMLS codes | #Occurrences in MDM portal | Semantic annotation in MDM portal |
|---|---|---|---|---|
| Body weight | 276 | 23 | 86 | 86x C0005910 Body weight |
| Date of Birth | 55 | 9 | 85 | 55x C0421451 Patient date of birth |
| Creatinine in Serum | 182 | 13 | 66 | 44x C0201976 Creatinine measurement, serum |
| Platelets | 249 | 16 | 229 | 213x C0005821 Blood Platelets |
| ALT | 104 | 13 | 37 | 30x C0201836 Alanine aminotransferase measurement |
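
One way to read the table is to compute, for each data element, the share of its occurrences that carry the code listed in the last column. The sketch below does this arithmetic in Python. It assumes that the "Nx" prefix in the last column is the number of occurrences annotated with the listed code; the names `rows` and `listed_code_share` are illustrative and do not come from the article or the MDM portal.

```python
# Rows copied from the table above:
# (data element, occurrences in MDM portal, occurrences with the listed code).
# Assumption: the "Nx" prefix in the last column counts occurrences
# annotated with the code shown in that column.
rows = [
    ("Body weight", 86, 86),
    ("Date of Birth", 85, 55),
    ("Creatinine in Serum", 66, 44),
    ("Platelets", 229, 213),
    ("ALT", 37, 30),
]


def listed_code_share(occurrences: int, listed_count: int) -> float:
    """Return the fraction of occurrences annotated with the listed code."""
    if occurrences <= 0:
        raise ValueError("occurrences must be positive")
    return listed_count / occurrences


# Map each data element to its share; 1.0 means fully uniform annotation.
shares = {name: listed_code_share(occ, listed) for name, occ, listed in rows}

for name, share in shares.items():
    print(f"{name}: {share:.1%}")
```

Under that assumption, a share of 1.0 corresponds to the uniform annotation the caption reports for "Body weight", and lower values reflect the additional coding variants used for the other data elements.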