| Literature DB >> 25973166 |
Hugo Leroux1, Laurent Lefort2.
Abstract
BACKGROUND: There is an increasing recognition of the need for the data capture phase of clinical studies to be improved and for more effective sharing of clinical data. The Health Care and Life Sciences community has embraced semantic technologies to facilitate the integration of health data from electronic health records, clinical studies and pharmaceutical research. This paper explores the integration of clinical study data exchange standards and semantic statistic vocabularies to deliver clinical data as linked data in a format that is easier to enrich with links to complementary data sources and consume by a broad user base.Entities:
Keywords: Longitudinal clinical study; Medication mapping; Ontology; RDF data cube; Semantic enrichment
Year: 2015 PMID: 25973166 PMCID: PMC4429421 DOI: 10.1186/s13326-015-0012-6
Source DB: PubMed Journal: J Biomed Semantics
Figure 1The Australian imaging biomarker and lifestyle study of ageing. Illustrates the logical organisation of the AIBL study. The AIBL study (depicted as a rectangle in light green with thick border) is split into the five domains (depicted as rectangles in light blue). Each domain is further categorised into sub-domains depicted by rounded rectangles.
Number of instances for the LCDC classes organised by theme
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
|
|
|
|
| ||||
| Clinical | 1030430 | 4495 | 25210 | 1416 | 25 | 6452 | 506 |
| Cognitive | 761650 | 4612 | 9826 | 1415 | 19 | 4069 | 367 |
| Imaging | 58601 | 866 | 2136 | 365 | 12 | 941 | 59 |
| Lifestyle | 710594 | 4026 | 11953 | 1415 | 19 | 7360 | 391 |
| Study | 235566 | 5384 | 6218 | 1414 | 13 | 3292 | 155 |
Figure 2Mapping the CDISC ODM model to the data cube and DDI vocabularies. Illustrates how the original CDISC ODM model (depicted by rectangles in light gray) is overlaid with the RDF Data Cube (depicted by ellipses in green) and the DDI-RDF vocabularies (depicted by rounded rectangles in blue). The Data section, depicted on the left of the model, comprises a hierarchical structure whereby each level is fully contained within the preceding level. As the left side is more about structuring the clinical data, the Data section of the CDISC ODM model is more closely related to qb. The Clinical Data node is mapped to qb:Dataset while qb:Slice is used to split the Subject, Study Event and Form data nodes across the ODM hierarchy into slices, and the Item Data node is mapped to qb:Observation. The ODM node refers to the entire data set and is mapped to disco:LogicalDataset. The right side comprises the metadata section, which contains one Study node, which further comprises one MetaData node. The MetaData node contains a number of StudyEventDef, FormDef, ItemGroupDef and ItemDef nodes, one corresponding to each of the Subject, Study Event, Form, Item Group and Item data nodes defined in the Data section. The Metadata section shows how the variable definitions managed through disco matches ODM’s ItemDef while the grouping of variables via disco:Universe is applied at the FormDef level. Finally, Item Data is logically mapped to disco:Variable.
Figure 3Linked clinical data cube architecture aligned with the RDF data cube. Depicts the architecture of the LCDC. The Main cube (depicted as a red cube) is split into modular Specialised cubes (depicted as a blue cube) and linked using the void:subset property. The Main cube is organised into time-series, cross-section and theme slices using the qb:slice property. The slices are then divided into Observations using the qb:observation property. The qb:dataset property is used to link the observations back to the cube. The Specialised cubes are organised similarly to the Main cube with the exception of the theme slices. The dotted lines show how the slices from all cubes interlink to the study observations through the use of ObservationGroups and the qb:observationGroup property. The mainObservation property manages the link between the ObservationGroups and the Observations while the specialisedObservation property handles the link between the ObservationGroups in the main cube and the corresponding Observations in the specialised cubes.
Figure 4Linked Australian medications data set. Depicts the interlinking of the drug terminologies available, mostly, in Australia in order to facilitate their navigation. For the sake of simplicity, all data item variables have been omitted from the Figure. The AMT concepts are depicted in teal. The ATC DDD concepts are depicted in orange. UNII concepts are in light-blue while DrugBank concept is in light green and the ARTG concept is in magenta. The Figure also introduces an xkos:ConceptAssociation predicate (depicted in yellow) to define many-to-many relationships between amt:MedicinalProduct and artg:RegisteredMedicine concepts.
Medications mapping statistics
|
|
|
|
|---|---|---|
| Total | 7942 | 100.00% |
| Medicinal product/trade product/substance | 5536 | 69.71% |
| Trade product | 5518 | 69.48% |
| Medicinal product | 5266 | 66.31% |
| Substance | 5382 | 67.77% |
Participants taking anti-diabetic drugs
|
|
|
|---|---|
| 1 | Insulin glargine |
| 3 | Glimepiride |
| 46 | Metformin |
| 4 | Rosiglitazone |
| 2 | Glipizide |
| 17 | Gliclazide |
| 4 | Pioglitazone |
| 1 | Sitagliptin |
Participants’ classifications and triglycerides level
|
|
|
|
|---|---|---|
| 11 | ConfirmClassification360- | ConfirmClassification360- |
| memoryComplainerHealthyControl | mciPatient | |
| 4 | ConfirmClassification360- | ConfirmClassification360- |
| nonMemoryComplainerHealthy | mciPatient | |
| Control |
Query performances
|
|
|
|---|---|
| 1 | 22 |
| 2 | 36 |
| 3 | 270 |