| Literature DB >> 28912709 |
Santiago Timón1,2,3, Mariano Rincón1, Rafael Martínez-Tomás1.
Abstract
Informatics increases the yield from neuroscience due to improved data. Data sharing and accessibility enable joint efforts between different research groups, as well as replication studies, pivotal for progress in the field. Research data archiving solutions are evolving rapidly to address these necessities, however, distributed data integration is still difficult because of the need of explicit agreements for disparate data models. To address these problems, ontologies are widely used in biomedical research to obtain common vocabularies and logical descriptions, but its application may suffer from scalability issues, domain bias, and loss of low-level data access. With the aim of improving the application of semantic models in biobanking systems, an incremental semantic framework that takes advantage of the latest advances in biomedical ontologies and the XNAT platform is designed and implemented. We follow a layered architecture that allows the alignment of multi-domain biomedical ontologies to manage data at different levels of abstraction. To illustrate this approach, the development is integrated in the JPND (EU Joint Program for Neurodegenerative Disease) APGeM project, focused on finding early biomarkers for Alzheimer's and other dementia related diseases.Entities:
Keywords: Neurodegenerative Diseases; Semantic Web; XNAT; biomedical ontologies; data analysis; data exchange; knowledge management
Year: 2017 PMID: 28912709 PMCID: PMC5583223 DOI: 10.3389/fninf.2017.00057
Source DB: PubMed Journal: Front Neuroinform ISSN: 1662-5196 Impact factor: 4.081
Figure 1The three component layers of the framework, ordered by abstraction.
Figure 2Schema to formal level transformation from a stripped down experiment for exemplification. XNAT XML data is transformed to an RDF graph using SIO classes and properties. Instances are represented as white rectangles and classes as rounded orange rectangles.
Relation of the component parts of the CRF with subsections and the ontologies with which are modeled.
| Subject demographics | PATO | |
| Medical history | Social information | ADO |
| Cognitive screening | MMSE | ADO |
| Physical examination | General somatic examination | ADO |
| Diagnosis | Staging | ADO |
| Biochemistry | Blood tests | ADO |
| Genetics | ADO | |
| Imaging reports | NIDM |
Figure 3A diagram showing the relations building the assertion that a subject is staged with Mild Cognitive Impairment. The formal level improves the semantics of experiment data, but is still attached to raw values. The domain level introduces specific concepts for a given domain, in this case diagnosis in Alzheimer's Disease. Instances are represented as rounded white rectangles and classes as rounded colored rectangles.
Figure 4Activity diagram of the ETL pipeline. When any change in the data is registered by XNAT's middleware, the pipeline engine executes the xnat2rdf script passing the XML of the changed resource. This script transforms XNAT XML to RDF, which is processed by the reasoner to execute DL and SPIN inferencing and the resulting triples loaded into the triplestore. Finally QC related data is processed for reporting.
Description of stage categories and simplified criteria definition with Description Logics.
| Normal Control (NC) | The subject's MMSE score is over 28, all T-Scores are equal or greater than 35 and does not report subjective cognitive decline | |
| Subjective Cognitive Decline (SCD) | The subject's MMSE score is over 28, all T-Scores are equal or greater than 35 and reports subjective cognitive decline | |
| Mild Cognitive Impairment (MCI) | The subject's MMSE score is between 23 and 28, having at least one T-Score under 35 | |
| Dementia | The subject's MMSE score is under 23 and has at least one T-Score under 35 |