| Literature DB >> 21228907 |
Naveen Ashish1, José Luis Ambite, Maria Muslea, Jessica A Turner.
Abstract
We describe an application of the BIRN mediator to the integration of neuroscience experimental data sources. The BIRN mediator is a general purpose solution to the problem of providing integrated, semantically-consistent access to biomedical data from multiple, distributed, heterogeneous data sources. The system follows the mediation approach, where the data remains at the sources, providers maintain control of the data, and the integration system retrieves data from the sources in real-time in response to client queries. Our aim with this paper is to illustrate how domain-specific data integration applications can be developed quickly and in a principled way by using our general mediation technology. We describe in detail the integration of two leading, but radically different, experimental neuroscience sources, namely, the human imaging database, a relational database, and the eXtensible neuroimaging archive toolkit, an XML web services system. We discuss the steps, sources of complexity, effort, and time required to build such applications, as well as outline directions of ongoing and future research on biomedical data integration.Entities:
Keywords: data integration; heterogeneous sources; neuroinformatics
Year: 2010 PMID: 21228907 PMCID: PMC3017358 DOI: 10.3389/fninf.2010.00118
Source DB: PubMed Journal: Front Neuroinform ISSN: 1662-5196 Impact factor: 4.081
Figure 1Mediator architecture.
Figure 2Developing a data integration application.
Figure 4(Model) Heterogeneity Across HID and XNAT.
Source and domain model, and integration rules.
Not all details shown, for readability.
Mediator queries in FBIRN.
| Information need | Mediator query | Variants | Correct answers (Size) | (Avg) response time (s) |
|---|---|---|---|---|
| Find all female subjects between the ages of 40 and 50. | Q(source, subjectid):- g_hasAge(source, subjectid, A) ^ g_hasGender(source, subjectid, G) and (40 < A < 50) ^ (G = “F”) | Vary age parameters, and constraints. on other aspects such as handedness, race etc | Yes | 2.3 |
| Find all subjects with indications of Alzheimer's | Q(source, subjectid, CDR, MMSE):- g_Assessment(source, subjectid, “CDR,” C) ^ g_Assessment(source, subjectid, “MMSE,” M) ^ C > 3 and M > 10 | Vary (subject) conditions being searched for. | Yes | 1.9 |
| Find all fMRI scans taken with a 3T scanner | Q(source, subjectid, scan):- g_ExperimentAcquisition() ^ (ST = fMRI) ? (scanner = 3T) | Vary types of scans and/or scanners | Yes | 12.1 |
Figure 3FBIRN query interface and results.
FBIRN application development effort.
| Task | Personnel | Time (person months) |
|---|---|---|
| Requirements understanding | Model developer, Domain expert, Data administrators | 2 |
| Data source understanding | Model developer | 3 |
| Developing domain model | Model developer | 5 |
| Wrapper development | Programmer | 0.75 |
| Query evaluation | All | 2 |
| GUI development | Programmer | 1 |