MOTIVATION: The numerous public data resources make integrative bioinformatics experimentation increasingly important in life sciences research. However, it is severely hampered by the way the data and information are made available. The semantic web approach enhances data exchange and integration by providing standardized formats such as RDF, RDF Schema (RDFS) and OWL, to achieve a formalized computational environment. Our semantic web-enabled data integration (SWEDI) approach aims to formalize biological domains by capturing the knowledge in semantic models using ontologies as controlled vocabularies. The strategy is to build a collection of relatively small but specific knowledge and data models, which together form a 'personal semantic framework'. This can be linked to external large, general knowledge and data models. In this way, the involved scientists are familiar with the concepts and associated relationships in their models and can create semantic queries using their own terms. We studied the applicability of our SWEDI approach in the context of a biological use case by integrating genomics data sets for histone modification and transcription factor binding sites. RESULTS: We constructed four OWL knowledge models, two RDFS data models, transformed and mapped relevant data to the data models, linked the data models to knowledge models using linkage statements, and ran semantic queries. Our biological use case demonstrates the relevance of these kinds of integrative bioinformatics experiments. Our findings show high startup costs for the SWEDI approach, but straightforward extension with similar data.
MOTIVATION: The numerous public data resources make integrative bioinformatics experimentation increasingly important in life sciences research. However, it is severely hampered by the way the data and information are made available. The semantic web approach enhances data exchange and integration by providing standardized formats such as RDF, RDF Schema (RDFS) and OWL, to achieve a formalized computational environment. Our semantic web-enabled data integration (SWEDI) approach aims to formalize biological domains by capturing the knowledge in semantic models using ontologies as controlled vocabularies. The strategy is to build a collection of relatively small but specific knowledge and data models, which together form a 'personal semantic framework'. This can be linked to external large, general knowledge and data models. In this way, the involved scientists are familiar with the concepts and associated relationships in their models and can create semantic queries using their own terms. We studied the applicability of our SWEDI approach in the context of a biological use case by integrating genomics data sets for histone modification and transcription factor binding sites. RESULTS: We constructed four OWL knowledge models, two RDFS data models, transformed and mapped relevant data to the data models, linked the data models to knowledge models using linkage statements, and ran semantic queries. Our biological use case demonstrates the relevance of these kinds of integrative bioinformatics experiments. Our findings show high startup costs for the SWEDI approach, but straightforward extension with similar data.
Authors: Zuotian Tatum; Marco Roos; Andrew P Gibson; Peter Em Taschner; Mark Thompson; Erik A Schultes; Jeroen Fj Laros Journal: J Biomed Semantics Date: 2014-06-03
Authors: Zhihui Wang; Jonathan Sagotsky; Thomas Taylor; Patrick Shironoshita; Thomas S Deisboeck Journal: Wiley Interdiscip Rev Syst Biol Med Date: 2012-11-27
Authors: Young Soo Song; Chan Hee Park; Hee-Joon Chung; Hyunjung Shin; Jihun Kim; Ju Han Kim Journal: BMC Bioinformatics Date: 2011-02-15 Impact factor: 3.169
Authors: Dieter Maier; Wenzel Kalus; Martin Wolff; Susana G Kalko; Josep Roca; Igor Marin de Mas; Nil Turan; Marta Cascante; Francesco Falciani; Miguel Hernandez; Jordi Villà-Freixa; Sascha Losko Journal: BMC Syst Biol Date: 2011-03-05
Authors: Marco Roos; M Scott Marshall; Andrew P Gibson; Martijn Schuemie; Edgar Meij; Sophia Katrenko; Willem Robert van Hage; Konstantinos Krommydas; Pieter W Adriaans Journal: BMC Bioinformatics Date: 2009-10-01 Impact factor: 3.169