Literature DB >> 21756325

S3QL: a distributed domain specific language for controlled semantic integration of life sciences data.

Helena F Deus1, Miriã C Correa, Romesh Stanislaus, Maria Miragaia, Wolfgang Maass, Hermínia de Lencastre, Ronan Fox, Jonas S Almeida.   

Abstract

BACKGROUND: The value and usefulness of data increases when it is explicitly interlinked with related data. This is the core principle of Linked Data. For life sciences researchers, harnessing the power of Linked Data to improve biological discovery is still challenged by a need to keep pace with rapidly evolving domains and requirements for collaboration and control as well as with the reference semantic web ontologies and standards. Knowledge organization systems (KOSs) can provide an abstraction for publishing biological discoveries as Linked Data without complicating transactions with contextual minutia such as provenance and access control.We have previously described the Simple Sloppy Semantic Database (S3DB) as an efficient model for creating knowledge organization systems using Linked Data best practices with explicit distinction between domain and instantiation and support for a permission control mechanism that automatically migrates between the two. In this report we present a domain specific language, the S3DB query language (S3QL), to operate on its underlying core model and facilitate management of Linked Data.
RESULTS: Reflecting the data driven nature of our approach, S3QL has been implemented as an application programming interface for S3DB systems hosting biomedical data, and its syntax was subsequently generalized beyond the S3DB core model. This achievement is illustrated with the assembly of an S3QL query to manage entities from the Simple Knowledge Organization System. The illustrative use cases include gastrointestinal clinical trials, genomic characterization of cancer by The Cancer Genome Atlas (TCGA) and molecular epidemiology of infectious diseases.
CONCLUSIONS: S3QL was found to provide a convenient mechanism to represent context for interoperation between public and private datasets hosted at biomedical research institutions and linked data formalisms.

Entities:  

Mesh:

Year:  2011        PMID: 21756325      PMCID: PMC3155508          DOI: 10.1186/1471-2105-12-285

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  28 in total

1.  The UCSC Genome Browser Database.

Authors:  D Karolchik; R Baertsch; M Diekhans; T S Furey; A Hinrichs; Y T Lu; K M Roskin; M Schwartz; C W Sugnet; D J Thomas; R J Weber; D Haussler; W J Kent
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

Review 2.  State of the nation in data integration for bioinformatics.

Authors:  Carole Goble; Robert Stevens
Journal:  J Biomed Inform       Date:  2008-02-05       Impact factor: 6.317

3.  Computer science. Beyond the data deluge.

Authors:  Gordon Bell; Tony Hey; Alex Szalay
Journal:  Science       Date:  2009-03-06       Impact factor: 47.728

4.  Moby and Moby 2: creatures of the deep (web).

Authors:  Ben P Vandervalk; E Luke McCarthy; Mark D Wilkinson
Journal:  Brief Bioinform       Date:  2009-01-16       Impact factor: 11.622

5.  Bio2RDF: towards a mashup to build bioinformatics knowledge systems.

Authors:  François Belleau; Marc-Alexandre Nolin; Nicole Tourigny; Philippe Rigault; Jean Morissette
Journal:  J Biomed Inform       Date:  2008-03-21       Impact factor: 6.317

Review 6.  Data-driven methods to discover molecular determinants of serious adverse drug events.

Authors:  A P Chiang; A J Butte
Journal:  Clin Pharmacol Ther       Date:  2009-01-28       Impact factor: 6.875

7.  S3DB core: a framework for RDF generation and management in bioinformatics infrastructures.

Authors:  Jonas S Almeida; Helena F Deus; Wolfgang Maass
Journal:  BMC Bioinformatics       Date:  2010-07-20       Impact factor: 3.169

8.  Evolution of MRSA during hospital transmission and intercontinental spread.

Authors:  Simon R Harris; Edward J Feil; Matthew T G Holden; Michael A Quail; Emma K Nickerson; Narisara Chantratita; Susana Gardete; Ana Tavares; Nick Day; Jodi A Lindsay; Jonathan D Edgeworth; Hermínia de Lencastre; Julian Parkhill; Sharon J Peacock; Stephen D Bentley
Journal:  Science       Date:  2010-01-22       Impact factor: 47.728

9.  Entrez Gene: gene-centered information at NCBI.

Authors:  Donna Maglott; Jim Ostell; Kim D Pruitt; Tatiana Tatusova
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

10.  A Semantic Web management model for integrative biomedical informatics.

Authors:  Helena F Deus; Romesh Stanislaus; Diogo F Veiga; Carmen Behrens; Ignacio I Wistuba; John D Minna; Harold R Garner; Stephen G Swisher; Jack A Roth; Arlene M Correa; Bradley Broom; Kevin Coombes; Allen Chang; Lynn H Vogel; Jonas S Almeida
Journal:  PLoS One       Date:  2008-08-13       Impact factor: 3.240

View more
  2 in total

1.  Shared data science infrastructure for genomics data.

Authors:  Hamid Bagheri; Usha Muppirala; Rick E Masonbrink; Andrew J Severin; Hridesh Rajan
Journal:  BMC Bioinformatics       Date:  2019-08-22       Impact factor: 3.169

Review 2.  The semantic web in translational medicine: current applications and future directions.

Authors:  Catia M Machado; Dietrich Rebholz-Schuhmann; Ana T Freitas; Francisco M Couto
Journal:  Brief Bioinform       Date:  2013-11-06       Impact factor: 11.622

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.