| Literature DB >> 20626922 |
Jose Cruz-Toledo1, Michel Dumontier, Marc Parisien, François Major.
Abstract
Increasingly sophisticated knowledge about RNA structure and function requires an inclusive knowledge representation that facilitates the integration of independently -generated information arising from such efforts as genome sequencing projects, microarray analyses, structure determination and RNA SELEX experiments. While RNAML, an XML-based representation, has been proposed as an exchange format for a select subset of information, it lacks domain-specific semantics that are essential for answering questions that require expert knowledge. Here, we describe an RNA knowledge base (RKB) for structure-based knowledge using RDF/OWL Semantic Web technologies. RKB extends a number of ontologies and contains basic terminology for nucleic acid composition along with context/model-specific structural features such as sugar conformations, base pairings and base stackings. RKB (available at http://semanticscience.org/projects/rkb) is populated with PDB entries and MC-Annotate structural annotation. We show queries to the RKB using description logic reasoning, thus opening the door to question answering over independently-published RNA knowledge using Semantic Web technologies.Entities:
Year: 2010 PMID: 20626922 PMCID: PMC2903721 DOI: 10.1186/2041-1480-1-S1-S2
Source DB: PubMed Journal: J Biomed Semantics
Figure 1Information content entities are about material entities in the RKB Illustration of the RDF-based representation used to relate Information Content Entities with their corresponding Material Entities. Molecular structure files are specific manifestations of structure models therefore structure models are represented by their corresponding PDB files. Structure models are also about molecules and other real entities (atoms, base pairs, etc).
Figure 2RKB nucleotide base pairs with varying sub-edge interactions. Illustration of the RDF-based representation of molecular structure obtained from PDB files and from structure feature analysis of MC-Annotate. (A) Structure model 5 of PDB 1AJU is about a nucleotide base pair that is composed of a sub-edge interaction between the Watson-Watson sub-edge of the guanine residue at position 34 of chain A and the O2’ sub-edge of the guanine residue at position 36 of chain A. (B) Structure model 10 of PDB 1AJU is about a nucleotide base pair between guanine residue at position 34 in chain A and guanine residue at position 36 in chain A, which is composed of two sub-edge interactions – a Watson-Watson sub-edge and O2’ sub-edge, as well as a Sugar-sugar sub-edge and the O2’ sub-edge.