Myles Byrne, Ivo FAC Fokkema, Owen Lancaster, Tomasz Adamusiak, Anni Ahonen-Bishopp, David Atlan, Christophe Béroud, Michael Cornell, Raymond Dalgleish, Andrew Devereau, George P Patrinos, Morris A Swertz, Peter EM Taschner, Gudmundur A Thorisson, Mauno Vihinen, Anthony J Brookes, Juha Muilu.
Abstract
BACKGROUND: Sharing of data about variation and the associated phenotypes is a critical need, yet variant information can be arbitrarily complex, making a single standard vocabulary elusive and re-formatting difficult. Complex standards have proven too time-consuming to implement.
Year: 2012 PMID: 23031277 PMCID: PMC3507772 DOI: 10.1186/1471-2105-13-254
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Glossary
| API | Application programming interface | |
| BSVM | Pioneering early LSDB integration standard. | See |
| Café Variome | Variation data publishing service | |
| EBNF | Extended Backus-Naur Form, a notation that expresses the grammar of a computer language. | |
| GEN2PHEN | EU project integrating genotype and phenotype data. | |
| GSVML | Genomic Sequence Variation Markup Language | See |
| HPO | Human Phenotype Ontology | |
| Jackson | Java JSON library | |
| Java | General-purpose programming language | |
| JAXB | Java Architecture for XML Binding, a Java XML binding library | |
| JSON | JavaScript Object Notation | |
| LSDB | Gene variant database, Locus Specific Database | |
| MAGE-TAB | A tab-delimited format for representing functional genomics data. | |
| MIRIAM | The MIRIAM Registry provides a set of online services for the generation of unique and perennial identifiers, in the form of URIs. | |
| MOLGENIS | Software generating infrastructure (databases, APIs, GUIs) for life science projects. | |
| Object Model | An abstract representation of a domain’s concepts, data, and relationships between these, used to design or generate software. | - |
| Observ-OM | A simple system to format and exchange observation data. | |
| ORCID | Open Researcher and Contributor ID | |
| PML/DVAR | An implementation of the PaGE-OM object model. | |
| RelaxNG | Schema definition language for use with XML. | |
| RDF | Resource Description Framework | |
| Schematron | High-level schema definition language for use with XML. | |
| SKOS | Simple Knowledge Organization System | |
| SO | Sequence Ontology | |
| UML | Unified Modeling Language | |
| VariO | Variation Ontology | |
| VCF | Variant Call Format | |
| XGAP | XGAP is an open and flexible object model for xQTL, GWL, GWA and mutagenesis data | |
| XML | eXtensible Markup Language | |
Figure 1. The VarioML object model is derived from Observ-OM (http://www.observ-om.org/wiki/ObservStart), with some modifications to simplify implementation. For example, Observable Feature (such as a phenotype or mutation name) and Observed Value (the existence of that phenotype or variation) are denormalized into a single XML element. This avoids unnecessary nesting of observation elements, which often have a one-to-one relationship in the XML implementation. Entities are composed into Observations, which have properties such as evidence codes, observation protocols and observation time. Associations between elements are drawn as single lines, where an asterisk denotes a 0-to-many multiplicity; for example, an Observation can have zero or more evidence codes. All entities also inherit from Annotatable the properties needed for database cross references and comments; here, the open arrow symbol denotes inheritance, or an is-a relationship.
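The relationships in this caption can be illustrated with a minimal Python sketch. It is not the VarioML schema or its Java API: the class and field names below (`Annotatable`, `Observation`, `db_xrefs`, `evidence_codes` and so on) are hypothetical, chosen only to mirror the caption's wording, and the sample values are placeholders.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Annotatable:
    """Properties every entity inherits (the open-arrow, is-a relationship)."""
    db_xrefs: List[str] = field(default_factory=list)   # database cross references
    comments: List[str] = field(default_factory=list)


@dataclass
class Observation(Annotatable):
    """Observable Feature and Observed Value denormalized into one record."""
    feature: str = ""                 # e.g. a phenotype or mutation name
    value: Optional[str] = None       # e.g. whether the phenotype or variation exists
    evidence_codes: List[str] = field(default_factory=list)  # 0-to-many (asterisk)
    protocols: List[str] = field(default_factory=list)       # observation protocols
    observation_time: Optional[str] = None


# Placeholder values, assumed for illustration only.
obs = Observation(feature="hypothetical phenotype", value="present")
obs.evidence_codes.append("hypothetical-evidence-code")
obs.db_xrefs.append("hypothetical-db:0000")
```

Keeping feature and value on one record, rather than nesting a value object inside a feature object, is the denormalization the caption describes.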
Figure 2. The different VarioML elements of the data submitted are flanked by the corresponding XML tags and explained in the text.
Figure 3. The VarioML elements describing the effect of an AIRE variant at the transcript and protein levels are flanked by the corresponding XML tags and explained in the text.
Figure 4. XML elements are mapped to JSON objects using JAXB and Jackson annotations via VarioML's Java API. Repeating XML elements are pluralised and become JSON arrays. Because JSON has no equivalent to XML attributes, an XML attribute name can clash with the name of an inner element; in these cases, the JSON name for the XML attribute is changed. Otherwise, mapping VarioML from XML to JSON is a direct transformation of the data structure.
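The mapping rules in this caption can be sketched in a few lines of Python using only the standard library. This is an illustration of the rules, not VarioML's Java API, which relies on JAXB and Jackson annotations. The element names in the sample, the `_attr` suffix used to resolve name clashes, the naive "add an s" pluralisation and the `to_json` helper are all assumptions made for this example.

```python
import json
import xml.etree.ElementTree as ET


def to_json(elem, repeatable):
    """Convert an XML element into a JSON-compatible Python value.

    `repeatable` names the elements that may repeat; in a real binding this
    would come from the schema or from annotations (assumed here).
    """
    child_tags = {child.tag for child in elem}
    obj = {}
    # Attributes become plain keys; rename on a clash with an inner element.
    for name, value in elem.attrib.items():
        key = name + "_attr" if name in child_tags else name
        obj[key] = value
    # Repeating elements are pluralised and collected into arrays.
    for child in elem:
        converted = to_json(child, repeatable)
        if child.tag in repeatable:
            obj.setdefault(child.tag + "s", []).append(converted)
        else:
            obj[child.tag] = converted
    text = (elem.text or "").strip()
    if text and not obj:
        return text            # leaf element holding only text
    if text:
        obj["value"] = text    # hypothetical key for text alongside other content
    return obj


# Hypothetical sample; not taken from the VarioML schema.
SAMPLE = """
<variant name="attribute-name">
  <name>element-name</name>
  <evidence_code>code-1</evidence_code>
  <evidence_code>code-2</evidence_code>
</variant>
""".strip()

result = to_json(ET.fromstring(SAMPLE), repeatable={"evidence_code"})
print(json.dumps(result, indent=2))
```

In the sample, the `name` attribute clashes with the inner `name` element, so the attribute is emitted under a different key, while the two `evidence_code` elements are collected into a single `evidence_codes` array.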