| Literature DB >> 20462862 |
Steve Pettifer1, Jon Ison, Matús Kalas, Dave Thorne, Philip McDermott, Inge Jonassen, Ali Liaquat, José M Fernández, Jose M Rodriguez, David G Pisano, Christophe Blanchet, Mahmut Uludag, Peter Rice, Edita Bartaseviciute, Kristoffer Rapacki, Maarten Hekkelman, Olivier Sand, Heinz Stockinger, Andrew B Clegg, Erik Bongcam-Rudloff, Jean Salzemann, Vincent Breton, Teresa K Attwood, Graham Cameron, Gert Vriend.
Abstract
The EMBRACE (European Model for Bioinformatics Research and Community Education) web service collection is the culmination of a 5-year project that set out to investigate issues involved in developing and deploying web services for use in the life sciences. The project concluded that in order for web services to achieve widespread adoption, standards must be defined for the choice of web service technology, for semantically annotating both service function and the data exchanged, and a mechanism for discovering services must be provided. Building on this, the project developed: EDAM, an ontology for describing life science web services; BioXSD, a schema for exchanging data between services; and a centralized registry (http://www.embraceregistry.net) that collects together around 1000 services developed by the consortium partners. This article presents the current status of the collection and its associated recommendations and standards definitions.Entities:
Mesh:
Year: 2010 PMID: 20462862 PMCID: PMC2896104 DOI: 10.1093/nar/gkq297
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The problems identified in Hull et al. (7) and their corresponding solutions, as developed by the EMBRACE consortium
| Problem | EMBRACE solution |
|---|---|
| Inconsistent use of technology | EMBRACE Technology Recommendation Documents. Provide guidance for producers and consumers on selecting appropriate technologies and standards for developing life science web services. |
| Limited service metadata | EDAM Ontology. A vocabulary of terms and relations suitable for annotating the behaviour, inputs and outputs of life science services, and for associating meaning with data exchanged between services. |
| Incompatible interfaces | BioXSD data interchange format definition. A mechanism for unifying the exchange of data between bioinformatics services. |
| Unreliable services | EMBRACE Active Registry. A mechanism for finding services and for monitoring their performance. |
Examples of terms and their categories from the EDAM ontology
| Category | Example terms |
|---|---|
| Field | Sequence analysis; Alignment; Sequencing; Microarrays |
| Entity | Gene; Amino acid; Residue cluster; Active site; Atom–atom-interaction |
| Tool function | Sequence alignment; Pairwise sequence alignment, Sequence database search |
| Data resource | PDB database ( |
| Data type | Sequence alignment; Sequence record; Comparison matrix; Phylogenetic tree |
| Identifier | PDB code; UniProtKB ( |
Figure 1.An example of bioinformatics workflow, illustrating the flow of data (red ovals) through various services (blue rectangles). Without a common exchange format such as BioXSD, each edge in this graph would also require additional ‘converter' (or ‘shim’) processes to transform the data into the input formats required by the main services. This would more than double the technical complexity of the workflow for no additional scientific advantage.
Figure 2.A pie chart showing the percentage of web services reported by the EMBRACE registry as belonging to various high-level categories.