| Literature DB >> 23479348 |
Jon Ison1, Matús Kalas, Inge Jonassen, Dan Bolser, Mahmut Uludag, Hamish McWilliam, James Malone, Rodrigo Lopez, Steve Pettifer, Peter Rice.
Abstract
MOTIVATION: Advancing the search, publication and integration of bioinformatics tools and resources demands consistent machine-understandable descriptions. A comprehensive ontology allowing such descriptions is therefore required.Entities:
Mesh:
Year: 2013 PMID: 23479348 PMCID: PMC3654706 DOI: 10.1093/bioinformatics/btt113
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
The main EDAM sub-ontologies
| Operation | A function that processes a set of inputs and results in a set of outputs, or associates arguments (inputs) with values (outputs). Special cases are: (a) An operation that consumes no input (has no input arguments). Such operation is either a constant function, or an operation depending only on the underlying state. (b) An operation that may modify the underlying state but has no output. (c) The singular-case operation with no input or output, that still may modify the underlying state | Singular, bioinformatics-specific operations that are functions of tools, workflows or scripts, or can be performed manually | RNA structure prediction Protein docking Data retrieval | 558 |
| Data | Information, represented in an information artefact (data record) that is ‘understandable’ by dedicated computational tools that can use the data as input or produce it as output | Types of data that are relevant in bioinformatics, commonly used as inputs, outputs or intermediate data of analyses, or provided by databases and portals | Sequence Sequence record Phylogenetic tree UniProt accession | 1140 |
| Identifier (under Data) | A text token, number or something else that identifies an entity, but which may not be persistent (stable) or unique (the same identifier may identify multiple things) | Types of identifiers that identify biological or computational entities; including resource-specific data accessions. Several identifier concepts in EDAM include regular expressions and examples | UniProt accession EC number | 528 |
| Topic | A category denoting a rather broad domain or field of interest, of study, application, work, data or technology. Topics have no clearly defined borders between each other | Application domains of bioinformatics tools and resources; topics of research, studies or analyses; approaches, techniques and paradigms within—or directly related to—Bioinformatics | Sequence analysis Phylogenetics ontology | 209 |
| Format | A defined way or layout of representing and structuring data in a computer file, blob, string, message or elsewhere. | Data formats commonly used in—and specific to—Bioinformatics. Many format concepts in EDAM include references to their definition and documentation | BAM GVF SBML | 347 |
Note: The EDAM sub-ontologies contain common concepts specific—or directly related—to bioinformatics.
Fig. 1.Organization of the main EDAM sub-ontologies and the relations explicitly maintained between EDAM concepts
Types of relations defined in EDAM
| Not between EDAM concepts | A tool |
Note: Definitions, domains and ranges are present in the EDAM.owl file. EDAM relations apply between concepts and/or annotated entities.
Fig. 2.Sketches of information models for semantic annotations with EDAM. (a) A model for annotations of tools corresponding to the SAWSDL standard (Kopecky ). Standardizing an information model of tool metadata is, at least so far, out of scope of EDAM. (b) A similar model for annotations of data resources, used within DRCAT. Note that a query has always (implicitly) the function of Data retrieval. Defining an information standard for database metadata is within scope of the BioDBCore initiative (Gaudet )