| Literature DB >> 25052702 |
Marcus C Chibucos1, Christopher J Mungall2, Rama Balakrishnan2, Karen R Christie2, Rachael P Huntley2, Owen White3, Judith A Blake2, Suzanna E Lewis2, Michelle Giglio3.
Abstract
The Evidence Ontology (ECO) is a structured, controlled vocabulary for capturing evidence in biological research. ECO includes diverse terms for categorizing evidence that supports annotation assertions including experimental types, computational methods, author statements and curator inferences. Using ECO, annotation assertions can be distinguished according to the evidence they are based on such as those made by curators versus those automatically computed or those made via high-throughput data review versus single test experiments. Originally created for capturing evidence associated with Gene Ontology annotations, ECO is now used in other capacities by many additional annotation resources including UniProt, Mouse Genome Informatics, Saccharomyces Genome Database, PomBase, the Protein Information Resource and others. Information on the development and use of ECO can be found at http://evidenceontology.org. The ontology is freely available under Creative Commons license (CC BY-SA 3.0), and can be downloaded in both Open Biological Ontologies and Web Ontology Language formats at http://code.google.com/p/evidenceontology. Also at this site is a tracker for user submission of term requests and questions. ECO remains under active development in response to user-requested terms and in collaborations with other ontologies and database resources. Database URL: Evidence Ontology Web site: http://evidenceontology.org.Entities:
Mesh:
Year: 2014 PMID: 25052702 PMCID: PMC4105709 DOI: 10.1093/database/bau075
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Flow of protein functional annotation and associated evidence, as represented by ECO. Intricate relationships and inputs typically represented in greater detail by other ontologies such as OBI tend to be collapsed into general summary classes in ECO. Top row (A–D) begins with (A) researcher performing a gene expression experiment and resulting data and analysis, (B) which are interpreted by the researcher and eventually published as conclusions. (C) A biological curator reads the paper and makes an association between a protein and a term from a controlled vocabulary such as the GO. (D) The annotation is deposited at a sequence repository along with associated evidence and assertion method. Bottom row (E–G) depicts two methods for asserting annotations based on similarity evidence. (E) Either a person or a computer compares a protein sequence with sequences in a database, which results in a hit to highly similar sequence and associated alignment data. (F) The machine or the human interprets the alignment, considering cutoff thresholds and other parameters. Two proteins of sufficient similarity are determined to share function, and protein 1 of unknown function will be assigned the function of protein 2 of known function. (G) The annotation is deposited at a sequence repository along with associated evidence and assertion method.
Groups and resources using the ECO or with plans to implement ECO
| Group/resource | URL |
|---|---|
| AgBase | |
| AmiGO 2 | |
| The | |
| Ascidian Network for | |
| Bgee—a dataBase for Gene Expression Evolution | |
| BioModels Database | |
| BioSapiens Network (legacy project) | |
| European Bioinformatics Institute (EMBL-EBI) | |
| The Gene Ontology (GO) | |
| IntAct Complex Portal | |
| ISA Software Suite | |
| Mouse Genome Informatics (MGI) | |
| Neural ElectroMagnetic Ontologies (NEMO) | nemo.nic.uoregon.edu |
| The Ontology for Biomedical Investigations (OBI) | |
| The Ontology of Microbial Phenotypes (OMP) | |
| Phylogenetic Annotation and INference Tool (PAINT) | |
| PhenoScape | |
| PomBase | |
| Protein Information Resource (PIR) | |
| RNAcentral | |
| Structure integration with function, taxonomy and sequence (SIFTS) (uses GO codes) | |
| Swiss Institute of Bioinformatics (SIB) | |
| The Universal Protein Resource (UniProt) | |
| UniProt-Gene Ontology Annotation (UniProt-GOA) project | |
| Variation Ontology Annotation Tool VariOtator | |
| ZOOMA |
The table includes users who have contacted ECO with a specific development request, users of GO evidence codes transitioning to using ECO and other resources collaborating with or using ECO. While most of the users depicted use ECO to support structured queries (i.e. to group annotations), some unique examples are discussed in the text. For convenience, the ECO instances for four multi-ontology visualization tools are also listed.
Figure 2.Selected ECO terms representing two major categories of evidence. The ECO class ‘experimental evidence’ (ECO:0000006) and its subclasses are circled with blue. The ECO class ‘similarity evidence’ (ECO:0000041) and some representative subclasses are circled with red.
Figure 3.‘Evidence’ and ‘assertion method’ root classes and an internal cross product term. The evidence hierarchy (blue text) is disjoint_from (grey arrow) the assertion method hierarchy (red text). The cross product term ‘BLAST evidence used in manual assertion’ (purple text) descends from the evidence hierarchy (‘I’ = is_a), and the term’s logical definition allows a reasoner to infer the used_in relationship (dotted arrow) to ‘manual assertion’.