| Literature DB >> 24885854 |
Rachael P Huntley, Midori A Harris, Yasmin Alam-Faruque, Judith A Blake, Seth Carbon, Heiko Dietze, Emily C Dimmer, Rebecca E Foulger, David P Hill, Varsha K Khodiyar, Antonia Lock, Jane Lomax, Ruth C Lovering, Prudence Mutowo-Meullenet, Tony Sawford, Kimberly Van Auken, Valerie Wood, Christopher J Mungall1.
Abstract
BACKGROUND: The Gene Ontology project integrates data about the function of gene products across a diverse range of organisms, allowing the transfer of knowledge from model organisms to humans, and enabling computational analyses for interpretation of high-throughput experimental and clinical data. The core data structure is the annotation, an association between a gene product and a term from one of the three ontologies comprising the GO. Historically, it has not been possible to provide additional information about the context of a GO term, such as the target gene or the location of a molecular function. This has limited the specificity of knowledge that can be expressed by GO annotations.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24885854 PMCID: PMC4039540 DOI: 10.1186/1471-2105-15-155
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Representation of annotations made using the core GO annotation model. Gene products can be annotated to several GO terms, and any GO term can be used to annotate any number of gene products, but the annotations remain independent. The stars indicate an annotation of a gene product to the GO term and each colour represents a single gene product. Using this simple GO annotation model, it is not clear from the annotations shown in which Cellular Component each of the protein activities are performed. For example, microsomal glutathione S-transferase 1 is represented by the red star and can perform two activities; glutathione transferase activity and glutathione peroxidase activity. It is found in three Cellular Components, the mitochondrion, endoplasmic reticulum and peroxisomal membrane, but from these annotations the knowledge that the glutathione transferase activity is performed in the mitochondrion [7] cannot be found. For clarity not all annotations of each gene product are shown nor all terms between the specified terms and the root node.
Figure 2Graphical web view of the annotation extension relations. (a) A graphical view of the relations was created to assist curators in selecting the appropriate relation for curation [14]; (b) A user can zoom into a particular area of the graph; (c) Relations can be clicked to view information, such as which GO terms the relation can be used with and which identifiers may be used with the relation.
Most commonly used relationships for annotation extension statements and examples of their usage
| part_of | |
| occurs_in | Mouse opsin-4; G-protein coupled photoreceptor activity; occurs_in(CL:0000740 |
| happens_during | |
| has_regulation_target | Human suppressor of fused homolog SUFU; negative regulation of transcription factor import into nucleus; has_regulation_target(UniProtKB:P08151 |
| has_input | |
| has_direct_input | Human WNK4; chloride channel inhibitor activity; has_direct_input(UniProtKB:Q7LBE3 |
Molecular relations take an entity such as a gene, gene product, complex or chemical as an argument; contextual relations take an entity such as a cell type, anatomy term, developmental stage or a GO term as an argument. Entity names in italics are shown for clarity and are not part of the annotation extension format.
Figure 3The deneddylation activity of Nep1. (a) The experimental data reported in [18] is interpreted as: Nep1 is capable of deneddylating the cullins Cul1, Cul3 and Pcu4. (b) and (c) Graphical representation of Nep1 annotations using (b) the core GO annotation model and (c) the extended GO annotation model.
Figure 4Display of annotation extension data in PomBase for Nep1 gene product. Annotation of the observation that Nep1 deneddylates the three cullins Cul1, Cul3 and Pcu4 [18] requires one annotation with three separate expressions in the annotation extension. Note that more human-readable text has been substituted for the annotation extension relation names for display purposes in PomBase. The underlying data retain the relation names, and the mapping between relation names and display text is available on the PomBase website [19].
Extended annotations categorized by species
| Mus musculus | 409098 | 25209 | 6.2 |
| Homo sapiens | 219258 | 9042 | 4.1 |
| Saccharomyces cerevisiae | 53750 | 2713 | 5.0 |
| Schizosaccharomyces pombe | 29049 | 1902 | 6.5 |
| Caenorhabditis elegans | 27488 | 1102 | 4.0 |
| Arabidopsis thaliana | 101936 | 503 | 0.5 |
| Rattus norvegicus | 72280 | 477 | 0.7 |
| Escherichia coli | 11658 | 426 | 3.7 |
| Dictyostelium discoideum | 19278 | 228 | 1.2 |
| Drosophila melanogaster | 109886 | 214 | 0.2 |
The number of extended annotations is shown compared to the total number of manual annotations for each species. Calculated with the statistics from the UniProt-GOA database [3] on 21 November 2013.