| Literature DB >> 30395331 |
.
Abstract
The Gene Ontology resource (GO; http://geneontology.org) provides structured, computable knowledge regarding the functions of genes and gene products. Founded in 1998, GO has become widely adopted in the life sciences, and its contents are under continual improvement, both in quantity and in quality. Here, we report the major developments of the GO resource during the past two years. Each monthly release of the GO resource is now packaged and given a unique identifier (DOI), enabling GO-based analyses on a specific release to be reproduced in the future. The molecular function ontology has been refactored to better represent the overall activities of gene products, with a focus on transcription regulator activities. Quality assurance efforts have been ramped up to address potentially out-of-date or inaccurate annotations. New evidence codes for high-throughput experiments now enable users to filter out annotations obtained from these sources. GO-CAM, a new framework for representing gene function that is more expressive than standard GO annotations, has been released, and users can now explore the growing repository of these models. We also provide the 'GO ribbon' widget for visualizing GO annotations to a gene; the widget can be easily embedded in any web page.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30395331 PMCID: PMC6323945 DOI: 10.1093/nar/gky1055
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.GO structure. (A) Graphical representation of relationships between terms: black lines represent is_a and blue lines represent part_of (representation obtained from https://www.ebi.ac.uk/QuickGO/term/GO:0060887). (B) Equivalence axiom for the term ‘GO:0060887: limb epidermis development’, as displayed in Protégé (12).
Number of experimental annotations in the GO knowledgebase, 5 September 2018 release (doi:10.5281/zenodo.1410625)
| Protein binding, EXP | Molecular function EXP, excluding protein binding | Cellular component EXP | Biological process EXP | |
|---|---|---|---|---|
|
| 83589 (+158%) | 29999 (+26%) | 41341 (+13%) | 47230 (+22%) |
|
| 12990 (+49%) | 14380 (+11%) | 28262 (+25%) | 67094 (+13%) |
|
| 4329 (+2%) | 10879 (−9%) | 15693 (+4%) | 27241 (−1%) |
|
| 509 (+30%) | 1756 (+15%) | 1087 (+16%) | 21635 (+20%) |
|
| 1516 (+33%) | 5694 (+15%) | 10803 (+3%) | 30762 (+1%) |
|
| 2993 (+13%) | 2482 (+13%) | 5245 (+8%) | 14511 (+24%) |
|
| 690 (+32%) | 1081 (+15%) | 2738 (+30%) | 4149 (+14%) |
|
| 168 (+58%) | 8886 (+8%) | 17456 (+4%) | 20194 (+14%) |
|
| 2076 (+52%) | 4636 (+42%) | 12184 (+8%) | 5651 (+11%) |
|
| 13074 (+113%) | 8344 (+14%) | 25486 (+7%) | 25223 (+12%) |
|
| 3602 (+57%) | 6006 (+20%) | 4171 (+7%) | 5756 (+5%) |
Note that for the molecular function annotations, we present annotations to ‘GO:0005515 protein binding’ separately from other GO:0003674 molecular functions, as these GO:0005515 annotations are used differently than other annotations (the class itself is not very informative, but each annotation includes additional information about the specific binding partner). The number of annotations for the main species annotated by the GOC are shown, and the percentage change relative to the 2016 update is indicated in parentheses.
Figure 2.The molecular function branch, before and after refactoring. The term marked with an ‘x’ in the left-hand panel has been obsoleted. Terms moved (assigned to a new parent) are indicated by arrows. New terms (right panel) are marked with ‘NEW’. 1The class label ‘electron carrier activity’ was changed to ‘electron transfer activity’.
Figure 3.Current structure of the ‘GO:0140110 transcription regulator activity’ branch of the ontology.
Subsets maintained in GO
| GO subsets | |
|---|---|
| Generic GO subset | |
| GO slim AGR | |
| GO do not annotate list | |
| GO do not manually annotate list | |
|
| |
|
|
|
| Aspergillus subset | Aspergillus Genome Data |
| Candida albicans | Candida Genome Database |
| Chembl Drug Target subset | ChEMBL |
| FlyBase Ribbon slim | FlyBase |
| Metagenomics subset | EBI Metagenomics group |
| Mouse GO slim | MGI |
| Plant subset | The Arabidopsis Information Resource |
| Protein Information Resource subset | PIR |
| Schizosaccharomyces pombe subset | PomBase |
| Synapse GO slim | SynGO |
| Yeast subset | Saccharomyces Genome Database |
Figure 4.GO ribbon representation. Darker boxes indicate terms with the most annotations; white boxes represent terms that are not annotated for this protein (Mus musculus Sox7, MGI:98369). Screenshot obtained from https://www.alliancegenome.org/gene/MGI:98369.