| Literature DB >> 23327938 |
Prudence Mutowo-Meullenet1, Rachael P Huntley, Emily C Dimmer, Yasmin Alam-Faruque, Tony Sawford, Maria Jesus Martin, Claire O'Donovan, Rolf Apweiler.
Abstract
The Gene Ontology (GO) is the de facto standard for the functional description of gene products, providing a consistent, information-rich terminology applicable across species and information repositories. The UniProt Consortium uses both manual and automatic GO annotation approaches to curate UniProt Knowledgebase (UniProtKB) entries. The selection of a protein set prioritized for manual annotation has implications for the characteristics of the information provided to users working in a specific field or interested in particular pathways or processes. In this article, we describe an organelle-focused, manual curation initiative targeting proteins from the human peroxisome. We discuss the steps taken to define the peroxisome proteome and the challenges encountered in defining the boundaries of this protein set. We illustrate with the use of examples how GO annotations now capture cell and tissue type information and the advantages that such an annotation approach provides to users. Database URL: http://www.ebi.ac.uk/GOA/ and http://www.uniprot.org.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23327938 PMCID: PMC3548334 DOI: 10.1093/database/bas062
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Capturing cell- and tissue-type information in a protein GO annotation using the annotation extension field
| Gene name | GO terms | Evidence | Annotation extension | PubMed identifier |
|---|---|---|---|---|
| MAVS | GO:0005777 peroxisome | IDA | Part_of (CL:0000182) hepatocyte | 20451243 |
| (UniProtKB Q7Z434) | ||||
| EPHX2 | GO:0005777 peroxisome | IDA | Part_of (Uberon :0005151) metanephric proximal tubule | 16314446 |
| (UniProtKB P34913) | ||||
| POMC | GO:0005782 peroxisomal matrix | IDA | Part of (CL:0002559) hair follicle cell | 20810565 |
| (UniProtKB P01189) | ||||
| FAR1 | GO:0005777 peroxisome | IDA | Occurs in (CL:0000057) fibroblast | 20071337 |
| (UniProtKB Q8WVX9) |
Figure 1A protein–protein interaction map of the human peroxisome. The peroxisome proteins and their interacting partners comprised a set of 421 proteins with a total of 408 binary interactions; peroxisomal proteins are shown as triangles and non-peroxisomal proteins as circles. Protein–protein interactions are depicted as white edges. Multiple edges between two proteins represent interactions that have been identified by more than one approach. The generic GO slim was used to identify terms in the proteome. GO terms that were common to all proteins in a cluster are shown as the cluster label.
Figure 2Biological process GO enrichment of 88 human peroxisome proteins before the focused manual peroxisome protein annotation effort. The circles represent enriched GO terms. The size of circle is proportional to the number of proteins containing the biological process term.
Figure 3Biological process GO enrichment of 88 human peroxisome proteins after the focused manual peroxisome protein annotation effort. The circles represent enriched GO terms. The size of circle is proportional to the number of proteins containing the biological process term. Circles with different colours represent proteins that contain an intersection of GO terms.
Figure 4Comparison between biological processes enriched only in human peroxisome proteins (A) and those enriched only in the yeast peroxisomal protein set (B).