| Literature DB >> 19426460 |
Cecilia N Arighi1, Hongfang Liu, Darren A Natale, Winona C Barker, Harold Drabkin, Judith A Blake, Barry Smith, Cathy H Wu.
Abstract
BACKGROUND: The Protein Ontology (PRO) is designed as a formal and principled Open Biomedical Ontologies (OBO) Foundry ontology for proteins. The components of PRO extend from a classification of proteins on the basis of evolutionary relationships at the homeomorphic level to the representation of the multiple protein forms of a gene, including those resulting from alternative splicing, cleavage and/or post-translational modifications. Focusing specifically on the TGF-beta signaling proteins, we describe the building, curation, usage and dissemination of PRO.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19426460 PMCID: PMC2679403 DOI: 10.1186/1471-2105-10-S5-S3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1PRO framework. Current working model and a subset of the possible connections to other ontologies.
Figure 2Ontology for the TGF-beta 1 protein. A. Snapshot of the ontology (partial DAG view) in OBO Edit 1.1 including terms representing ProEvo and ProForm. Terms in red boxes are described in panel B. B. Detailed example of the ontology and annotation displayed together for convenience. The above is a partial view, not all forms are listed, and only key annotations are shown.
Example annotation in the PRO association file (PAF).
| c-myc isoform 1 acetylated 1 | has_function | GO:0003700 | transcription factor activity | 16126174 | P01106-1 | Lys-143/Lys-157/Lys-275/Lys-317/Lys-323/Lys-371, MOD:00394 | ||
| c-myc isoform 1 acetylated 1 | located_in | GO:0005634 | nucleus | 16126174 | P01106-1 | Lys-143/Lys-157/Lys-275/Lys-317/Lys-323/Lys-371, MOD:00394 | ||
| c-myc isoform 1 acetylated 1 | part_of | GO:0005667 | transcription factor complex | 16126174 | P01106-1 | Lys-143/Lys-157/Lys-275/Lys-317/Lys-323/Lys-371, MOD:00394 | ||
| c-myc isoform 1 acetylated 1 | has_modification | MOD:00394 | acetylated residue | 16126174 | P01106-1 | Lys-143/Lys-157/Lys-275/Lys-317/Lys-323/Lys-371, MOD:00394 | ||
| c-myc isoform 1 glycosylated 1 | has_function | GO:0003700 | transcription factor activity | 11904304 | P01106-1 | Thr-58, MOD:00806 | ||
| c-myc isoform 1 glycosylated 1 | participates_in | GO:0006357 | regulation of transcription from RNA polymerase II promoter | 11904304 | P01106-1 | Thr-58, MOD:00806 | ||
| c-myc isoform 1 glycosylated 1 | has_modification | MOD:00806 | O-(N-acetylaminoglucosyl)-L-threonine | 11904304 | P01106-1 | Thr-58, MOD:00806 | ||
| c-myc isoform 1 phosphorylated 1 | has_function | GO:0003700 | transcription factor activity | 11018017 | P01108-1 | Ser-62, MOD:00046 | ||
| c-myc isoform 1 phosphorylated 1 | located_in | GO:0005634 | nucleus | 11018017 | P01108-1 | Ser-62, MOD:00046 | ||
| c-myc isoform 1 phosphorylated 1 | part_of | GO:0005667 | transcription factor complex | 11018017 | P01108-1 | Ser-62, MOD:00046 | ||
| c-myc isoform 1 phosphorylated 1 | participates_in | GO:0006357 | regulation of transcription from RNA polymerase II promoter | 11018017 | P01108-1 | Ser-62, MOD:00046 | ||
| c-myc isoform 1 phosphorylated 1 | participates_in | GO:0008283 | cell proliferation | 11018017 | P01108-1 | Ser-62, MOD:00046 | ||
| c-myc isoform 1 phosphorylated 1 | participates_in | GO:0055072 | iron ion homeostasis | 11018017 | P01108-1 | Ser-62, MOD:00046 | ||
| c-myc isoform 1 phosphorylated 1 | has_modification | MOD:00046 | O-phospho-L-serine | 11018017 | P01108-1 | Ser-62, MOD:00046 | ||
| c-myc isoform 1 phosphorylated 2 | NOT | has_function | GO:0003700 | transcription factor activity | 7623799 | P01106-1 | Ser-62, MOD:00046|Thr-58, MOD:00047 | |
| c-myc isoform 1 phosphorylated 2 | located_in | GO:0005634 | nucleus | 14563837 | P01108-1 | Ser-62, MOD:00046|Thr-58, MOD:00047 | ||
| c-myc isoform 1 phosphorylated 2 | has_modification | MOD:00046 | O-phospho-L-serine | 14563837 | P01108-1 | Ser-62, MOD:00046|Thr-58, MOD:00047 | ||
| c-myc isoform 1 phosphorylated 2 | has_modification | MOD:00047 | O-phospho-L-threonine | 14563837 | P01108-1 | Ser-62, MOD:00046|Thr-58, MOD:00047 | ||
| c-myc isoform 1 phosphorylated 3 | located_in | GO:0005634 | nucleus | 12676581 | P01106-1 | Ser-62/Ser-71, MOD:00046 | ||
| c-myc isoform 1 phosphorylated 3 | participates_in | GO:0045766 | positive regulation of angiogenesis | 12676581 | P01106-1 | Ser-62/Ser-71, MOD:00046 | ||
| c-myc isoform 1 phosphorylated 3 | has_modification | MOD:00046 | O-phospho-L-serine | 12676581 | P01106-1 | Ser-62/Ser-71, MOD:00046 | ||
| smad6 isoform 2 | NOT | has_part | Pfam:PF03165 | MH1 domain | 11284962 | O43541-2 |
Selected columns are shown. Note that the last two columns are a tuple that define the sub-type of the class.
Figure 3PRO for cross-species comparison. A. Ontology and annotation for smad2 isoform 2 protein. B. Multiple sequence alignment of the N- and C-termini of smad2 protein isoform 2 orthologs containing the modified sites.
Figure 4PRO mapping to the Reactome TGF-beta signaling pathway. The pathway described herein corresponds to REACT_6844. Each step in the pathway is described by a Reactome event ID. Bold PRO IDs indicate objects that undergo some modification that is relevant for function (the modified form is underlined). The names in the boxes represent the names of the complexes or molecules involved in the processes, not of the PRO terms.
Statistics on Protein Ontology releases. PRO ontology coverage of proteins, and ProEvo and ProForm terms
| 1.0 | 2.0 | 3.0 | |
| Proteins | 79 | 149 | 289 |
| PRO terms | 667 | 1023 | 1269 |
| ProEvo terms | 119 | 223 | 397 |
| ProForm terms | 548 | 800 | 872 |
PRO annotation coverage by GO terms in release 3.0
| GO Molecular Function | has_function | 295 | PRO:000000650 smad 5 isoform 1 phosphorylated 1 | GO:0046332 SMAD binding |
| NOT has_function | 55 | PRO:000000478 smad 5 isoform 2 | GO:0046332 SMAD binding | |
| GO Cellular Component Complex | part_of | 62 | PRO:000000178 RING-box protein 2 isoform 1 | GO:0000151 ubiquitin ligase complex |
| NOT part_of | 5 | PRO:000000179 RING-box protein 2 isoform 2 | GO:0000151 ubiquitin ligase complex | |
| GO Cellular Component | located_in | 240 | PRO:000000457 noggin isoform 1 cleaved 1 | GO:0005615 extracellular space |
| GO Biological Process | participates_in | 274 | PRO:000000086 chordin isoform 1 | GO:0001501 skeletal development |
| PSI-MOD | has_modification | 172 | PRO:000000650 smad 2 isoform 1 phosphorylated 1 | MOD:00046 O-phospho- L-serine |
| SO | has_agent | 305 | TGF-beta receptor type-1 sequence variant 11 | SO:1000097 mutation causing amino acid deletion |
* Number of PRO terms under the umbrella of the corresponding ontology.