| Literature DB >> 19429688 |
Choong-Hyun Sun1, Min-Sung Kim, Youngwoong Han, Gwan-Su Yi.
Abstract
COFECO is a web-based tool for a composite annotation of protein complexes, KEGG pathways and Gene Ontology (GO) terms within a class of genes and their orthologs under study. Widely used functional enrichment tools using GO and KEGG pathways create large list of annotations that make it difficult to derive consolidated information and often include over-generalized terms. The interrelationship of annotation terms can be more clearly delineated by integrating the information of physically interacting proteins with biological pathways and GO terms. COFECO has the following advanced characteristics: (i) The composite annotation sets of correlated functions and cellular processes for a given gene set can be identified in a more comprehensive and specified way by the employment of protein complex data together with GO and KEGG pathways as annotation resources. (ii) Orthology based integrative annotations among different species complement the defective annotations in an individual genome and provide the information of evolutionary conserved correlations. (iii) A term filtering feature enables users to collect the specified annotations enriched with selected function terms. (iv) A cross-comparison of annotation results between two different datasets is possible. In addition, COFECO provides a web-based GO hierarchical viewer and KEGG pathway viewer where the enrichment results can be summarized and further explored. COFECO is freely accessible at http://piech.kaist.ac.kr/cofeco.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19429688 PMCID: PMC2703949 DOI: 10.1093/nar/gkp331
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Statistics of annotations and proteins in annotation resources of COFECO
| Organisms | Proteins | Annotations | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Complex | KEGG | GO BP | GO MF | GO CC | Complex | KEGG | GO BP | GO MF | GO CC | |
| 5670 | 5313 | 33 714 | 39 440 | 34 231 | 2118 | 199 | 3584 | 2320 | 614 | |
| 5246 | 1237 | 5147 | 5020 | 5761 | 8049 | 110 | 1556 | 1484 | 511 | |
| 4123 | 5507 | 36 780 | 40 115 | 36 594 | 2858 | 205 | 3082 | 2612 | 684 | |
| 2957 | 2674 | 9407 | 10 729 | 9837 | 1850 | 197 | 2059 | 1858 | 463 | |
| 1820 | 604 | 4379 | 5058 | 4345 | 1287 | 121 | 894 | 984 | 289 | |
| 1606 | 1947 | 19 694 | 25 962 | 13 380 | 619 | 132 | 1043 | 1240 | 269 | |
| 1583 | 2216 | 13 862 | 17 892 | 10 666 | 865 | 135 | 2196 | 1545 | 439 | |
| 899 | 966 | 3553 | 3429 | 4790 | 526 | 106 | 1153 | 1082 | 370 | |
| 806 | 1012 | 9891 | 11651 | 6456 | 600 | 122 | 1163 | 969 | 237 | |
aThe numbers of annotations for complexes are the number of complexes for each genome. Among the whole 19 064 complexes, 63% have their own function annotations or complex names and 37%, which are from high-throughput co-complex data in general, do not have complex name. In this system, complex itself is used as an annotation unit representing functionally correlated group.
Figure 1.Screenshot depicting the results of the analysis of 85 differentially expressed genes (DEGs) in human testis tissue. (A) Composite annotation results from the function annotation of protein complexes, KEGG pathways and GO biological processes. (B) Composite annotation results of an ortholog list of 85 DEGs. (C) and (D) Graphical GO views of A and B, respectively. (E) Inset of KEGG cell-cycle pathway, which involves co-complexed proteins of cell cycle kinase complex. (F) Inset of KEGG cell-cycle pathway, which involves co-complexed proteins of kinetochore. Red outlined rectangles in Figure 1E indicate co-complexed proteins of the protein complex named as ‘centrosome containing phosphorylated Nlp’, which is actually ranked the third annotation, not displayed in Figure 1B. Yellow boxes indicate statistically significant genes in enriched composite annotations and pink boxes indicate co-complexed proteins except for statistically significant genes. Dotted lines indicate same genes.