| Literature DB >> 22074332 |
Abstract
BACKGROUND: Current approaches of predicting protein functions from a protein-protein interaction (PPI) dataset are based on an assumption that the available functions of the proteins (a.k.a. annotated proteins) will determine the functions of the proteins whose functions are unknown yet at the moment (a.k.a. un-annotated proteins). Therefore, the protein function prediction is a mono-directed and one-off procedure, i.e. from annotated proteins to un-annotated proteins. However, the interactions between proteins are mutual rather than static and mono-directed, although functions of some proteins are unknown for some reasons at present. That means when we use the similarity-based approach to predict functions of un-annotated proteins, the un-annotated proteins, once their functions are predicted, will affect the similarities between proteins, which in turn will affect the prediction results. In other words, the function prediction is a dynamic and mutual procedure. This dynamic feature of protein interactions, however, was not considered in the existing prediction algorithms.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22074332 PMCID: PMC3224793 DOI: 10.1186/1471-2105-12-437
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Recall chart of three algorithms.
Figure 2Precision chart of three algorithms.
Figure 3F-value chart of three algorithms.
Figure 4Recall-Precision chart of three algorithms using original evaluation metrics in cross-validation experiments.
Figure 5Recall-Precision chart of three algorithms using new evaluation metrics in cross-validation experiments.
Figure 6Recall-Precision chart for Affinity-MS datasets.
Figure 7Recall-Precision chart for Two-hybrid datasets.
Sample prediction results
| Protein | Real Function | NC | Iteration | GO description |
|---|---|---|---|---|
| YGR285C | GO:0006417 | GO:0006417 | GO:0006417 | Regulation of translation |
| GO:0006450 | x | GO:0006450 | Regulation of translational fidelity | |
| GO:0006457 | x | x | Protein folding | |
| YGL022W | GO:0006486 | x | GO:0006486 | protein amino acid glycosylation |
| GO:0006487 | GO:0006487 | GO:0006487 | protein amino acid N-linked glycosylation | |
| GO:0018193 | GO:0018193 | GO:0018193 | peptidyl-amino acid modification | |
| GO:0009100 | x | GO:0009100 | glycoprotein metabolic process | |
| YPR180W | GO:0006974 | x | GO:0006974 | response to DNA damage stimulus |
| GO:0008152 | x | x | metabolic process | |
| YKL181W | GO:0006015 | GO:0006015 | GO:0006015 | 5-phosphoribose 1-diphosphate biosynthetic process |
| GO:0009117 | GO:0009117 | GO:0009117 | nucleotide metabolic process | |
| GO:0009156 | GO:0009156 | GO:0009156 | ribonucleoside monophosphate biosynthetic process | |
| GO:0009165 | GO:0009165 | GO:0009165 | nucleotide biosynthetic process | |
| GO:0031505 | GO:0031505 | GO:0031505 | fungal-type cell wall organization | |
| GO:0043093 | x | GO:0043093 | cytokinesis by binary fission | |
| YGL078C | GO:0006364 | GO:0006364 | GO:0006364 | rRNA processing |
| GO:0000027 | x | GO:0000027 | ribosomal large subunit assembly | |
| GO:0009451 | x | x | RNA modification | |
| YDR306C | GO:0006511 | x | GO:0006511 | ubiquitin-dependent protein catabolic process |
| YGR043C | GO:0005975 | GO:0005975 | GO:0005975 | carbohydrate metabolic process |
| GO:0008152 | GO:0008152 | GO:0008152 | metabolic process | |
| GO:0006098 | x | GO:0006098 | pentose-phosphate shunt | |
| GO:0006914 | x | x | Autophagy | |
| YJR140C | GO:0006350 | GO:0006350 | GO:0006350 | Transcription |
| GO:0045449 | GO:0045449 | GO:0045449 | regulation of transcription | |
| GO:0006368 | x | GO:0006368 | RNA elongation from RNA polymerase II promoter | |
| GO:0006336 | x | x | DNA replication-independent nucleosome assembly | |
| GO:0000083 | x | x | regulation of transcription involved in G1/S phase of mitotic cell cycle | |
| YLR086W | GO:0007076 | GO:0007076 | GO:0007076 | mitotic chromosome condensation |
| GO:0007049 | GO:0007049 | GO:0007049 | cell cycle | |
| GO:0007067 | GO:0007067 | GO:0007067 | Mitosis | |
| GO:0051301 | GO:0051301 | GO:0051301 | cell division | |
| GO:0000070 | GO:0000070 | GO:0000070 | mitotic sister chromatid segregation | |
| GO:0030261 | x | GO:0030261 | chromosome condensation | |
| GO:0070058 | x | x | tRNA gene clustering | |
| GO:0051276 | x | x | chromosome organization | |
| YJR065C | GO:0048308 | GO:0048308 | GO:0048308 | organelle inheritance |
| GO:0007015 | GO:0007015 | GO:0007015 | actin filament organization | |
| GO:0000001 | GO:0000001 | GO:0000001 | mitochondrion inheritance | |
| GO:0030833 | x | GO:0030833 | regulation of actin filament polymerization | |
| GO:0034314 | x | x | Arp2/3 complex-mediated acting nucleation | |
Note: X stands for a not-predicted function. These sample results show that the iterative approach predicted more correct functions than the non-iterative approach. For instance, the NC algorithm predicted only one correct function for protein YGR285C, while the iterative algorithm predicted two correct functions; the NC algorithms failed in predicting any correct functions for protein YPR180W, however one function was correctly predicted by the iterative algorithm.