Literature DB >> 26357323

iPFPi: A System for Improving Protein Function Prediction through Cumulative Iterations.

Kamal Taha, Paul D Yoo, Mohammed Alzaabi.   

Abstract

We propose a classifier system called iPFPi that predicts the functions of un-annotated proteins. iPFPi assigns an un-annotated protein P the functions of GO annotation terms that are semantically similar to P. An un-annotated protein P and a GO annotation term T are represented by their characteristics. The characteristics of P are GO terms found within the abstracts of biomedical literature associated with P. The characteristics of Tare GO terms found within the abstracts of biomedical literature associated with the proteins annotated with the function of T. Let F and F/ be the important (dominant) sets of characteristic terms representing T and P, respectively. iPFPi would annotate P with the function of T, if F and F/ are semantically similar. We constructed a novel semantic similarity measure that takes into consideration several factors, such as the dominance degree of each characteristic term t in set F based on its score, which is a value that reflects the dominance status of t relative to other characteristic terms, using pairwise beats and looses procedure. Every time a protein P is annotated with the function of T, iPFPi updates and optimizes the current scores of the characteristic terms for T based on the weights of the characteristic terms for P. Set F will be updated accordingly. Thus, the accuracy of predicting the function of T as the function of subsequent proteins improves. This prediction accuracy keeps improving over time iteratively through the cumulative weights of the characteristic terms representing proteins that are successively annotated with the function of T. We evaluated the quality of iPFPi by comparing it experimentally with two recent protein function prediction systems. Results showed marked improvement.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26357323     DOI: 10.1109/TCBB.2014.2344681

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  3 in total

1.  Predicting protein functions by applying predicate logic to biomedical literature.

Authors:  Kamal Taha; Youssef Iraqi; Amira Al Aamri
Journal:  BMC Bioinformatics       Date:  2019-02-08       Impact factor: 3.169

2.  Constructing Genetic Networks using Biomedical Literature and Rare Event Classification.

Authors:  Amira Al-Aamri; Kamal Taha; Yousof Al-Hammadi; Maher Maalouf; Dirar Homouz
Journal:  Sci Rep       Date:  2017-11-17       Impact factor: 4.379

3.  Selection of computational environments for PSP processing on scientific gateways.

Authors:  Edvard Martins de Oliveira; Júlio Cézar Estrella; Alexandre Cláudio Botazzo Delbem; Luiz Henrique Nunes; Henrique Yoshikazu Shishido; Stephan Reiff-Marganiec
Journal:  Heliyon       Date:  2018-07-17
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.