Literature DB >> 34030624

Moonlighting protein prediction using physico-chemical and evolutional properties via machine learning methods.

Farshid Shirafkan1, Sajjad Gharaghani2, Karim Rahimian3, Reza Hasan Sajedi4, Javad Zahiri5,6.   

Abstract

BACKGROUND: Moonlighting proteins (MPs) are a subclass of multifunctional proteins in which more than one independent or usually distinct function occurs in a single polypeptide chain. Identification of unknown cellular processes, understanding novel protein mechanisms, improving the prediction of protein functions, and gaining information about protein evolution are the main reasons to study MPs. They also play an important role in disease pathways and drug-target discovery. Since detecting MPs experimentally is quite a challenge, most of them are detected randomly. Therefore, introducing an appropriate computational approach to predict MPs seems reasonable.
RESULTS: In this study, we introduced a competent model for detecting moonlighting and non-MPs through extracted features from protein sequences. We attempted to set up a well-judged scheme for detecting outlier proteins. Consequently, 37 distinct feature vectors were utilized to study each protein's impact on detecting MPs. Furthermore, 8 different classification methods were assessed to find the best performance. To detect outliers, each one of the classifications was executed 100 times by tenfold cross-validation on feature vectors; proteins which misclassified 90 times or more were grouped. This process was applied to every single feature vector and eventually the intersection of these groups was determined as the outlier proteins. The results of tenfold cross-validation on a dataset of 351 samples (containing 215 moonlighting and 136 non-moonlighting proteins) reveal that the SVM method on all feature vectors has the highest performance among all methods in this study and other available methods. Besides, the study of outliers showed that 57 of 351 proteins in the dataset could be an appropriate candidate for the outlier. Among the outlier proteins, there were non-MPs (such as P69797) that have been misclassified in 8 different classification methods with 16 different feature vectors. Because these proteins have been obtained by computational methods, the results of this study could reduce the likelihood of hypothesizing whether these proteins are non-moonlighting at all.
CONCLUSIONS: MPs are difficult to be identified through experimentation. Using distinct feature vectors, our method enabled identification of novel moonlighting proteins. The study also pinpointed that a number of non-MPs are likely to be moonlighting.

Entities:  

Keywords:  Moonlighting protein; Multitasking proteins; Outlier; PSSM; Physico-chemical properties; Random forest; SVM; bioinformatics

Mesh:

Substances:

Year:  2021        PMID: 34030624      PMCID: PMC8142502          DOI: 10.1186/s12859-021-04194-5

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  22 in total

1.  PPIevo: protein-protein interaction prediction from PSSM based evolutionary information.

Authors:  Javad Zahiri; Omid Yaghoubi; Morteza Mohammad-Noori; Reza Ebrahimpour; Ali Masoudi-Nejad
Journal:  Genomics       Date:  2013-06-06       Impact factor: 5.736

Review 2.  Moonlighting proteins: an intriguing mode of multitasking.

Authors:  Daphne H E W Huberts; Ida J van der Klei
Journal:  Biochim Biophys Acta       Date:  2010-02-06

3.  Do protein-protein interaction databases identify moonlighting proteins?

Authors:  Antonio Gómez; Sergio Hernández; Isaac Amela; Jaume Piñol; Juan Cedano; Enrique Querol
Journal:  Mol Biosyst       Date:  2011-06-16

4.  Genome-scale prediction of moonlighting proteins using diverse protein association information.

Authors:  Ishita K Khan; Daisuke Kihara
Journal:  Bioinformatics       Date:  2016-03-26       Impact factor: 6.937

5.  rpiCOOL: A tool for In Silico RNA-protein interaction detection using random forest.

Authors:  Mohammad Akbaripour-Elahabad; Javad Zahiri; Reza Rafeh; Morteza Eslami; Mahboobeh Azari
Journal:  J Theor Biol       Date:  2016-04-28       Impact factor: 2.691

Review 6.  Protein moonlighting: what is it, and why is it important?

Authors:  Constance J Jeffery
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2018-01-19       Impact factor: 6.237

7.  Protein binding hot spots prediction from sequence only by a new ensemble learning method.

Authors:  Shan-Shan Hu; Peng Chen; Bing Wang; Jinyan Li
Journal:  Amino Acids       Date:  2017-08-01       Impact factor: 3.520

8.  Why study moonlighting proteins?

Authors:  Constance J Jeffery
Journal:  Front Genet       Date:  2015-06-19       Impact factor: 4.599

9.  MoonProt 2.0: an expansion and update of the moonlighting proteins database.

Authors:  Chang Chen; Shadi Zabad; Haipeng Liu; Wangfei Wang; Constance Jeffery
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

10.  Prediction of human-virus protein-protein interactions through a sequence embedding-based machine learning method.

Authors:  Xiaodi Yang; Shiping Yang; Qinmengge Li; Stefan Wuchty; Ziding Zhang
Journal:  Comput Struct Biotechnol J       Date:  2019-12-26       Impact factor: 7.271

View more
  4 in total

1.  Correction to: Moonlighting protein prediction using physico‑chemical and evolutional properties via machine learning methods.

Authors:  Farshid Shirafkan; Sajjad Gharaghani; Karim Rahimian; Reza Hasan Sajedi; Javad Zahiri
Journal:  BMC Bioinformatics       Date:  2021-07-09       Impact factor: 3.169

Review 2.  Moonlighting in Rickettsiales: Expanding Virulence Landscape.

Authors:  Ana Luísa Matos; Pedro Curto; Isaura Simões
Journal:  Trop Med Infect Dis       Date:  2022-02-19

3.  The Minimal Translation Machinery: What We Can Learn From Naturally and Experimentally Reduced Genomes.

Authors:  María José Garzón; Mariana Reyes-Prieto; Rosario Gil
Journal:  Front Microbiol       Date:  2022-04-11       Impact factor: 5.640

4.  A method for identifying moonlighting proteins based on linear discriminant analysis and bagging-SVM.

Authors:  Yu Chen; Sai Li; Jifeng Guo
Journal:  Front Genet       Date:  2022-08-15       Impact factor: 4.772

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.