Discrimination of mesophilic and thermophilic proteins using machine learning algorithms.

M Michael Gromiha, M Xavier Suresh.

Abstract

Discriminating thermophilic proteins from their mesophilic counterparts is a challenging task, and solving it would help in designing stable proteins. In this work, we systematically analyzed the amino acid compositions of 3075 mesophilic and 1609 thermophilic proteins belonging to 9 and 15 families, respectively. We found that the charged residues Lys, Arg, and Glu, as well as the hydrophobic residues Val and Ile, occur more frequently in thermophiles than in mesophiles. Further, we analyzed the performance of different methods, based on Bayes rules, logistic functions, neural networks, support vector machines, decision trees, and so forth, for discriminating mesophilic and thermophilic proteins. We found that most of the machine learning techniques discriminate these classes of proteins with similar accuracy. The neural network-based method discriminated thermophiles from mesophiles with a five-fold cross-validation accuracy of 89% in a dataset of 4684 proteins. Moreover, this method was tested with 325 mesophilic proteins from Xylella fastidiosa and 382 thermophilic proteins from Aquifex aeolicus, and it discriminated them with an accuracy of 91%. These accuracy levels are better than those of other methods in the literature, and we suggest that this method could be used effectively to discriminate mesophilic and thermophilic proteins. © 2007 Wiley-Liss, Inc.
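
The abstract rests on two ideas: representing each protein by its amino acid composition, and estimating accuracy by five-fold cross-validation. The Python sketch below illustrates both in a minimal form. It is not the authors' implementation: the function names `aa_composition` and `k_fold_indices` and the toy sequence are hypothetical, and the classifier itself is left out.

```python
from collections import Counter

# The 20 standard amino acids, one-letter codes.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def aa_composition(sequence):
    """Return the percentage of each standard amino acid in a sequence."""
    seq = sequence.upper()
    # Non-standard symbols (e.g. X, gaps) are ignored in this sketch.
    counts = Counter(ch for ch in seq if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return {aa: 100.0 * counts[aa] / total for aa in AMINO_ACIDS}


def k_fold_indices(n_samples, k=5):
    """Yield (train, test) index lists for k-fold cross-validation.

    Assumption: samples are already shuffled; fold i takes every k-th
    index starting at i, so each sample is tested exactly once.
    """
    folds = [list(range(i, n_samples, k)) for i in range(k)]
    for i, test in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


# Hypothetical toy sequence, not taken from the paper's dataset.
toy = "MKKVEELIRKAVEEGK"
comp = aa_composition(toy)
print(comp["K"], comp["E"], comp["V"])
```

Each protein becomes a 20-dimensional vector that a classifier could take as input. With real data one would typically shuffle and stratify the folds by class, since the dataset here is unbalanced (3075 mesophilic versus 1609 thermophilic proteins).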

Year:  2008        PMID: 17876820     DOI: 10.1002/prot.21616

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  16 in total

1.  Proteome-wide Analysis of Protein Thermal Stability in the Model Higher Plant Arabidopsis thaliana.

Authors:  Jeremy D Volkening; Kelly E Stecker; Michael R Sussman
Journal:  Mol Cell Proteomics       Date:  2018-11-06       Impact factor: 5.911

2.  Impacts of the charged residues mutation S48E/N62H on the thermostability and unfolding behavior of cold shock protein: insights from molecular dynamics simulation with Gō model.

Authors:  Ji-Guo Su; Xiao-Ming Han; Shu-Xin Zhao; Yan-Xue Hou; Xing-Yuan Li; Li-Sheng Qi; Ji-Hua Wang
Journal:  J Mol Model       Date:  2016-03-28       Impact factor: 1.810

3.  Classification of lung cancer tumors based on structural and physicochemical properties of proteins by bioinformatics models.

Authors:  Faezeh Hosseinzadeh; Mansour Ebrahimi; Bahram Goliaei; Narges Shamabadi
Journal:  PLoS One       Date:  2012-07-19       Impact factor: 3.240

4.  Prediction of thermostability from amino acid attributes by combination of clustering with attribute weighting: a new vista in engineering enzymes.

Authors:  Mansour Ebrahimi; Amir Lakizadeh; Parisa Agha-Golzadeh; Esmaeil Ebrahimie; Mahdi Ebrahimi
Journal:  PLoS One       Date:  2011-08-10       Impact factor: 3.240

5.  Development of a machine learning method to predict membrane protein-ligand binding residues using basic sequence information.

Authors:  M Xavier Suresh; M Michael Gromiha; Makiko Suwa
Journal:  Adv Bioinformatics       Date:  2015-01-31

6.  Naïve Bayes classifier with feature selection to identify phage virion proteins.

Authors:  Peng-Mian Feng; Hui Ding; Wei Chen; Hao Lin
Journal:  Comput Math Methods Med       Date:  2013-05-15       Impact factor: 2.238

7.  Bayesian prediction of bacterial growth temperature range based on genome sequences.

Authors:  Dan B Jensen; Tammi C Vesth; Peter F Hallin; Anders G Pedersen; David W Ussery
Journal:  BMC Genomics       Date:  2012-12-13       Impact factor: 3.969

8.  AcalPred: a sequence-based tool for discriminating between acidic and alkaline enzymes.

Authors:  Hao Lin; Wei Chen; Hui Ding
Journal:  PLoS One       Date:  2013-10-09       Impact factor: 3.240

9.  A novel scoring function for discriminating hyperthermophilic and mesophilic proteins with application to predicting relative thermostability of protein mutants.

Authors:  Yunqi Li; C Russell Middaugh; Jianwen Fang
Journal:  BMC Bioinformatics       Date:  2010-01-28       Impact factor: 3.169

10.  Understanding the underlying mechanism of HA-subtyping in the level of physic-chemical characteristics of protein.

Authors:  Mansour Ebrahimi; Parisa Aghagolzadeh; Narges Shamabadi; Ahmad Tahmasebi; Mohammed Alsharifi; David L Adelson; Farhid Hemmatzadeh; Esmaeil Ebrahimie
Journal:  PLoS One       Date:  2014-05-08       Impact factor: 3.240
