Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Prediction of integral membrane protein type by collocated hydrophobic amino acid pairs.

Literature DB >> 18567007

Prediction of integral membrane protein type by collocated hydrophobic amino acid pairs.

Ke Chen¹, Yingfu Jiang, Li Du, Lukasz Kurgan.

Abstract

A computational model, IMP-TYPE, is proposed for the classification of five types of integral membrane proteins from protein sequence. The proposed model aims not only at providing accurate predictions but most importantly it incorporates interesting and transparent biological patterns. When contrasted with the best-performing existing models, IMP-TYPE reduces the error rates of these methods by 19 and 34% for two out-of-sample tests performed on benchmark datasets. Our empirical evaluations also show that the proposed method provides even bigger improvements, i.e., 29 and 45% error rate reductions, when predictions are performed for sequences that share low (40%) identity with sequences from the training dataset. We also show that IMP-TYPE can be used in a standalone mode, i.e., it duplicates significant majority of correct predictions provided by other leading methods, while providing additional correct predictions which are incorrectly classified by the other methods. Our method computes predictions using a Support Vector Machine classifier that takes feature-based encoded sequence as its input. The input feature set includes hydrophobic AA pairs, which were selected by utilizing a consensus of three feature selection algorithms. The hydrophobic residues that build up the AA pairs used by our method are shown to be associated with the formation of transmembrane helices in a few recent studies concerning integral membrane proteins. Our study also indicates that Met and Phe display a certain degree of hydrophobicity, which may be more crucial than their polarity or aromaticity when they occur in the transmembrane segments. This conclusion is supported by a recent study on potential of mean force for membrane protein folding and a study of scales for membrane propensity of amino acids. Copyright 2008 Wiley Periodicals, Inc.

Entities: Chemical

Mesh：

Substances：

Year: 2009 PMID： 18567007 DOI： 10.1002/jcc.21053

Source DB: PubMed Journal: J Comput Chem ISSN： 0192-8651 Impact factor: 3.376

Keyword Cloud
Cited

26 in total

1. iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization.

Authors: Zhen Chen; Pei Zhao; Chen Li; Fuyi Li; Dongxu Xiang; Yong-Zi Chen; Tatsuya Akutsu; Roger J Daly; Geoffrey I Webb; Quanzhi Zhao; Lukasz Kurgan; Jiangning Song
Journal: Nucleic Acids Res Date: 2021-06-04 Impact factor: 16.971

2. A novel fusion based on the evolutionary features for protein fold recognition using support vector machines.

Authors: Mohammad Saleh Refahi; A Mir; Jalal A Nasiri
Journal: Sci Rep Date: 2020-09-01 Impact factor: 4.379

3. BioSeq-Analysis2.0: an updated platform for analyzing DNA, RNA and protein sequences at sequence level and residue level based on machine learning approaches.

Authors: Bin Liu; Xin Gao; Hanyu Zhang
Journal: Nucleic Acids Res Date: 2019-11-18 Impact factor: 16.971

4. Using weakly conserved motifs hidden in secretion signals to identify type-III effectors from bacterial pathogen genomes.

Authors: Xiaobao Dong; Yong-Jun Zhang; Ziding Zhang
Journal: PLoS One Date: 2013-02-20 Impact factor: 3.240

5. Prediction of ubiquitination sites by using the composition of k-spaced amino acid pairs.

Authors: Zhen Chen; Yong-Zi Chen; Xiao-Feng Wang; Chuan Wang; Ren-Xiang Yan; Ziding Zhang
Journal: PLoS One Date: 2011-07-29 Impact factor: 3.240

6. ATPsite: sequence-based prediction of ATP-binding residues.

Authors: Ke Chen; Marcin J Mizianty; Lukasz Kurgan
Journal: Proteome Sci Date: 2011-10-14 Impact factor: 2.480

7. Prediction of beta-turns at over 80% accuracy based on an ensemble of predicted secondary structures and multiple alignments.

Authors: Ce Zheng; Lukasz Kurgan
Journal: BMC Bioinformatics Date: 2008-10-10 Impact factor: 3.169