Literature DB >> 18567007

Prediction of integral membrane protein type by collocated hydrophobic amino acid pairs.

Ke Chen1, Yingfu Jiang, Li Du, Lukasz Kurgan.   

Abstract

A computational model, IMP-TYPE, is proposed for the classification of five types of integral membrane proteins from protein sequence. The proposed model aims not only at providing accurate predictions but most importantly it incorporates interesting and transparent biological patterns. When contrasted with the best-performing existing models, IMP-TYPE reduces the error rates of these methods by 19 and 34% for two out-of-sample tests performed on benchmark datasets. Our empirical evaluations also show that the proposed method provides even bigger improvements, i.e., 29 and 45% error rate reductions, when predictions are performed for sequences that share low (40%) identity with sequences from the training dataset. We also show that IMP-TYPE can be used in a standalone mode, i.e., it duplicates significant majority of correct predictions provided by other leading methods, while providing additional correct predictions which are incorrectly classified by the other methods. Our method computes predictions using a Support Vector Machine classifier that takes feature-based encoded sequence as its input. The input feature set includes hydrophobic AA pairs, which were selected by utilizing a consensus of three feature selection algorithms. The hydrophobic residues that build up the AA pairs used by our method are shown to be associated with the formation of transmembrane helices in a few recent studies concerning integral membrane proteins. Our study also indicates that Met and Phe display a certain degree of hydrophobicity, which may be more crucial than their polarity or aromaticity when they occur in the transmembrane segments. This conclusion is supported by a recent study on potential of mean force for membrane protein folding and a study of scales for membrane propensity of amino acids. Copyright 2008 Wiley Periodicals, Inc.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 18567007     DOI: 10.1002/jcc.21053

Source DB:  PubMed          Journal:  J Comput Chem        ISSN: 0192-8651            Impact factor:   3.376


  26 in total

1.  iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization.

Authors:  Zhen Chen; Pei Zhao; Chen Li; Fuyi Li; Dongxu Xiang; Yong-Zi Chen; Tatsuya Akutsu; Roger J Daly; Geoffrey I Webb; Quanzhi Zhao; Lukasz Kurgan; Jiangning Song
Journal:  Nucleic Acids Res       Date:  2021-06-04       Impact factor: 16.971

2.  A novel fusion based on the evolutionary features for protein fold recognition using support vector machines.

Authors:  Mohammad Saleh Refahi; A Mir; Jalal A Nasiri
Journal:  Sci Rep       Date:  2020-09-01       Impact factor: 4.379

3.  BioSeq-Analysis2.0: an updated platform for analyzing DNA, RNA and protein sequences at sequence level and residue level based on machine learning approaches.

Authors:  Bin Liu; Xin Gao; Hanyu Zhang
Journal:  Nucleic Acids Res       Date:  2019-11-18       Impact factor: 16.971

4.  Using weakly conserved motifs hidden in secretion signals to identify type-III effectors from bacterial pathogen genomes.

Authors:  Xiaobao Dong; Yong-Jun Zhang; Ziding Zhang
Journal:  PLoS One       Date:  2013-02-20       Impact factor: 3.240

5.  Prediction of ubiquitination sites by using the composition of k-spaced amino acid pairs.

Authors:  Zhen Chen; Yong-Zi Chen; Xiao-Feng Wang; Chuan Wang; Ren-Xiang Yan; Ziding Zhang
Journal:  PLoS One       Date:  2011-07-29       Impact factor: 3.240

6.  ATPsite: sequence-based prediction of ATP-binding residues.

Authors:  Ke Chen; Marcin J Mizianty; Lukasz Kurgan
Journal:  Proteome Sci       Date:  2011-10-14       Impact factor: 2.480

7.  Prediction of beta-turns at over 80% accuracy based on an ensemble of predicted secondary structures and multiple alignments.

Authors:  Ce Zheng; Lukasz Kurgan
Journal:  BMC Bioinformatics       Date:  2008-10-10       Impact factor: 3.169

8.  IRC-Fuse: improved and robust prediction of redox-sensitive cysteine by fusing of multiple feature representations.

Authors:  Md Mehedi Hasan; Md Ashad Alam; Watshara Shoombuatong; Hiroyuki Kurata
Journal:  J Comput Aided Mol Des       Date:  2021-01-04       Impact factor: 3.686

9.  Prediction of protein phosphorylation sites by using the composition of k-spaced amino acid pairs.

Authors:  Xiaowei Zhao; Wenyi Zhang; Xin Xu; Zhiqiang Ma; Minghao Yin
Journal:  PLoS One       Date:  2012-10-22       Impact factor: 3.240

10.  Position-specific analysis and prediction for protein lysine acetylation based on multiple features.

Authors:  Sheng-Bao Suo; Jian-Ding Qiu; Shao-Ping Shi; Xing-Yu Sun; Shu-Yun Huang; Xiang Chen; Ru-Ping Liang
Journal:  PLoS One       Date:  2012-11-16       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.