Literature DB >> 15308540

Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes.

Kuo-Chen Chou1.   

Abstract

MOTIVATION: With protein sequences entering into databanks at an explosive pace, the early determination of the family or subfamily class for a newly found enzyme molecule becomes important because this is directly related to the detailed information about which specific target it acts on, as well as to its catalytic process and biological function. Unfortunately, it is both time-consuming and costly to do so by experiments alone. In a previous study, the covariant-discriminant algorithm was introduced to identify the 16 subfamily classes of oxidoreductases. Although the results were quite encouraging, the entire prediction process was based on the amino acid composition alone without including any sequence-order information. Therefore, it is worthy of further investigation.
RESULTS: To incorporate the sequence-order effects into the predictor, the 'amphiphilic pseudo amino acid composition' is introduced to represent the statistical sample of a protein. The novel representation contains 20 + 2lambda discrete numbers: the first 20 numbers are the components of the conventional amino acid composition; the next 2lambda numbers are a set of correlation factors that reflect different hydrophobicity and hydrophilicity distribution patterns along a protein chain. Based on such a concept and formulation scheme, a new predictor is developed. It is shown by the self-consistency test, jackknife test and independent dataset tests that the success rates obtained by the new predictor are all significantly higher than those by the previous predictors. The significant enhancement in success rates also implies that the distribution of hydrophobicity and hydrophilicity of the amino acid residues along a protein chain plays a very important role to its structure and function.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15308540     DOI: 10.1093/bioinformatics/bth466

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  180 in total

1.  iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition.

Authors:  Hao Lin; En-Ze Deng; Hui Ding; Wei Chen; Kuo-Chen Chou
Journal:  Nucleic Acids Res       Date:  2014-10-31       Impact factor: 16.971

2.  Prediction of ketoacyl synthase family using reduced amino acid alphabets.

Authors:  Wei Chen; Pengmian Feng; Hao Lin
Journal:  J Ind Microbiol Biotechnol       Date:  2011-10-26       Impact factor: 3.346

3.  Prediction of metalloproteinase family based on the concept of Chou's pseudo amino acid composition using a machine learning approach.

Authors:  Majid Mohammad Beigi; Mohaddeseh Behjati; Hassan Mohabatkar
Journal:  J Struct Funct Genomics       Date:  2011-12-03

4.  Predicting the Functional Types of Singleplex and Multiplex Eukaryotic Membrane Proteins via Different Models of Chou's Pseudo Amino Acid Compositions.

Authors:  Hong-Liang Zou; Xuan Xiao
Journal:  J Membr Biol       Date:  2015-10-12       Impact factor: 1.843

5.  iRSpot-GAEnsC: identifing recombination spots via ensemble classifier and extending the concept of Chou's PseAAC to formulate DNA samples.

Authors:  Muhammad Kabir; Maqsood Hayat
Journal:  Mol Genet Genomics       Date:  2015-08-30       Impact factor: 3.291

6.  Using fourier spectrum analysis and pseudo amino acid composition for prediction of membrane protein types.

Authors:  Hui Liu; Jie Yang; Meng Wang; Li Xue; Kuo-Chen Chou
Journal:  Protein J       Date:  2005-08       Impact factor: 2.371

7.  EHPred: an SVM-based method for epoxide hydrolases recognition and classification.

Authors:  Jia Jia; Liang Yang; Zi-Zhang Zhang
Journal:  J Zhejiang Univ Sci B       Date:  2006-01       Impact factor: 3.066

8.  Using an Ensemble to Identify and Classify Macroalgae Antimicrobial Peptides.

Authors:  Michela Chiara Caprani; John Healy; Orla Slattery; Joan O'Keeffe
Journal:  Interdiscip Sci       Date:  2021-05-12       Impact factor: 2.233

9.  D-Glucose sensing by a plasma membrane regulator of G signaling protein, AtRGS1.

Authors:  Jeffrey C Grigston; Daniel Osuna; Wolf-Rüdiger Scheible; Chenggang Liu; Mark Stitt; Alan M Jones
Journal:  FEBS Lett       Date:  2008-09-24       Impact factor: 4.124

10.  Imbalanced multi-label learning for identifying antimicrobial peptides and their functional types.

Authors:  Weizhong Lin; Dong Xu
Journal:  Bioinformatics       Date:  2016-08-26       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.