Prediction of protein structural class using novel evolutionary collocation-based sequence representation.

Ke Chen, Lukasz A Kurgan, Jishou Ruan.

Abstract

Knowledge of structural classes is useful for understanding folding patterns in proteins. Although existing structural class prediction methods have applied virtually all state-of-the-art classifiers, many of them use a relatively simple protein sequence representation that often includes amino acid (AA) composition. To this end, we propose a novel sequence representation that incorporates evolutionary information encoded using PSI-BLAST profile-based collocation of AA pairs. We used six benchmark datasets and five representative classifiers to quantify and compare the quality of structural class prediction with the proposed representation. The best classifier, a support vector machine, achieved 61-96% accuracy on the six datasets. These predictions were compared with a wide range of recently proposed methods for prediction of structural classes. The comparison shows the superiority of the proposed representation, which yields error rate reductions of between 14% and 26% relative to the best-performing previously published classifiers on the considered datasets. The study also shows that, for the three benchmark datasets that include sequences characterized by low identity (i.e., 25%, 30%, and 40%), the prediction accuracies are 20-35% lower than for the other three datasets, which include sequences with a higher degree of similarity. In conclusion, the proposed representation is shown to substantially improve the accuracy of structural class prediction. A web server that implements the presented prediction method is freely available at http://biomine.ece.ualberta.ca/Structural_Class/SCEC.html. © 2008 Wiley Periodicals, Inc. J Comput Chem, 2008.
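The abstract reports improvements as error rate reductions rather than as accuracy gains, and the two are easy to confuse. The sketch below assumes the usual relative definition, (baseline error - new error) / baseline error, which the abstract does not spell out. The function name and the example accuracies are hypothetical and are not taken from the paper.

```python
def error_rate_reduction(baseline_accuracy: float, new_accuracy: float) -> float:
    """Relative reduction in error rate, given two accuracies in [0, 1].

    Assumes the common definition (e_old - e_new) / e_old; the abstract
    does not state which definition the authors used.
    """
    baseline_error = 1.0 - baseline_accuracy
    new_error = 1.0 - new_accuracy
    if baseline_error <= 0.0:
        raise ValueError("baseline accuracy must be below 1.0")
    return (baseline_error - new_error) / baseline_error


# Hypothetical accuracies for illustration only (not results from the paper).
reduction = error_rate_reduction(baseline_accuracy=0.90, new_accuracy=0.92)
print(f"{reduction:.0%}")
```

Under this definition, a gain of two accuracy points on a dataset where the baseline is already at 90% corresponds to removing one fifth of the remaining errors, which is why a modest accuracy gain can appear as a large error rate reduction.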

Year: 2008    PMID: 18293306    DOI: 10.1002/jcc.20918

Source DB: PubMed    Journal: J Comput Chem    ISSN: 0192-8651    Impact factor: 3.376


  31 in total

1.  Identifying anticancer peptides by using a generalized chaos game representation.

Authors:  Li Ge; Jiaguo Liu; Yusen Zhang; Matthias Dehmer
Journal:  J Math Biol       Date:  2018-10-05       Impact factor: 2.259

2.  Predicting the molecular interactions of CRIP1a-cannabinoid 1 receptor with integrated molecular modeling approaches.

Authors:  Mostafa H Ahmed; Glen E Kellogg; Dana E Selley; Martin K Safo; Yan Zhang
Journal:  Bioorg Med Chem Lett       Date:  2014-01-08       Impact factor: 2.823

3.  Exploring protein structural dissimilarity to facilitate structure classification.

Authors:  Pooja Jain; Jonathan D Hirst
Journal:  BMC Struct Biol       Date:  2009-09-19

4.  A new method for predicting the subcellular localization of eukaryotic proteins with both single and multiple sites: Euk-mPLoc 2.0.

Authors:  Kuo-Chen Chou; Hong-Bin Shen
Journal:  PLoS One       Date:  2010-04-01       Impact factor: 3.240

5.  Plant-mPLoc: a top-down strategy to augment the power for predicting plant protein subcellular localization.

Authors:  Kuo-Chen Chou; Hong-Bin Shen
Journal:  PLoS One       Date:  2010-06-28       Impact factor: 3.240

6.  BioSeq-Analysis2.0: an updated platform for analyzing DNA, RNA and protein sequences at sequence level and residue level based on machine learning approaches.

Authors:  Bin Liu; Xin Gao; Hanyu Zhang
Journal:  Nucleic Acids Res       Date:  2019-11-18       Impact factor: 16.971

7.  Deletion of Murid Herpesvirus 4 ORF63 Affects the Trafficking of Incoming Capsids toward the Nucleus.

Authors:  Muhammad Bilal Latif; Bénédicte Machiels; Xue Xiao; Jan Mast; Alain Vanderplasschen; Laurent Gillet
Journal:  J Virol       Date:  2015-12-16       Impact factor: 5.103

8.  Efficient Biosynthesis of Fungal Polyketides Containing the Dioxabicyclo-octane Ring System.

Authors:  Xu-Ming Mao; Zha-Jun Zhan; Matthew N Grayson; Man-Cheng Tang; Wei Xu; Yong-Quan Li; Wen-Bing Yin; Hsiao-Ching Lin; Yit-Heng Chooi; K N Houk; Yi Tang
Journal:  J Am Chem Soc       Date:  2015-09-10       Impact factor: 15.419

9.  Modular prediction of protein structural classes from sequences of twilight-zone identity with predicting sequences.

Authors:  Marcin J Mizianty; Lukasz Kurgan
Journal:  BMC Bioinformatics       Date:  2009-12-13       Impact factor: 3.169

10.  Comparison study on statistical features of predicted secondary structures for protein structural class prediction: From content to position.

Authors:  Qi Dai; Yan Li; Xiaoqing Liu; Yuhua Yao; Yunjie Cao; Pingan He
Journal:  BMC Bioinformatics       Date:  2013-05-04       Impact factor: 3.169
