Literature DB >> 12758155

Support vector machines for predicting rRNA-, RNA-, and DNA-binding proteins from amino acid sequence.

Yu-dong Cai1, Shuo Liang Lin.   

Abstract

Classification of gene function remains one of the most important and demanding tasks in the post-genome era. Most of the current predictive computer methods rely on comparing features that are essentially linear to the protein sequence. However, features of a protein nonlinear to the sequence may also be predictive to its function. Machine learning methods, for instance the Support Vector Machines (SVMs), are particularly suitable for exploiting such features. In this work we introduce SVM and the pseudo-amino acid composition, a collection of nonlinear features extractable from protein sequence, to the field of protein function prediction. We have developed prototype SVMs for binary classification of rRNA-, RNA-, and DNA-binding proteins. Using a protein's amino acid composition and limited range correlation of hydrophobicity and solvent accessible surface area as input, each of the SVMs predicts whether the protein belongs to one of the three classes. In self-consistency and cross-validation tests, which measures the success of learning and prediction, respectively, the rRNA-binding SVM has consistently achieved >95% accuracy. The RNA- and DNA-binding SVMs demonstrate more diverse accuracy, ranging from approximately 76% to approximately 97%. Analysis of the test results suggests the directions of improving the SVMs.

Mesh:

Substances:

Year:  2003        PMID: 12758155     DOI: 10.1016/s1570-9639(03)00112-2

Source DB:  PubMed          Journal:  Biochim Biophys Acta        ISSN: 0006-3002


  43 in total

1.  Highly accurate and high-resolution function prediction of RNA binding proteins by fold recognition and binding affinity prediction.

Authors:  Huiying Zhao; Yuedong Yang; Yaoqi Zhou
Journal:  RNA Biol       Date:  2011-11-01       Impact factor: 4.652

2.  Structure-based prediction of DNA-binding proteins by structural alignment and a volume-fraction corrected DFIRE-based energy function.

Authors:  Huiying Zhao; Yuedong Yang; Yaoqi Zhou
Journal:  Bioinformatics       Date:  2010-06-04       Impact factor: 6.937

3.  Prediction of RNA binding sites in proteins from amino acid sequence.

Authors:  Michael Terribilini; Jae-Hyung Lee; Changhui Yan; Robert L Jernigan; Vasant Honavar; Drena Dobbs
Journal:  RNA       Date:  2006-06-21       Impact factor: 4.942

4.  Prediction of interactiveness of proteins and nucleic acids based on feature selections.

Authors:  YouLang Yuan; XiaoHe Shi; XinLei Li; WenCong Lu; YuDong Cai; Lei Gu; Liang Liu; MinJie Li; XiangYin Kong; Meng Xing
Journal:  Mol Divers       Date:  2009-10-09       Impact factor: 2.943

5.  Prediction and validation of the unexplored RNA-binding protein atlas of the human proteome.

Authors:  Huiying Zhao; Yuedong Yang; Sarath Chandra Janga; C Cheng Kao; Yaoqi Zhou
Journal:  Proteins       Date:  2013-11-22

6.  Evolutionary insights into the active-site structures of the metallo-β-lactamase superfamily from a classification study with support vector machine.

Authors:  Lili Wang; Ling Yang; Yu-Lan Feng; Hao Zhang
Journal:  J Biol Inorg Chem       Date:  2020-09-18       Impact factor: 3.358

7.  Prediction and classification of aminoacyl tRNA synthetases using PROSITE domains.

Authors:  Bharat Panwar; Gajendra P S Raghava
Journal:  BMC Genomics       Date:  2010-09-22       Impact factor: 3.969

8.  Boosting the prediction and understanding of DNA-binding domains from sequence.

Authors:  Robert E Langlois; Hui Lu
Journal:  Nucleic Acids Res       Date:  2010-02-15       Impact factor: 16.971

Review 9.  Prediction of RNA binding proteins comes of age from low resolution to high resolution.

Authors:  Huiying Zhao; Yuedong Yang; Yaoqi Zhou
Journal:  Mol Biosyst       Date:  2013-10

10.  Proteome-wide prediction of novel DNA/RNA-binding proteins using amino acid composition and periodicity in the hyperthermophilic archaeon Pyrococcus furiosus.

Authors:  Kosuke Fujishima; Mizuki Komasa; Sayaka Kitamura; Haruo Suzuki; Masaru Tomita; Akio Kanai
Journal:  DNA Res       Date:  2007-06-15       Impact factor: 4.458

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.