Literature DB >> 16274699

Predicting rRNA-, RNA-, and DNA-binding proteins from primary structure with support vector machines.

Xiaojing Yu1, Jianping Cao, Yudong Cai, Tieliu Shi, Yixue Li.   

Abstract

In the post-genome era, the prediction of protein function is one of the most demanding tasks in the study of bioinformatics. Machine learning methods, such as the support vector machines (SVMs), greatly help to improve the classification of protein function. In this work, we integrated SVMs, protein sequence amino acid composition, and associated physicochemical properties into the study of nucleic-acid-binding proteins prediction. We developed the binary classifications for rRNA-, RNA-, DNA-binding proteins that play an important role in the control of many cell processes. Each SVM predicts whether a protein belongs to rRNA-, RNA-, or DNA-binding protein class. Self-consistency and jackknife tests were performed on the protein data sets in which the sequences identity was < 25%. Test results show that the accuracies of rRNA-, RNA-, DNA-binding SVMs predictions are approximately 84%, approximately 78%, approximately 72%, respectively. The predictions were also performed on the ambiguous and negative data set. The results demonstrate that the predicted scores of proteins in the ambiguous data set by RNA- and DNA-binding SVM models were distributed around zero, while most proteins in the negative data set were predicted as negative scores by all three SVMs. The score distributions agree well with the prior knowledge of those proteins and show the effectiveness of sequence associated physicochemical properties in the protein function prediction. The software is available from the author upon request.

Mesh:

Substances:

Year:  2005        PMID: 16274699     DOI: 10.1016/j.jtbi.2005.09.018

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  48 in total

1.  Highly accurate and high-resolution function prediction of RNA binding proteins by fold recognition and binding affinity prediction.

Authors:  Huiying Zhao; Yuedong Yang; Yaoqi Zhou
Journal:  RNA Biol       Date:  2011-11-01       Impact factor: 4.652

2.  Prediction of RNA binding sites in proteins from amino acid sequence.

Authors:  Michael Terribilini; Jae-Hyung Lee; Changhui Yan; Robert L Jernigan; Vasant Honavar; Drena Dobbs
Journal:  RNA       Date:  2006-06-21       Impact factor: 4.942

3.  Predicting miRNA's target from primary structure by the nearest neighbor algorithm.

Authors:  Kao Lin; Ziliang Qian; Lin Lu; Lingyi Lu; Lihui Lai; Jieyi Gu; Zhenbing Zeng; Haipeng Li; Yudong Cai
Journal:  Mol Divers       Date:  2009-12-30       Impact factor: 2.943

4.  Identifying differences in protein expression levels by spectral counting and feature selection.

Authors:  P C Carvalho; J Hewel; V C Barbosa; J R Yates
Journal:  Genet Mol Res       Date:  2008-04-15

5.  Prediction of interactiveness of proteins and nucleic acids based on feature selections.

Authors:  YouLang Yuan; XiaoHe Shi; XinLei Li; WenCong Lu; YuDong Cai; Lei Gu; Liang Liu; MinJie Li; XiangYin Kong; Meng Xing
Journal:  Mol Divers       Date:  2009-10-09       Impact factor: 2.943

6.  Prediction and validation of the unexplored RNA-binding protein atlas of the human proteome.

Authors:  Huiying Zhao; Yuedong Yang; Sarath Chandra Janga; C Cheng Kao; Yaoqi Zhou
Journal:  Proteins       Date:  2013-11-22

7.  Boosting the prediction and understanding of DNA-binding domains from sequence.

Authors:  Robert E Langlois; Hui Lu
Journal:  Nucleic Acids Res       Date:  2010-02-15       Impact factor: 16.971

8.  Prediction of pharmacological and xenobiotic responses to drugs based on time course gene expression profiles.

Authors:  Tao Huang; Weiren Cui; Lele Hu; Kaiyan Feng; Yi-Xue Li; Yu-Dong Cai
Journal:  PLoS One       Date:  2009-12-02       Impact factor: 3.240

9.  Proteome-wide prediction of novel DNA/RNA-binding proteins using amino acid composition and periodicity in the hyperthermophilic archaeon Pyrococcus furiosus.

Authors:  Kosuke Fujishima; Mizuki Komasa; Sayaka Kitamura; Haruo Suzuki; Masaru Tomita; Akio Kanai
Journal:  DNA Res       Date:  2007-06-15       Impact factor: 4.458

10.  LipocalinPred: a SVM-based method for prediction of lipocalins.

Authors:  Jayashree Ramana; Dinesh Gupta
Journal:  BMC Bioinformatics       Date:  2009-12-24       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.