
Prediction of RNA-binding proteins from primary sequence by a support vector machine approach.

Lian Yi Han, Cong Zhong Cai, Siew Lin Lo, Maxey C M Chung, Yu Zong Chen.

Abstract

Elucidating how proteins interact with different molecules is important for understanding cellular processes. Computational methods have been developed for the prediction of protein-protein interactions. However, insufficient attention has been paid to the prediction of protein-RNA interactions, which play central roles in regulating gene expression and certain RNA-mediated enzymatic processes. This work explored the use of a machine learning method, support vector machines (SVM), for the prediction of RNA-binding proteins directly from their primary sequence. Based on the knowledge of known RNA-binding and non-RNA-binding proteins, an SVM system was trained to recognize RNA-binding proteins. A total of 4011 RNA-binding and 9781 non-RNA-binding proteins were used to train and test the SVM classification system, and an independent set of 447 RNA-binding and 4881 non-RNA-binding proteins was used to evaluate the classification accuracy. Testing results using this independent evaluation set show a prediction accuracy of 94.1%, 79.3%, and 94.1% for rRNA-, mRNA-, and tRNA-binding proteins, and 98.7%, 96.5%, and 99.9% for non-rRNA-, non-mRNA-, and non-tRNA-binding proteins, respectively. The SVM classification system was further tested on a small class of snRNA-binding proteins with only 60 available sequences. The prediction accuracy is 40.0% and 99.9% for snRNA-binding and non-snRNA-binding proteins, respectively, indicating that a sufficient number of proteins is needed to train an SVM. The SVM classification systems trained in this work were added to our Web-based protein functional classification software SVMProt, at http://jing.cz3.nus.edu.sg/cgi-bin/svmprot.cgi. Our study suggests the potential of SVM as a useful tool for facilitating the prediction of protein-RNA interactions.
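The abstract reports accuracy separately for the binding and non-binding classes and describes predicting from primary sequence alone. The following is a minimal sketch of those two ideas, not the authors' implementation. The abstract does not say which sequence features were used, so the amino-acid composition encoding here is an assumption chosen only for illustration, and the names `composition_vector` and `per_class_accuracy` are hypothetical. Vectors of this kind could be passed to any SVM library (for example `sklearn.svm.SVC`) for training; that step is omitted.

```python
from collections import Counter

# The 20 standard amino acids, in a fixed order that defines the feature layout.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition_vector(sequence):
    """Encode a protein sequence as 20 amino-acid fractions.

    ASSUMPTION: this encoding is illustrative only; the abstract does not
    specify the descriptors used to train the SVM.
    """
    counts = Counter(ch for ch in sequence.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


def per_class_accuracy(true_labels, predicted_labels):
    """Return (positive-class accuracy, negative-class accuracy).

    Labels are 1 for RNA-binding and 0 for non-RNA-binding. Each value is the
    fraction of that class predicted correctly, matching the way the abstract
    reports one figure for binders and one for non-binders.
    """
    pairs = list(zip(true_labels, predicted_labels))
    positives = [pred for true, pred in pairs if true == 1]
    negatives = [pred for true, pred in pairs if true == 0]
    if not positives or not negatives:
        raise ValueError("both classes must be present")
    positive_accuracy = sum(1 for pred in positives if pred == 1) / len(positives)
    negative_accuracy = sum(1 for pred in negatives if pred == 0) / len(negatives)
    return positive_accuracy, negative_accuracy


if __name__ == "__main__":
    # Toy inputs, invented for the example; they are not data from the study.
    features = composition_vector("AAKK")
    print(features[AMINO_ACIDS.index("A")], features[AMINO_ACIDS.index("K")])
    print(per_class_accuracy([1, 1, 0, 0, 0], [1, 0, 0, 0, 0]))
```

Reporting the two classes separately matters when the classes are imbalanced: with far more non-binders than binders, a single overall accuracy can look high even when few binders are recovered, as in the snRNA case described above.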

Year:  2004        PMID: 14970381      PMCID: PMC1370931          DOI: 10.1261/rna.5890304

Source DB:  PubMed          Journal:  RNA        ISSN: 1355-8382            Impact factor:   4.942


  44 in total

1.  Simulations of the dynamics at an RNA-protein interface.

Authors:  T Hermann; E Westhof
Journal:  Nat Struct Biol       Date:  1999-06

Review 2.  Metabolic networks: a signal-oriented approach to cellular models.

Authors:  J W Lengeler
Journal:  Biol Chem       Date:  2000 Sep-Oct       Impact factor: 3.915

3.  Prediction of protein structural classes by support vector machines.

Authors:  Yu-Dong Cai; Xiao-Jun Liu; Xue-biao Xu; Kuo-Chen Chou
Journal:  Comput Chem       Date:  2002-02

Review 4.  RNA-protein interactions that regulate pre-mRNA splicing.

Authors:  Ravinder Singh
Journal:  Gene Expr       Date:  2002

5.  Classifying G-protein coupled receptors with support vector machines.

Authors:  Rachel Karchin; Kevin Karplus; David Haussler
Journal:  Bioinformatics       Date:  2002-01       Impact factor: 6.937

Review 6.  Themes in RNA-protein recognition.

Authors:  D E Draper
Journal:  J Mol Biol       Date:  1999-10-22       Impact factor: 5.469

7.  Support vector machines for spam categorization.

Authors:  H Drucker; D Wu; V N Vapnik
Journal:  IEEE Trans Neural Netw       Date:  1999

Review 8.  Protein modules and signalling networks.

Authors:  T Pawson
Journal:  Nature       Date:  1995-02-16       Impact factor: 49.962

9.  Conservation of gene order: a fingerprint of proteins that physically interact.

Authors:  T Dandekar; B Snel; M Huynen; P Bork
Journal:  Trends Biochem Sci       Date:  1998-09       Impact factor: 13.807

10.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

  43 in total

1.  Highly accurate and high-resolution function prediction of RNA binding proteins by fold recognition and binding affinity prediction.

Authors:  Huiying Zhao; Yuedong Yang; Yaoqi Zhou
Journal:  RNA Biol       Date:  2011-11-01       Impact factor: 4.652

2.  Prediction of RNA binding sites in proteins from amino acid sequence.

Authors:  Michael Terribilini; Jae-Hyung Lee; Changhui Yan; Robert L Jernigan; Vasant Honavar; Drena Dobbs
Journal:  RNA       Date:  2006-06-21       Impact factor: 4.942

3.  MHC-BPS: MHC-binder prediction server for identifying peptides of flexible lengths from sequence-derived physicochemical properties.

Authors:  Juan Cui; Lian Yi Han; Hong Huang Lin; Zhi Qun Tang; Li Jiang; Zhi Wei Cao; Yu Zong Chen
Journal:  Immunogenetics       Date:  2006-07-11       Impact factor: 2.846

4.  Prediction and validation of the unexplored RNA-binding protein atlas of the human proteome.

Authors:  Huiying Zhao; Yuedong Yang; Sarath Chandra Janga; C Cheng Kao; Yaoqi Zhou
Journal:  Proteins       Date:  2013-11-22

5.  Incorporating significant amino acid pairs and protein domains to predict RNA splicing-related proteins with functional roles.

Authors:  Justin Bo-Kai Hsu; Kai-Yao Huang; Tzu-Ya Weng; Chien-Hsun Huang; Tzong-Yi Lee
Journal:  J Comput Aided Mol Des       Date:  2014-01-19       Impact factor: 3.686

6.  iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization.

Authors:  Zhen Chen; Pei Zhao; Chen Li; Fuyi Li; Dongxu Xiang; Yong-Zi Chen; Tatsuya Akutsu; Roger J Daly; Geoffrey I Webb; Quanzhi Zhao; Lukasz Kurgan; Jiangning Song
Journal:  Nucleic Acids Res       Date:  2021-06-04       Impact factor: 16.971

7.  Prediction and classification of aminoacyl tRNA synthetases using PROSITE domains.

Authors:  Bharat Panwar; Gajendra P S Raghava
Journal:  BMC Genomics       Date:  2010-09-22       Impact factor: 3.969

8.  Comprehensive comparative analysis and identification of RNA-binding protein domains: multi-class classification and feature selection.

Authors:  Samad Jahandideh; Vinodh Srinivasasainagendra; Degui Zhi
Journal:  J Theor Biol       Date:  2012-08-03       Impact factor: 2.691

Review 9.  Prediction of RNA binding proteins comes of age from low resolution to high resolution.

Authors:  Huiying Zhao; Yuedong Yang; Yaoqi Zhou
Journal:  Mol Biosyst       Date:  2013-10

10.  Proteome-wide prediction of novel DNA/RNA-binding proteins using amino acid composition and periodicity in the hyperthermophilic archaeon Pyrococcus furiosus.

Authors:  Kosuke Fujishima; Mizuki Komasa; Sayaka Kitamura; Haruo Suzuki; Masaru Tomita; Akio Kanai
Journal:  DNA Res       Date:  2007-06-15       Impact factor: 4.458
