Literature DB >> 30923827

TriPepSVM: de novo prediction of RNA-binding proteins based on short amino acid motifs.

Annkatrin Bressin1, Roman Schulte-Sasse1, Davide Figini2, Erika C Urdaneta2, Benedikt M Beckmann2, Annalisa Marsico1,3,4.   

Abstract

In recent years, hundreds of novel RNA-binding proteins (RBPs) have been identified, leading to the discovery of novel RNA-binding domains. Furthermore, unstructured or disordered low-complexity regions of RBPs have been identified to play an important role in interactions with nucleic acids. However, these advances in understanding RBPs are limited mainly to eukaryotic species and we only have limited tools to faithfully predict RNA-binders in bacteria. Here, we describe a support vector machine-based method, called TriPepSVM, for the prediction of RNA-binding proteins. TriPepSVM applies string kernels to directly handle protein sequences using tri-peptide frequencies. Testing the method in human and bacteria, we find that several RBP-enriched tri-peptides occur more often in structurally disordered regions of RBPs. TriPepSVM outperforms existing applications, which consider classical structural features of RNA-binding or homology, in the task of RBP prediction in both human and bacteria. Finally, we predict 66 novel RBPs in Salmonella Typhimurium and validate the bacterial proteins ClpX, DnaJ and UbiG to associate with RNA in vivo.
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30923827      PMCID: PMC6511874          DOI: 10.1093/nar/gkz203

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  57 in total

1.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

Review 2.  What do we know about ribosomal RNA methylation in Escherichia coli?

Authors:  O V Sergeeva; A A Bogdanov; P V Sergiev
Journal:  Biochimie       Date:  2014-12-13       Impact factor: 4.079

3.  KeBABS: an R package for kernel-based analysis of biological sequences.

Authors:  Johannes Palme; Sepp Hochreiter; Ulrich Bodenhofer
Journal:  Bioinformatics       Date:  2015-03-25       Impact factor: 6.937

4.  RBPPred: predicting RNA-binding proteins from sequence using SVM.

Authors:  Xiaoli Zhang; Shiyong Liu
Journal:  Bioinformatics       Date:  2017-03-15       Impact factor: 6.937

5.  APRICOT: an integrated computational pipeline for the sequence-based identification and characterization of RNA-binding proteins.

Authors:  Malvika Sharan; Konrad U Förstner; Ana Eulalio; Jörg Vogel
Journal:  Nucleic Acids Res       Date:  2017-06-20       Impact factor: 16.971

6.  Structures of RNA complexes with the Escherichia coli RNA pyrophosphohydrolase RppH unveil the basis for specific 5'-end-dependent mRNA decay.

Authors:  Nikita Vasilyev; Alexander Serganov
Journal:  J Biol Chem       Date:  2015-02-05       Impact factor: 5.157

7.  HMMER web server: interactive sequence similarity searching.

Authors:  Robert D Finn; Jody Clements; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2011-05-18       Impact factor: 16.971

8.  The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets.

Authors:  Takaya Saito; Marc Rehmsmeier
Journal:  PLoS One       Date:  2015-03-04       Impact factor: 3.240

9.  RBscore&NBench: a high-level web server for nucleic acid binding residues prediction with a large-scale benchmarking database.

Authors:  Zhichao Miao; Eric Westhof
Journal:  Nucleic Acids Res       Date:  2016-04-15       Impact factor: 16.971

10.  Serial interactome capture of the human cell nucleus.

Authors:  Thomas Conrad; Anne-Susann Albrecht; Veronica Rodrigues de Melo Costa; Sascha Sauer; David Meierhofer; Ulf Andersson Ørom
Journal:  Nat Commun       Date:  2016-04-04       Impact factor: 14.919

View more
  12 in total

1.  RBP-TSTL is a two-stage transfer learning framework for genome-scale prediction of RNA-binding proteins.

Authors:  Xinxin Peng; Xiaoyu Wang; Yuming Guo; Zongyuan Ge; Fuyi Li; Xin Gao; Jiangning Song
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

Review 2.  How RNA-Binding Proteins Interact with RNA: Molecules and Mechanisms.

Authors:  Meredith Corley; Margaret C Burns; Gene W Yeo
Journal:  Mol Cell       Date:  2020-04-02       Impact factor: 17.970

Review 3.  The Function and Therapeutic Potential of Circular RNA in Cardiovascular Diseases.

Authors:  Kai Wang; Xiang-Qian Gao; Tao Wang; Lu-Yu Zhou
Journal:  Cardiovasc Drugs Ther       Date:  2021-07-16       Impact factor: 3.727

4.  iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles.

Authors:  Haitao Han; Chenchen Ding; Xin Cheng; Xiuzhi Sang; Taigang Liu
Journal:  Molecules       Date:  2021-04-24       Impact factor: 4.411

5.  Dual Attention Mechanisms and Feature Fusion Networks Based Method for Predicting LncRNA-Disease Associations.

Authors:  Yu Liu; Yingying Yu; Shimin Zhao
Journal:  Interdiscip Sci       Date:  2022-01-24       Impact factor: 2.233

6.  BioSeq-BLM: a platform for analyzing DNA, RNA and protein sequences based on biological language models.

Authors:  Hong-Liang Li; Yi-He Pang; Bin Liu
Journal:  Nucleic Acids Res       Date:  2021-12-16       Impact factor: 16.971

7.  Autogenous regulation in vivo of the rpmE gene encoding ribosomal protein L31 (bL31), a key component of the protein-protein intersubunit bridge B1b.

Authors:  Leonid V Aseev; Ludmila S Koledinskaya; Irina V Boni
Journal:  RNA       Date:  2020-03-24       Impact factor: 4.942

8.  Graph Convolutional Network and Convolutional Neural Network Based Method for Predicting lncRNA-Disease Associations.

Authors:  Ping Xuan; Shuxiang Pan; Tiangang Zhang; Yong Liu; Hao Sun
Journal:  Cells       Date:  2019-08-30       Impact factor: 6.600

Review 9.  Long non-coding RNAs in cutaneous biology and proliferative skin diseases: Advances and perspectives.

Authors:  Lipeng Tang; Yongxin Liang; Hesong Xie; Xiaozhi Yang; Guangjuan Zheng
Journal:  Cell Prolif       Date:  2019-10-06       Impact factor: 6.831

10.  Comparative Genomics and Evolutionary Analysis of RNA-Binding Proteins of Burkholderia cenocepacia J2315 and Other Members of the B. cepacia Complex.

Authors:  Joana R Feliciano; António M M Seixas; Tiago Pita; Jorge H Leitão
Journal:  Genes (Basel)       Date:  2020-02-21       Impact factor: 4.096

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.