Literature DB >> 28334975

APRICOT: an integrated computational pipeline for the sequence-based identification and characterization of RNA-binding proteins.

Malvika Sharan1, Konrad U Förstner2, Ana Eulalio1, Jörg Vogel1.   

Abstract

RNA-binding proteins (RBPs) have been established as core components of several post-transcriptional gene regulation mechanisms. Experimental techniques such as cross-linking and co-immunoprecipitation have enabled the identification of RBPs, RNA-binding domains (RBDs) and their regulatory roles in the eukaryotic species such as human and yeast in large-scale. In contrast, our knowledge of the number and potential diversity of RBPs in bacteria is poorer due to the technical challenges associated with the existing global screening approaches. We introduce APRICOT, a computational pipeline for the sequence-based identification and characterization of proteins using RBDs known from experimental studies. The pipeline identifies functional motifs in protein sequences using position-specific scoring matrices and Hidden Markov Models of the functional domains and statistically scores them based on a series of sequence-based features. Subsequently, APRICOT identifies putative RBPs and characterizes them by several biological properties. Here we demonstrate the application and adaptability of the pipeline on large-scale protein sets, including the bacterial proteome of Escherichia coli. APRICOT showed better performance on various datasets compared to other existing tools for the sequence-based prediction of RBPs by achieving an average sensitivity and specificity of 0.90 and 0.91 respectively. The command-line tool and its documentation are available at https://pypi.python.org/pypi/bio-apricot.
© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28334975      PMCID: PMC5499795          DOI: 10.1093/nar/gkx137

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  64 in total

1.  SVM based prediction of RNA-binding proteins using binding residues and evolutionary information.

Authors:  Manish Kumar; M Michael Gromiha; Gajendra P S Raghava
Journal:  J Mol Recognit       Date:  2011 Mar-Apr       Impact factor: 2.137

2.  Hidden Markov models in computational biology. Applications to protein modeling.

Authors:  A Krogh; M Brown; I S Mian; K Sjölander; D Haussler
Journal:  J Mol Biol       Date:  1994-02-04       Impact factor: 5.469

3.  Insights into RNA biology from an atlas of mammalian mRNA-binding proteins.

Authors:  Alfredo Castello; Bernd Fischer; Katrin Eichelbaum; Rastislav Horos; Benedikt M Beckmann; Claudia Strein; Norman E Davey; David T Humphreys; Thomas Preiss; Lars M Steinmetz; Jeroen Krijgsveld; Matthias W Hentze
Journal:  Cell       Date:  2012-05-31       Impact factor: 41.582

4.  SPOT-Seq-RNA: predicting protein-RNA complex structure and RNA-binding function by fold recognition and binding affinity prediction.

Authors:  Yuedong Yang; Huiying Zhao; Jihua Wang; Yaoqi Zhou
Journal:  Methods Mol Biol       Date:  2014

5.  An RNA recognition motif (RRM) is required for the localization of PTB-associated splicing factor (PSF) to subnuclear speckles.

Authors:  B T Dye; J G Patton
Journal:  Exp Cell Res       Date:  2001-02-01       Impact factor: 3.905

6.  New and continuing developments at PROSITE.

Authors:  Christian J A Sigrist; Edouard de Castro; Lorenzo Cerutti; Béatrice A Cuche; Nicolas Hulo; Alan Bridge; Lydie Bougueleret; Ioannis Xenarios
Journal:  Nucleic Acids Res       Date:  2012-11-17       Impact factor: 16.971

7.  UniProt Knowledgebase: a hub of integrated protein data.

Authors:  Michele Magrane
Journal:  Database (Oxford)       Date:  2011-03-29       Impact factor: 3.451

8.  SUPERFAMILY 1.75 including a domain-centric gene ontology method.

Authors:  David A de Lima Morais; Hai Fang; Owen J L Rackham; Derek Wilson; Ralph Pethica; Cyrus Chothia; Julian Gough
Journal:  Nucleic Acids Res       Date:  2010-11-09       Impact factor: 16.971

9.  Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions.

Authors:  Jaina Mistry; Robert D Finn; Sean R Eddy; Alex Bateman; Marco Punta
Journal:  Nucleic Acids Res       Date:  2013-04-17       Impact factor: 16.971

10.  Global analysis of yeast mRNPs.

Authors:  Sarah F Mitchell; Saumya Jain; Meipei She; Roy Parker
Journal:  Nat Struct Mol Biol       Date:  2012-12-09       Impact factor: 15.369

View more
  6 in total

1.  TriPepSVM: de novo prediction of RNA-binding proteins based on short amino acid motifs.

Authors:  Annkatrin Bressin; Roman Schulte-Sasse; Davide Figini; Erika C Urdaneta; Benedikt M Beckmann; Annalisa Marsico
Journal:  Nucleic Acids Res       Date:  2019-05-21       Impact factor: 16.971

2.  RBP-TSTL is a two-stage transfer learning framework for genome-scale prediction of RNA-binding proteins.

Authors:  Xinxin Peng; Xiaoyu Wang; Yuming Guo; Zongyuan Ge; Fuyi Li; Xin Gao; Jiangning Song
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

3.  A systematic analysis of the RNA-targeting potential of secreted bacterial effector proteins.

Authors:  Caroline Tawk; Malvika Sharan; Ana Eulalio; Jörg Vogel
Journal:  Sci Rep       Date:  2017-08-24       Impact factor: 4.379

4.  Deep-RBPPred: Predicting RNA binding proteins in the proteome scale based on deep learning.

Authors:  Jinfang Zheng; Xiaoli Zhang; Xunyi Zhao; Xiaoxue Tong; Xu Hong; Juan Xie; Shiyong Liu
Journal:  Sci Rep       Date:  2018-10-15       Impact factor: 4.379

5.  Prediction of RNA- and DNA-Binding Proteins Using Various Machine Learning Classifiers.

Authors:  Mehdi Poursheikhali Asghari; Parviz Abdolmaleki
Journal:  Avicenna J Med Biotechnol       Date:  2019 Jan-Mar

6.  Comparative Genomics and Evolutionary Analysis of RNA-Binding Proteins of Burkholderia cenocepacia J2315 and Other Members of the B. cepacia Complex.

Authors:  Joana R Feliciano; António M M Seixas; Tiago Pita; Jorge H Leitão
Journal:  Genes (Basel)       Date:  2020-02-21       Impact factor: 4.096

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.