DNA-binding protein prediction using plant specific support vector machines: validation and application of a new genome annotation tool.

Graham B Motion, Andrew J M Howden, Edgar Huitema, Susan Jones.

Abstract

There are currently 151 plants with draft genomes available, but levels of functional annotation for putative protein products are low. Therefore, accurate computational predictions are essential to annotate genomes in the first instance, and to provide focus for the more costly and time-consuming functional assays that follow. DNA-binding proteins are an important class of proteins that require annotation, but current computational methods are not applicable for genome-wide predictions in plant species. Here, we explore the use of species- and lineage-specific models for the prediction of DNA-binding proteins in plants. We show that a species-specific support vector machine model based on Arabidopsis sequence data is more accurate (accuracy 81%) than a generic model (74%), and based on this we develop a plant-specific model for predicting DNA-binding proteins. We apply this model to the tomato proteome and demonstrate its ability to perform accurate high-throughput prediction of DNA-binding proteins. In doing so, we have annotated 36 currently uncharacterised proteins by assigning a putative DNA-binding function. Our model is publicly available and we propose it be used in combination with existing tools to help increase annotation levels of DNA-binding proteins encoded in plant genomes.
© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

Year:  2015        PMID: 26304539      PMCID: PMC4678848          DOI: 10.1093/nar/gkv805

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  50 in total

Review 1.  The evolution of gene regulation by transcription factors and microRNAs.

Authors:  Kevin Chen; Nikolaus Rajewsky
Journal:  Nat Rev Genet       Date:  2007-02       Impact factor: 53.242

Review 2.  Predicting protein function from sequence and structure.

Authors:  David Lee; Oliver Redfern; Christine Orengo
Journal:  Nat Rev Mol Cell Biol       Date:  2007-12       Impact factor: 94.444

Review 3.  Computational approaches for predicting the binding sites and understanding the recognition mechanism of protein-DNA complexes.

Authors:  M Michael Gromiha; R Nagarajan
Journal:  Adv Protein Chem Struct Biol       Date:  2013       Impact factor: 3.507

4.  newDNA-Prot: Prediction of DNA-binding proteins by employing support vector machine and a comprehensive sequence representation.

Authors:  Yanping Zhang; Jun Xu; Wei Zheng; Chen Zhang; Xingye Qiu; Ke Chen; Jishou Ruan
Journal:  Comput Biol Chem       Date:  2014-09-15       Impact factor: 2.877

Review 5.  Transcription dynamics in plant immunity.

Authors:  John W Moore; Gary J Loake; Steven H Spoel
Journal:  Plant Cell       Date:  2011-08-12       Impact factor: 11.277

6.  agriGO: a GO analysis toolkit for the agricultural community.

Authors:  Zhou Du; Xin Zhou; Yi Ling; Zhenhai Zhang; Zhen Su
Journal:  Nucleic Acids Res       Date:  2010-04-30       Impact factor: 16.971

7.  PiRaNhA: a server for the computational prediction of RNA-binding residues in protein sequences.

Authors:  Yoichi Murakami; Ruth V Spriggs; Haruki Nakamura; Susan Jones
Journal:  Nucleic Acids Res       Date:  2010-05-27       Impact factor: 16.971

8.  UniProt Knowledgebase: a hub of integrated protein data.

Authors:  Michele Magrane
Journal:  Database (Oxford)       Date:  2011-03-29       Impact factor: 3.451

9.  QuickGO: a web-based tool for Gene Ontology searching.

Authors:  David Binns; Emily Dimmer; Rachael Huntley; Daniel Barrell; Claire O'Donovan; Rolf Apweiler
Journal:  Bioinformatics       Date:  2009-09-10       Impact factor: 6.937

10.  Identification of DNA-binding proteins using support vector machines and evolutionary profiles.

Authors:  Manish Kumar; Michael M Gromiha; Gajendra P S Raghava
Journal:  BMC Bioinformatics       Date:  2007-11-27       Impact factor: 3.169

  9 in total

1.  Genomic insights into HSFs as candidate genes for high-temperature stress adaptation and gene editing with minimal off-target effects in flax.

Authors:  Dipnarayan Saha; Pranit Mukherjee; Sourav Dutta; Kanti Meena; Surja Kumar Sarkar; Asit Baran Mandal; Tapash Dasgupta; Jiban Mitra
Journal:  Sci Rep       Date:  2019-04-03       Impact factor: 4.379

2.  Detection of nucleic acid-protein interactions in plant leaves using fluorescence lifetime imaging microscopy.

Authors:  Laurent Camborde; Alain Jauneau; Christian Brière; Laurent Deslandes; Bernard Dumas; Elodie Gaulin
Journal:  Nat Protoc       Date:  2017-08-24       Impact factor: 13.491

3.  On the prediction of DNA-binding proteins only from primary sequences: A deep learning approach.

Authors:  Yu-Hui Qu; Hua Yu; Xiu-Jun Gong; Jia-Hui Xu; Hong-Shun Lee
Journal:  PLoS One       Date:  2017-12-29       Impact factor: 3.240

Review 4.  Use of Natural Diversity and Biotechnology to Increase the Quality and Nutritional Content of Tomato and Grape.

Authors:  Quentin Gascuel; Gianfranco Diretto; Antonio J Monforte; Ana M Fortes; Antonio Granell
Journal:  Front Plant Sci       Date:  2017-05-12       Impact factor: 5.753

5.  PredDBP-Stack: Prediction of DNA-Binding Proteins from HMM Profiles using a Stacked Ensemble Method.

Authors:  Jun Wang; Huiwen Zheng; Yang Yang; Wanyue Xiao; Taigang Liu
Journal:  Biomed Res Int       Date:  2020-04-13       Impact factor: 3.411

6.  An improved deep learning method for predicting DNA-binding proteins based on contextual features in amino acid sequences.

Authors:  Siquan Hu; Ruixiong Ma; Haiou Wang
Journal:  PLoS One       Date:  2019-11-14       Impact factor: 3.240

Review 7.  Single-Stranded DNA Binding Proteins and Their Identification Using Machine Learning-Based Approaches.

Authors:  Jun-Tao Guo; Fareeha Malik
Journal:  Biomolecules       Date:  2022-08-26

8.  Improved detection of DNA-binding proteins via compression technology on PSSM information.

Authors:  Yubo Wang; Yijie Ding; Fei Guo; Leyi Wei; Jijun Tang
Journal:  PLoS One       Date:  2017-09-29       Impact factor: 3.240

9.  HMMPred: Accurate Prediction of DNA-Binding Proteins Based on HMM Profiles and XGBoost Feature Selection.

Authors:  Xiuzhi Sang; Wanyue Xiao; Huiwen Zheng; Yang Yang; Taigang Liu
Journal:  Comput Math Methods Med       Date:  2020-03-28       Impact factor: 2.238
