Literature DB >> 25240115

newDNA-Prot: Prediction of DNA-binding proteins by employing support vector machine and a comprehensive sequence representation.

Yanping Zhang1, Jun Xu2, Wei Zheng2, Chen Zhang2, Xingye Qiu2, Ke Chen3, Jishou Ruan2.   

Abstract

Identification of DNA-binding proteins is essential in studying cellular activities as the DNA-binding proteins play a pivotal role in gene regulation. In this study, we propose newDNA-Prot, a DNA-binding protein predictor that employs support vector machine classifier and a comprehensive feature representation. The sequence representation are categorized into 6 groups: primary sequence based, evolutionary profile based, predicted secondary structure based, predicted relative solvent accessibility based, physicochemical property based and biological function based features. The mRMR, wrapper and two-stage feature selection methods are employed for removing irrelevant features and reducing redundant features. Experiments demonstrate that the two-stage method performs better than the mRMR and wrapper methods. We also perform a statistical analysis on the selected features and results show that more than 95% of the selected features are statistically significant and they cover all 6 feature groups. The newDNA-Prot method is compared with several state of the art algorithms, including iDNA-Prot, DNAbinder and DNA-Prot. The results demonstrate that newDNA-Prot method outperforms the iDNA-Prot, DNAbinder and DNA-Prot methods. More specific, newDNA-Prot improves the runner-up method, DNA-Prot for around 10% on several evaluation measures. The proposed newDNA-Prot method is available at http://sourceforge.net/projects/newdnaprot/
Copyright © 2014 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  DNA-binding proteins; Feature selection methods; Features; ROC; SVM

Mesh:

Substances:

Year:  2014        PMID: 25240115     DOI: 10.1016/j.compbiolchem.2014.09.002

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  10 in total

Review 1.  DNA-protein interaction: identification, prediction and data analysis.

Authors:  Abbasali Emamjomeh; Darush Choobineh; Behzad Hajieghrari; Nafiseh MahdiNezhad; Amir Khodavirdipour
Journal:  Mol Biol Rep       Date:  2019-03-26       Impact factor: 2.316

2.  Prediction of DNA binding proteins using local features and long-term dependencies with primary sequences based on deep learning.

Authors:  Guobin Li; Xiuquan Du; Xinlu Li; Le Zou; Guanhong Zhang; Zhize Wu
Journal:  PeerJ       Date:  2021-05-03       Impact factor: 2.984

Review 3.  An overview of the prediction of protein DNA-binding sites.

Authors:  Jingna Si; Rui Zhao; Rongling Wu
Journal:  Int J Mol Sci       Date:  2015-03-06       Impact factor: 5.923

4.  DNABP: Identification of DNA-Binding Proteins Based on Feature Selection Using a Random Forest and Predicting Binding Residues.

Authors:  Xin Ma; Jing Guo; Xiao Sun
Journal:  PLoS One       Date:  2016-12-01       Impact factor: 3.240

5.  Integrating sequence and gene expression information predicts genome-wide DNA-binding proteins and suggests a cooperative mechanism.

Authors:  Shandar Ahmad; Philip Prathipati; Lokesh P Tripathi; Yi-An Chen; Ajay Arya; Yoichi Murakami; Kenji Mizuguchi
Journal:  Nucleic Acids Res       Date:  2018-01-09       Impact factor: 16.971

6.  Identification and Analysis of Blood Gene Expression Signature for Osteoarthritis With Advanced Feature Selection Methods.

Authors:  Jing Li; Chun-Na Lan; Ying Kong; Song-Shan Feng; Tao Huang
Journal:  Front Genet       Date:  2018-08-30       Impact factor: 4.599

7.  Implications of Newly Identified Brain eQTL Genes and Their Interactors in Schizophrenia.

Authors:  Lei Cai; Tao Huang; Jingjing Su; Xinxin Zhang; Wenzhong Chen; Fuquan Zhang; Lin He; Kuo-Chen Chou
Journal:  Mol Ther Nucleic Acids       Date:  2018-07-11       Impact factor: 8.886

8.  DNA-binding protein prediction using plant specific support vector machines: validation and application of a new genome annotation tool.

Authors:  Graham B Motion; Andrew J M Howden; Edgar Huitema; Susan Jones
Journal:  Nucleic Acids Res       Date:  2015-08-24       Impact factor: 16.971

9.  Harnessing the evolutionary information on oxygen binding proteins through Support Vector Machines based modules.

Authors:  Selvaraj Muthukrishnan; Munish Puri
Journal:  BMC Res Notes       Date:  2018-05-11

10.  Prediction of RNA- and DNA-Binding Proteins Using Various Machine Learning Classifiers.

Authors:  Mehdi Poursheikhali Asghari; Parviz Abdolmaleki
Journal:  Avicenna J Med Biotechnol       Date:  2019 Jan-Mar
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.