Literature DB >> 24189096

Predicting DNA binding proteins using support vector machine with hybrid fractal features.

Xiao-Hui Niu1, Xue-Hai Hu2, Feng Shi1, Jing-Bo Xia1.   

Abstract

DNA-binding proteins play a vitally important role in many biological processes. Prediction of DNA-binding proteins from amino acid sequence is a significant but not fairly resolved scientific problem. Chaos game representation (CGR) investigates the patterns hidden in protein sequences, and visually reveals previously unknown structure. Fractal dimensions (FD) are good tools to measure sizes of complex, highly irregular geometric objects. In order to extract the intrinsic correlation with DNA-binding property from protein sequences, CGR algorithm, fractal dimension and amino acid composition are applied to formulate the numerical features of protein samples in this paper. Seven groups of features are extracted, which can be computed directly from the primary sequence, and each group is evaluated by the 10-fold cross-validation test and Jackknife test. Comparing the results of numerical experiments, the group of amino acid composition and fractal dimension (21-dimension vector) gets the best result, the average accuracy is 81.82% and average Matthew's correlation coefficient (MCC) is 0.6017. This resulting predictor is also compared with existing method DNA-Prot and shows better performances.
© 2013 The Authors. Published by Elsevier Ltd All rights reserved.

Entities:  

Keywords:  Chaos game representation; Cross validation; Fractal dimension; Protein classification

Mesh:

Substances:

Year:  2013        PMID: 24189096     DOI: 10.1016/j.jtbi.2013.10.009

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  7 in total

1.  PSSP-RFE: accurate prediction of protein structural class by recursive feature extraction from PSI-BLAST profile, physical-chemical property and functional annotations.

Authors:  Liqi Li; Xiang Cui; Sanjiu Yu; Yuan Zhang; Zhong Luo; Hua Yang; Yue Zhou; Xiaoqi Zheng
Journal:  PLoS One       Date:  2014-03-27       Impact factor: 3.240

2.  Integrating sequence and gene expression information predicts genome-wide DNA-binding proteins and suggests a cooperative mechanism.

Authors:  Shandar Ahmad; Philip Prathipati; Lokesh P Tripathi; Yi-An Chen; Ajay Arya; Yoichi Murakami; Kenji Mizuguchi
Journal:  Nucleic Acids Res       Date:  2018-01-09       Impact factor: 16.971

3.  Small-Angle Scattering and Multifractal Analysis of DNA Sequences.

Authors:  lEugen Mircea Anitas
Journal:  Int J Mol Sci       Date:  2020-06-30       Impact factor: 5.923

4.  Prediction of RNA-protein interactions using conjoint triad feature and chaos game representation.

Authors:  Hongchu Wang; Pengfei Wu
Journal:  Bioengineered       Date:  2018       Impact factor: 3.269

5.  DNA-binding protein prediction using plant specific support vector machines: validation and application of a new genome annotation tool.

Authors:  Graham B Motion; Andrew J M Howden; Edgar Huitema; Susan Jones
Journal:  Nucleic Acids Res       Date:  2015-08-24       Impact factor: 16.971

6.  UMAP-DBP: An Improved DNA-Binding Proteins Prediction Method Based on Uniform Manifold Approximation and Projection.

Authors:  Jinyue Wang; Shengli Zhang; Huijuan Qiao; Jiesheng Wang
Journal:  Protein J       Date:  2021-06-27       Impact factor: 2.371

7.  Prediction of RNA- and DNA-Binding Proteins Using Various Machine Learning Classifiers.

Authors:  Mehdi Poursheikhali Asghari; Parviz Abdolmaleki
Journal:  Avicenna J Med Biotechnol       Date:  2019 Jan-Mar
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.