Literature DB >> 17624492

Predicting DNA-binding proteins: approached from Chou's pseudo amino acid composition and other specific sequence features.

Y Fang1, Y Guo, Y Feng, M Li.   

Abstract

DNA-binding proteins play a pivotal role in gene regulation. It is vitally important to develop an automated and efficient method for timely identification of novel DNA-binding proteins. In this study, we proposed a method based on alone the primary sequences of proteins to predict the DNA-binding proteins. DNA-binding proteins were encoded by autocross-covariance transform, pseudo-amino acid composition, dipeptide composition, respectively and also the different combinations of the three encoded methods; further, these feature matrices were applied to support vector machine classifiers to predict the DNA-binding proteins. All modules were trained and validated by the jackknife cross-validation test. Through comparing the performance of these substituted modules, the best result was obtained from pseudo-amino acid composition with the overall accuracy of 96.6% and the sensitivity of 90.7%. The results suggest that it can efficiently predict the novel DNA-binding proteins only using the primary sequences.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17624492     DOI: 10.1007/s00726-007-0568-2

Source DB:  PubMed          Journal:  Amino Acids        ISSN: 0939-4451            Impact factor:   3.520


  32 in total

1.  iRSpot-GAEnsC: identifing recombination spots via ensemble classifier and extending the concept of Chou's PseAAC to formulate DNA samples.

Authors:  Muhammad Kabir; Maqsood Hayat
Journal:  Mol Genet Genomics       Date:  2015-08-30       Impact factor: 3.291

2.  Prediction of beta-turn in protein using E-SSpred and support vector machine.

Authors:  Lirong Liu; Yaping Fang; Menglong Li; Cuicui Wang
Journal:  Protein J       Date:  2009-05       Impact factor: 2.371

Review 3.  Some illuminating remarks on molecular genetics and genomics as well as drug development.

Authors:  Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2020-01-01       Impact factor: 3.291

4.  A new method for predicting the subcellular localization of eukaryotic proteins with both single and multiple sites: Euk-mPLoc 2.0.

Authors:  Kuo-Chen Chou; Hong-Bin Shen
Journal:  PLoS One       Date:  2010-04-01       Impact factor: 3.240

5.  Prediction of DNA binding proteins using local features and long-term dependencies with primary sequences based on deep learning.

Authors:  Guobin Li; Xiuquan Du; Xinlu Li; Le Zou; Guanhong Zhang; Zhize Wu
Journal:  PeerJ       Date:  2021-05-03       Impact factor: 2.984

6.  DNA-binding protein prediction using plant specific support vector machines: validation and application of a new genome annotation tool.

Authors:  Graham B Motion; Andrew J M Howden; Edgar Huitema; Susan Jones
Journal:  Nucleic Acids Res       Date:  2015-08-24       Impact factor: 16.971

7.  Predicting secretory proteins of malaria parasite by incorporating sequence evolution information into pseudo amino acid composition via grey system model.

Authors:  Wei-Zhong Lin; Jian-An Fang; Xuan Xiao; Kuo-Chen Chou
Journal:  PLoS One       Date:  2012-11-26       Impact factor: 3.240

8.  An improved sequence based prediction protocol for DNA-binding proteins using SVM and comprehensive feature analysis.

Authors:  Chuanxin Zou; Jiayu Gong; Honglin Li
Journal:  BMC Bioinformatics       Date:  2013-03-09       Impact factor: 3.169

9.  iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition.

Authors:  Wei Chen; Peng-Mian Feng; Hao Lin; Kuo-Chen Chou
Journal:  Nucleic Acids Res       Date:  2013-01-08       Impact factor: 16.971

10.  A comparison of computational methods for identifying virulence factors.

Authors:  Lu-Lu Zheng; Yi-Xue Li; Juan Ding; Xiao-Kui Guo; Kai-Yan Feng; Ya-Jun Wang; Le-Le Hu; Yu-Dong Cai; Pei Hao; Kuo-Chen Chou
Journal:  PLoS One       Date:  2012-08-03       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.