Literature DB >> 34058979

A sequence-based multiple kernel model for identifying DNA-binding proteins.

Yuqing Qian1, Limin Jiang2, Yijie Ding3, Jijun Tang2, Fei Guo4.   

Abstract

BACKGROUND: DNA-Binding Proteins (DBP) plays a pivotal role in biological system. A mounting number of researchers are studying the mechanism and detection methods. To detect DBP, the tradition experimental method is time-consuming and resource-consuming. In recent years, Machine Learning methods have been used to detect DBP. However, it is difficult to adequately describe the information of proteins in predicting DNA-binding proteins. In this study, we extract six features from protein sequence and use Multiple Kernel Learning-based on Centered Kernel Alignment to integrate these features. The integrated feature is fed into Support Vector Machine to build predictive model and detect new DBP.
RESULTS: In our work, date sets of PDB1075 and PDB186 are employed to test our method. From the results, our model obtains better results (accuracy) than other existing methods on PDB1075 ([Formula: see text]) and PDB186 ([Formula: see text]), respectively.
CONCLUSION: Multiple kernel learning could fuse the complementary information between different features. Compared with existing methods, our method achieves comparable and best results on benchmark data sets.

Entities:  

Keywords:  Centered kernel alignment; DNA-binding proteins; Feature extraction; Multiple kernel learning; Support vector machine

Year:  2021        PMID: 34058979     DOI: 10.1186/s12859-020-03875-x

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  32 in total

1.  Support vector machines for predicting rRNA-, RNA-, and DNA-binding proteins from amino acid sequence.

Authors:  Yu-dong Cai; Shuo Liang Lin
Journal:  Biochim Biophys Acta       Date:  2003-05-30

Review 2.  Moment-based prediction of DNA-binding proteins.

Authors:  Shandar Ahmad; Akinori Sarai
Journal:  J Mol Biol       Date:  2004-07-30       Impact factor: 5.469

3.  Predicting rRNA-, RNA-, and DNA-binding proteins from primary structure with support vector machines.

Authors:  Xiaojing Yu; Jianping Cao; Yudong Cai; Tieliu Shi; Yixue Li
Journal:  J Theor Biol       Date:  2005-11-07       Impact factor: 2.691

Review 4.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

5.  Treatment of hyperprolactinaemia with pergolide mesylate: acute effects and preliminary evaluation of long-term treatment.

Authors:  S Franks; P M Horrocks; S S Lynch; W R Butt; D R London
Journal:  Lancet       Date:  1981-09-26       Impact factor: 79.321

6.  iDBPs: a web server for the identification of DNA binding proteins.

Authors:  Guy Nimrod; Maya Schushan; András Szilágyi; Christina Leslie; Nir Ben-Tal
Journal:  Bioinformatics       Date:  2010-01-19       Impact factor: 6.937

7.  Fetal oxygen consumption, umbilical circulation and electrocortical activity transitions in fetal lambs.

Authors:  A M Walker; J Fleming; J Smolich; R Stunden; R Horne; J Maloney
Journal:  J Dev Physiol       Date:  1984-06

8.  Kernel-based machine learning protocol for predicting DNA-binding proteins.

Authors:  Nitin Bhardwaj; Robert E Langlois; Guijun Zhao; Hui Lu
Journal:  Nucleic Acids Res       Date:  2005-11-10       Impact factor: 16.971

9.  DNA binding protein identification by combining pseudo amino acid composition and profile-based protein representation.

Authors:  Bin Liu; Shanyi Wang; Xiaolong Wang
Journal:  Sci Rep       Date:  2015-10-20       Impact factor: 4.379

10.  Improved detection of DNA-binding proteins via compression technology on PSSM information.

Authors:  Yubo Wang; Yijie Ding; Fei Guo; Leyi Wei; Jijun Tang
Journal:  PLoS One       Date:  2017-09-29       Impact factor: 3.240

View more
  1 in total

1.  DNAPred_Prot: Identification of DNA-Binding Proteins Using Composition- and Position-Based Features.

Authors:  Omar Barukab; Yaser Daanial Khan; Sher Afzal Khan; Kuo-Chen Chou
Journal:  Appl Bionics Biomech       Date:  2022-04-13       Impact factor: 1.664

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.