Literature DB >> 34743286

FTWSVM-SR: DNA-Binding Proteins Identification via Fuzzy Twin Support Vector Machines on Self-Representation.

Yi Zou1, Yijie Ding2, Li Peng3, Quan Zou4.   

Abstract

Due to the high cost of DNA-binding proteins (DBPs) detection, many machine learning algorithms (ML) have been utilized to large-scale process and detect DBPs. The previous methods took no count of the processing of noise samples. In this study, a fuzzy twin support vector machine (FTWSVM) is employed to detect DBPs. First, multiple types of protein sequence features are formed into kernel matrices; Then, multiple kernel learning (MKL) algorithm is utilized to linear combine multiple kernels; next, self-representation-based membership function is utilized to estimate membership value (weight) of each training sample; finally, we feed the integrated kernel matrix and membership values into the FTWSVM-SR model for training and testing. On comparison with other predictive models, FTWSVM based on SR (FTWSVM-SR) obtains the best performance of Matthew's correlation coefficient (MCC): 0.7410 and 0.5909 on two independent testing sets (PDB186 and PDB2272 datasets), respectively. The results confirm that our method can be an effective DBPs detection tool. Before the biochemical experiment, our model can screen and analyze DBPs on a large scale.
© 2021. International Association of Scientists in the Interdisciplinary Areas.

Entities:  

Keywords:  DNA-binding proteins; Fuzzy membership; Fuzzy twin support vector machine; Multiple kernel fusion; Self-representation

Mesh:

Substances:

Year:  2021        PMID: 34743286     DOI: 10.1007/s12539-021-00489-6

Source DB:  PubMed          Journal:  Interdiscip Sci        ISSN: 1867-1462            Impact factor:   2.233


  20 in total

Review 1.  Moment-based prediction of DNA-binding proteins.

Authors:  Shandar Ahmad; Akinori Sarai
Journal:  J Mol Biol       Date:  2004-07-30       Impact factor: 5.469

2.  DNA-Prot: identification of DNA binding proteins from protein sequence information using random forest.

Authors:  K Krishna Kumar; Ganesan Pugalenthi; P N Suganthan
Journal:  J Biomol Struct Dyn       Date:  2009-06

3.  MsDBP: Exploring DNA-Binding Proteins by Integrating Multiscale Sequence Information via Chou's Five-Step Rule.

Authors:  Xiuquan Du; Yanyu Diao; Heng Liu; Shuo Li
Journal:  J Proteome Res       Date:  2019-07-17       Impact factor: 4.466

4.  Identifying DNA-binding proteins based on multi-features and LASSO feature selection.

Authors:  Shengli Zhang; Fu Zhu; Qianhao Yu; Xiaoyue Zhu
Journal:  Biopolymers       Date:  2021-01-21       Impact factor: 2.505

5.  Effective DNA binding protein prediction by using key features via Chou's general PseAAC.

Authors:  Sheikh Adilina; Dewan Md Farid; Swakkhar Shatabda
Journal:  J Theor Biol       Date:  2018-10-11       Impact factor: 2.691

6.  DPP-PseAAC: A DNA-binding protein prediction model using Chou's general PseAAC.

Authors:  M Saifur Rahman; Swakkhar Shatabda; Sanjay Saha; M Kaykobad; M Sohel Rahman
Journal:  J Theor Biol       Date:  2018-05-16       Impact factor: 2.691

7.  iDNA-Prot: identification of DNA binding proteins using random forest with grey model.

Authors:  Wei-Zhong Lin; Jian-An Fang; Xuan Xiao; Kuo-Chen Chou
Journal:  PLoS One       Date:  2011-09-15       Impact factor: 3.240

8.  iDNA-Prot|dis: identifying DNA-binding proteins by incorporating amino acid distance-pairs and reduced alphabet profile into the general pseudo amino acid composition.

Authors:  Bin Liu; Jinghao Xu; Xun Lan; Ruifeng Xu; Jiyun Zhou; Xiaolong Wang; Kuo-Chen Chou
Journal:  PLoS One       Date:  2014-09-03       Impact factor: 3.240

9.  DNA binding protein identification by combining pseudo amino acid composition and profile-based protein representation.

Authors:  Bin Liu; Shanyi Wang; Xiaolong Wang
Journal:  Sci Rep       Date:  2015-10-20       Impact factor: 4.379

10.  UMAP-DBP: An Improved DNA-Binding Proteins Prediction Method Based on Uniform Manifold Approximation and Projection.

Authors:  Jinyue Wang; Shengli Zhang; Huijuan Qiao; Jiesheng Wang
Journal:  Protein J       Date:  2021-06-27       Impact factor: 2.371

View more
  3 in total

1.  Research on DNA-Binding Protein Identification Method Based on LSTM-CNN Feature Fusion.

Authors:  Weizhong Lu; Xiaoyi Chen; Yu Zhang; Hongjie Wu; Yijie Ding; Jiawei Shen; Shixuan Guan; Haiou Li
Journal:  Comput Math Methods Med       Date:  2022-06-02       Impact factor: 2.809

2.  DNAPred_Prot: Identification of DNA-Binding Proteins Using Composition- and Position-Based Features.

Authors:  Omar Barukab; Yaser Daanial Khan; Sher Afzal Khan; Kuo-Chen Chou
Journal:  Appl Bionics Biomech       Date:  2022-04-13       Impact factor: 1.664

3.  Identifying Membrane Protein Types Based on Lifelong Learning With Dynamically Scalable Networks.

Authors:  Weizhong Lu; Jiawei Shen; Yu Zhang; Hongjie Wu; Yuqing Qian; Xiaoyi Chen; Qiming Fu
Journal:  Front Genet       Date:  2022-03-14       Impact factor: 4.599

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.