Literature DB >> 31380766

Efficient Classification of Hot Spots and Hub Protein Interfaces by Recursive Feature Elimination and Gradient Boosting.

Xiaoli Lin, Xiaolong Zhang, Xin Xu.   

Abstract

Proteins are not isolated biological molecules, which have the specific three-dimensional structures and interact with other proteins to perform functions. A small number of residues (hot spots) in protein-protein interactions (PPIs) play the vital role in bioinformatics to influence and control of biological processes. This paper uses the boosting algorithm and gradient boosting algorithm based on two feature selection strategies to classify hot spots with three common datasets and two hub protein datasets. First, the correlation-based feature selection is used to remove the highly related features for improving accuracy of prediction. Then, the recursive feature elimination based on support vector machine (SVM-RFE) is adopted to select the optimal feature subset to improve the training performance. Finally, boosting and gradient boosting (G-boosting) methods are invoked to generate classification results. Gradient boosting is capable of obtaining an excellent model by reducing the loss function in the gradient direction to avoid overfitting. Five datasets from different protein databases are used to verify our models in the experiments. Experimental results show that our proposed classification models have the competitive performance compared with existing classification methods.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31380766     DOI: 10.1109/TCBB.2019.2931717

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  2 in total

1.  An Ensemble-Based Multiclass Classifier for Intrusion Detection Using Internet of Things.

Authors:  Deepti Rani; Nasib Singh Gill; Preeti Gulia; Jyotir Moy Chatterjee
Journal:  Comput Intell Neurosci       Date:  2022-05-20

2.  Identification of hot regions in hub protein-protein interactions by clustering and PPRA optimization.

Authors:  Xiaoli Lin; Xiaolong Zhang
Journal:  BMC Med Inform Decis Mak       Date:  2021-05-03       Impact factor: 2.796

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.