Efficient Classification of Hot Spots and Hub Protein Interfaces by Recursive Feature Elimination and Gradient Boosting.

Abstract

Proteins are not isolated biological molecules: they have specific three-dimensional structures and interact with other proteins to perform their functions. A small number of residues (hot spots) in protein-protein interactions (PPIs) play a vital role in influencing and controlling biological processes. This paper uses boosting and gradient boosting algorithms, combined with two feature selection strategies, to classify hot spots on three common datasets and two hub protein datasets. First, correlation-based feature selection is used to remove highly correlated features and thereby improve prediction accuracy. Then, recursive feature elimination based on a support vector machine (SVM-RFE) is adopted to select the optimal feature subset and improve training performance. Finally, boosting and gradient boosting (G-boosting) methods are invoked to generate the classification results. Gradient boosting can obtain an excellent model by reducing the loss function along the gradient direction, which helps to avoid overfitting. Five datasets from different protein databases are used to verify the models experimentally. The results show that the proposed classification models perform competitively with existing classification methods.
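As a rough illustration of two of the steps named in the abstract, the sketch below filters out highly correlated features and then fits a small gradient-boosted model. It is not the paper's implementation: the 0.9 correlation threshold, the use of decision stumps as weak learners, the logistic loss, the learning rate, and the toy data are all assumptions made for this example, and the SVM-RFE step is omitted for brevity. Only the Python standard library is used.

```python
import math

def pearson(a, b):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(va * vb) if va > 0 and vb > 0 else 0.0

def drop_correlated(X, threshold=0.9):
    """Indices of columns kept after dropping any column whose |r| with an
    earlier kept column exceeds the (assumed) threshold."""
    cols = list(zip(*X))
    kept = []
    for j, col in enumerate(cols):
        if all(abs(pearson(col, cols[k])) <= threshold for k in kept):
            kept.append(j)
    return kept

def fit_stump(X, residuals):
    """Least-squares decision stump: (feature, threshold, left_value, right_value)."""
    best = None
    for j in range(len(X[0])):
        for t in sorted(set(row[j] for row in X))[:-1]:
            left = [r for row, r in zip(X, residuals) if row[j] <= t]
            right = [r for row, r in zip(X, residuals) if row[j] > t]
            lv, rv = sum(left) / len(left), sum(right) / len(right)
            err = sum((r - lv) ** 2 for r in left) + sum((r - rv) ** 2 for r in right)
            if best is None or err < best[0]:
                best = (err, j, t, lv, rv)
    if best is None:  # every feature is constant: fall back to predicting the mean
        m = sum(residuals) / len(residuals)
        return (0, float("inf"), m, m)
    return best[1:]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def gradient_boost(X, y, n_rounds=20, learning_rate=0.5):
    """Boost stumps on the logistic loss; y holds 0/1 labels."""
    scores = [0.0] * len(X)
    stumps = []
    for _ in range(n_rounds):
        # Negative gradient of the log-loss with respect to the score is y - p.
        residuals = [yi - sigmoid(s) for yi, s in zip(y, scores)]
        j, t, lv, rv = fit_stump(X, residuals)
        stumps.append((j, t, learning_rate * lv, learning_rate * rv))
        scores = [s + learning_rate * (lv if row[j] <= t else rv)
                  for row, s in zip(X, scores)]
    return stumps

def predict(stumps, row):
    """Class 1 if the summed stump outputs give a positive score, else class 0."""
    score = sum(lv if row[j] <= t else rv for j, t, lv, rv in stumps)
    return 1 if score > 0 else 0

# Hypothetical toy data: feature 1 is a linear copy of feature 0, feature 2 is noise.
X = [[0.1, 1.2, 5.0], [0.4, 1.8, 3.0], [0.35, 1.7, 4.0],
     [0.8, 2.6, 3.5], [0.9, 2.8, 5.5], [0.7, 2.4, 4.5]]
y = [0, 0, 0, 1, 1, 1]

kept = drop_correlated(X)                      # feature 1 is dropped as redundant
X_reduced = [[row[j] for j in kept] for row in X]
model = gradient_boost(X_reduced, y)
predictions = [predict(model, row) for row in X_reduced]
```

Each boosting round fits a stump to the negative gradient of the loss at the current scores, which is the sense in which the model is improved "in the gradient direction". In practice a tested library implementation would normally be preferred over a hand-written one like this.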

Entities: Chemical

Substances: Proteins

Year: 2019 PMID: 31380766 DOI: 10.1109/TCBB.2019.2931717

Source DB: PubMed Journal: IEEE/ACM Trans Comput Biol Bioinform ISSN: 1545-5963 Impact factor: 3.710

Cited

2 in total

1. An Ensemble-Based Multiclass Classifier for Intrusion Detection Using Internet of Things.

Authors: Deepti Rani; Nasib Singh Gill; Preeti Gulia; Jyotir Moy Chatterjee
Journal: Comput Intell Neurosci Date: 2022-05-20

2. Identification of hot regions in hub protein-protein interactions by clustering and PPRA optimization.

Authors: Xiaoli Lin; Xiaolong Zhang
Journal: BMC Med Inform Decis Mak Date: 2021-05-03 Impact factor: 2.796
