Literature DB >> 31751283

Imbalance Data Processing Strategy for Protein Interaction Sites Prediction.

Bing Wang, Changqing Mei, Yuanyuan Wang, Yuming Zhou, Mu-Tian Cheng, Chun-Hou Zheng, Lei Wang, Jun Zhang, Peng Chen, Yan Xiong.   

Abstract

Protein-protein interactions play essential roles in various biological progresses. Identifying protein interaction sites can facilitate researchers to understand life activities and therefore will be helpful for drug design. However, the number of experimental determined protein interaction sites is far less than that of protein sites in protein-protein interaction or protein complexes. Therefore, the negative and positive samples are usually imbalanced, which is common but bring result bias on the prediction of protein interaction sites by computational approaches. In this work, we presented three imbalance data processing strategies to reconstruct the original dataset, and then extracted protein features from the evolutionary conservation of amino acids to build a predictor for identification of protein interaction sites. On a dataset with 10,430 surface residues but only 2,299 interface residues, the imbalance dataset processing strategies can obviously reduce the prediction bias, and therefore improve the prediction performance of protein interaction sites. The experimental results show that our prediction models can achieve a better prediction performance, such as a prediction accuracy of 0.758, or a high F-measure of 0.737, which demonstrated the effectiveness of our method.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 31751283     DOI: 10.1109/TCBB.2019.2953908

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  4 in total

1.  A Deep Learning-Based Method for Identification of Bacteriophage-Host Interaction.

Authors:  Menglu Li; Yanan Wang; Fuyi Li; Yun Zhao; Mengya Liu; Sijia Zhang; Yannan Bin; A Ian Smith; Geoffrey I Webb; Jian Li; Jiangning Song; Junfeng Xia
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2021-10-07       Impact factor: 3.702

2.  Diagnosis of Cervical Cancer With Parametrial Invasion on Whole-Tumor Dynamic Contrast-Enhanced Magnetic Resonance Imaging Combined With Whole-Lesion Texture Analysis Based on T2- Weighted Images.

Authors:  Xin-Xiang Li; Ting-Ting Lin; Bin Liu; Wei Wei
Journal:  Front Bioeng Biotechnol       Date:  2020-06-11

3.  A novel lncRNA-protein interaction prediction method based on deep forest with cascade forest structure.

Authors:  Xiongfei Tian; Ling Shen; Zhenwu Wang; Liqian Zhou; Lihong Peng
Journal:  Sci Rep       Date:  2021-09-23       Impact factor: 4.379

4.  Developing Computational Model to Predict Protein-Protein Interaction Sites Based on the XGBoost Algorithm.

Authors:  Aijun Deng; Huan Zhang; Wenyan Wang; Jun Zhang; Dingdong Fan; Peng Chen; Bing Wang
Journal:  Int J Mol Sci       Date:  2020-03-25       Impact factor: 5.923

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.