| Literature DB >> 30452960 |
Baoguang Tian1, Xue Wu1, Cheng Chen1, Wenying Qiu1, Qin Ma2, Bin Yu3.
Abstract
Research on protein-protein interactions (PPIs) not only helps to reveal the nature of life activities but also plays a driving role in understanding the mechanisms of disease activity and the development of effective drugs. The rapid development of machine learning provides new opportunities and challenges for understanding the mechanism of PPIs. It plays an important role in the field of proteomics research. In recent years, an increasing number of computational methods for predicting PPIs have been developed. This paper proposes a new method for predicting PPIs based on multi-information fusion. First, the pseudo-amino acid composition (PseAAC), auto-covariance (AC) and encoding based on grouped weight (EBGW) methods are used to extract the features of protein sequences, and the extracted three groups of feature vectors were fused. Secondly, the fused feature vectors are denoised by two-dimensional (2-D) wavelet denoising. Finally, the denoised feature vectors are input to the support vector machine (SVM) classifier to predict the PPIs. The ACC of PPIs of Helicobacter pylori (H. pylori) and Saccharomyces cerevisiae (S. cerevisiae) datasets were 95.97% and 95.55% by 5-fold cross-validation test and compared with other prediction methods. The experimental results show that the proposed multi-information fusion prediction method can effectively improve the prediction performance of PPIs. The source code and all datasets are available at https://github.com/QUST-AIBBDRC/PPIs-WDSVM/.Entities:
Keywords: Machine learning; Multi-information fusion; Protein–protein interactions; Pseudo-amino acid composition; Support vector machine; Two-dimensional wavelet denoising
Mesh:
Substances:
Year: 2018 PMID: 30452960 DOI: 10.1016/j.jtbi.2018.11.011
Source DB: PubMed Journal: J Theor Biol ISSN: 0022-5193 Impact factor: 2.691