Literature DB >> 25908206

iPPI-Esml: An ensemble classifier for identifying the interactions of proteins by incorporating their physicochemical properties and wavelet transforms into PseAAC.

Jianhua Jia1, Zi Liu2, Xuan Xiao3, Bingxiang Liu4, Kuo-Chen Chou5.   

Abstract

A cell contains thousands of proteins. Many important functions of cell are carried out through the proteins therein. Proteins rarely function alone. Most of their functions essential to life are associated with various types of protein-protein interactions (PPIs). Therefore, knowledge of PPIs is fundamental for both basic research and drug development. With the avalanche of proteins sequences generated in the postgenomic age, it is highly desired to develop computational methods for timely acquiring this kind of knowledge. Here, a new predictor, called "iPPI-Emsl", is developed. In the predictor, a protein sample is formulated by incorporating the following two types of information into the general form of PseAAC (pseudo amino acid composition): (1) the physicochemical properties derived from the constituent amino acids of a protein; and (2) the wavelet transforms derived from the numerical series along a protein chain. The operation engine to run the predictor is an ensemble classifier formed by fusing seven individual random forest engines via a voting system. It is demonstrated with the benchmark dataset from Saccharomyces cerevisiae as well as the dataset from Helicobacter pylori that the new predictor achieves remarkably higher success rates than any of the existing predictors in this area. The new predictor׳ web-server has been established at http://www.jci-bioinfo.cn/iPPI-Esml. For the convenience of most experimental scientists, we have further provided a step-by-step guide, by which users can easily get their desired results without the need to follow the complicated mathematics involved during its development.
Copyright © 2015 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Ensemble classifier; Fusion; Physicochemical properties; Pseudo amino acid composition; Random forests; Voting system; Wavelets transforms

Mesh:

Substances:

Year:  2015        PMID: 25908206     DOI: 10.1016/j.jtbi.2015.04.011

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  56 in total

1.  Prediction of Protein-Protein Interaction Sites with Machine-Learning-Based Data-Cleaning and Post-Filtering Procedures.

Authors:  Guang-Hui Liu; Hong-Bin Shen; Dong-Jun Yu
Journal:  J Membr Biol       Date:  2015-11-12       Impact factor: 1.843

2.  iRSpot-GAEnsC: identifing recombination spots via ensemble classifier and extending the concept of Chou's PseAAC to formulate DNA samples.

Authors:  Muhammad Kabir; Maqsood Hayat
Journal:  Mol Genet Genomics       Date:  2015-08-30       Impact factor: 3.291

3.  Modelling the molecular mechanism of protein-protein interactions and their inhibition: CypD-p53 case study.

Authors:  S M Fayaz; G K Rajanikant
Journal:  Mol Divers       Date:  2015-07-14       Impact factor: 2.943

4.  Protein-protein interaction sites prediction by ensemble random forests with synthetic minority oversampling technique.

Authors:  Xiaoying Wang; Bin Yu; Anjun Ma; Cheng Chen; Bingqiang Liu; Qin Ma
Journal:  Bioinformatics       Date:  2019-07-15       Impact factor: 6.937

5.  i6mA-VC: A Multi-Classifier Voting Method for the Computational Identification of DNA N6-methyladenine Sites.

Authors:  Tian Xue; Shengli Zhang; Huijuan Qiao
Journal:  Interdiscip Sci       Date:  2021-04-08       Impact factor: 2.233

6.  MULTiPly: a novel multi-layer predictor for discovering general and specific types of promoters.

Authors:  Meng Zhang; Fuyi Li; Tatiana T Marquez-Lago; André Leier; Cunshuo Fan; Chee Keong Kwoh; Kuo-Chen Chou; Jiangning Song; Cangzhi Jia
Journal:  Bioinformatics       Date:  2019-09-01       Impact factor: 6.937

Review 7.  Structural Variability in the RLR-MAVS Pathway and Sensitive Detection of Viral RNAs.

Authors:  Qiu-Xing Jiang
Journal:  Med Chem       Date:  2019       Impact factor: 2.745

8.  Classifying Multifunctional Enzymes by Incorporating Three Different Models into Chou's General Pseudo Amino Acid Composition.

Authors:  Hong-Liang Zou; Xuan Xiao
Journal:  J Membr Biol       Date:  2016-04-25       Impact factor: 1.843

9.  Comparative analysis of housekeeping and tissue-selective genes in human based on network topologies and biological properties.

Authors:  Lei Yang; Shiyuan Wang; Meng Zhou; Xiaowen Chen; Yongchun Zuo; Dianjun Sun; Yingli Lv
Journal:  Mol Genet Genomics       Date:  2016-02-20       Impact factor: 3.291

10.  Adaptive empirical pattern transformation (ADEPT) with application to walking stride segmentation.

Authors:  Marta Karas; Marcin Stra Czkiewicz; William Fadel; Jaroslaw Harezlak; Ciprian M Crainiceanu; Jacek K Urbanek
Journal:  Biostatistics       Date:  2021-04-10       Impact factor: 5.899

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.