Literature DB >> 28320624

Improved prediction of protein-protein interactions using novel negative samples, features, and an ensemble classifier.

Leyi Wei1, Pengwei Xing1, Jiancang Zeng2, JinXiu Chen2, Ran Su3, Fei Guo4.   

Abstract

Computational methods are employed in bioinformatics to predict protein-protein interactions (PPIs). PPIs and protein-protein non-interactions (PPNIs) display different levels of development, and the number of PPIs is considerably greater than that of PPNIs. This significant difference in the number of PPIs and PPNIs increases the cost of constructing a balanced dataset. PPIs can be classified as either physical or genetic. However, ready-made PPNI databases were proven only to have no physical interactions and were not proven to have no genetic interactions. Hence, ready-made PPNI databases contain false negative non-interactions. In this study, two PPNI datasets were artificially generated from a PPI database. In contrast to various traditional PPI feature extraction methods based on sequential information, two types of novel feature extraction methods were proposed. One is based on secondary structure information, and the other is based on the physicochemical properties of proteins. The experimental results of the RandomPairs dataset validate the efficiency and effectiveness of the proposed prediction model. These results reveal the potential of constructing a PPI negative dataset to reduce false negatives. Related datasets, tools, and source codes are accessible at http://lab.malab.cn/soft/PPIPre/PPIPre.html.
Copyright © 2017 Elsevier B.V. All rights reserved.

Keywords:  Ensemble classifier; Feature selection; Negative samples; Protein-protein interaction; Protein–protein non-interactions

Mesh:

Substances:

Year:  2017        PMID: 28320624     DOI: 10.1016/j.artmed.2017.03.001

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  63 in total

1.  Clustering and classification methods for single-cell RNA-sequencing data.

Authors:  Ren Qi; Anjun Ma; Qin Ma; Quan Zou
Journal:  Brief Bioinform       Date:  2020-07-15       Impact factor: 11.622

2.  Computational Methods for Predicting Protein-Protein Interactions Using Various Protein Features.

Authors:  Ziyun Ding; Daisuke Kihara
Journal:  Curr Protoc Protein Sci       Date:  2018-06-21

3.  sgRNACNN: identifying sgRNA on-target activity in four crops using ensembles of convolutional neural networks.

Authors:  Mengting Niu; Yuan Lin; Quan Zou
Journal:  Plant Mol Biol       Date:  2021-01-01       Impact factor: 4.076

4.  iDNA-MT: Identification DNA Modification Sites in Multiple Species by Using Multi-Task Learning Based a Neural Network Tool.

Authors:  Xiao Yang; Xiucai Ye; Xuehong Li; Lesong Wei
Journal:  Front Genet       Date:  2021-03-31       Impact factor: 4.599

5.  iSS-PC: Identifying Splicing Sites via Physical-Chemical Properties Using Deep Sparse Auto-Encoder.

Authors:  Zhao-Chun Xu; Peng Wang; Wang-Ren Qiu; Xuan Xiao
Journal:  Sci Rep       Date:  2017-08-15       Impact factor: 4.379

6.  Prediction and analysis of essential genes using the enrichments of gene ontology and KEGG pathways.

Authors:  Lei Chen; Yu-Hang Zhang; ShaoPeng Wang; YunHua Zhang; Tao Huang; Yu-Dong Cai
Journal:  PLoS One       Date:  2017-09-05       Impact factor: 3.240

Review 7.  Machine learning approaches and databases for prediction of drug-target interaction: a survey paper.

Authors:  Maryam Bagherian; Elyas Sabeti; Kai Wang; Maureen A Sartor; Zaneta Nikolovska-Coleska; Kayvan Najarian
Journal:  Brief Bioinform       Date:  2021-01-18       Impact factor: 11.622

8.  Accurate identification of RNA D modification using multiple features.

Authors:  Lijun Dou; Wenyang Zhou; Lichao Zhang; Lei Xu; Ke Han
Journal:  RNA Biol       Date:  2021-03-17       Impact factor: 4.652

9.  4mCPred-MTL: Accurate Identification of DNA 4mC Sites in Multiple Species Using Multi-Task Deep Learning Based on Multi-Head Attention Mechanism.

Authors:  Rao Zeng; Song Cheng; Minghong Liao
Journal:  Front Cell Dev Biol       Date:  2021-05-10

10.  AmPEP: Sequence-based prediction of antimicrobial peptides using distribution patterns of amino acid properties and random forest.

Authors:  Pratiti Bhadra; Jielu Yan; Jinyan Li; Simon Fong; Shirley W I Siu
Journal:  Sci Rep       Date:  2018-01-26       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.