Literature DB >> 25586513

More challenges for machine-learning protein interactions.

Tobias Hamp1, Burkhard Rost1.   

Abstract

MOTIVATION: Machine learning may be the most popular computational tool in molecular biology. Providing sustained performance estimates is challenging. The standard cross-validation protocols usually fail in biology. Park and Marcotte found that even refined protocols fail for protein-protein interactions (PPIs).
RESULTS: Here, we sketch additional problems for the prediction of PPIs from sequence alone. First, it not only matters whether proteins A or B of a target interaction A-B are similar to proteins of training interactions (positives), but also whether A or B are similar to proteins of non-interactions (negatives). Second, training on multiple interaction partners per protein did not improve performance for new proteins (not used to train). In contrary, a strictly non-redundant training that ignored good data slightly improved the prediction of difficult cases. Third, which prediction method appears to be best crucially depends on the sequence similarity between the test and the training set, how many true interactions should be found and the expected ratio of negatives to positives. The correct assessment of performance is the most complicated task in the development of prediction methods. Our analyses suggest that PPIs square the challenge for this task.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Substances:

Year:  2015        PMID: 25586513     DOI: 10.1093/bioinformatics/btu857

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  15 in total

1.  Protein-protein interaction sites prediction by ensemble random forests with synthetic minority oversampling technique.

Authors:  Xiaoying Wang; Bin Yu; Anjun Ma; Cheng Chen; Bingqiang Liu; Qin Ma
Journal:  Bioinformatics       Date:  2019-07-15       Impact factor: 6.937

2.  Predicted Arabidopsis Interactome Resource and Gene Set Linkage Analysis: A Transcriptomic Analysis Resource.

Authors:  Heng Yao; Xiaoxuan Wang; Pengcheng Chen; Ling Hai; Kang Jin; Lixia Yao; Chuanzao Mao; Xin Chen
Journal:  Plant Physiol       Date:  2018-03-12       Impact factor: 8.340

Review 3.  Protein function in precision medicine: deep understanding with machine learning.

Authors:  Burkhard Rost; Predrag Radivojac; Yana Bromberg
Journal:  FEBS Lett       Date:  2016-08-06       Impact factor: 4.124

4.  Biomolecular Modeling and Simulation: A Prospering Multidisciplinary Field.

Authors:  Tamar Schlick; Stephanie Portillo-Ledesma; Christopher G Myers; Lauren Beljak; Justin Chen; Sami Dakhel; Daniel Darling; Sayak Ghosh; Joseph Hall; Mikaeel Jan; Emily Liang; Sera Saju; Mackenzie Vohr; Chris Wu; Yifan Xu; Eva Xue
Journal:  Annu Rev Biophys       Date:  2021-02-19       Impact factor: 12.981

Review 5.  Progress and challenges in predicting protein interfaces.

Authors:  Reyhaneh Esmaielbeiki; Konrad Krawczyk; Bernhard Knapp; Jean-Christophe Nebel; Charlotte M Deane
Journal:  Brief Bioinform       Date:  2015-05-13       Impact factor: 11.622

6.  Regularized logistic regression with network-based pairwise interaction for biomarker identification in breast cancer.

Authors:  Meng-Yun Wu; Xiao-Fei Zhang; Dao-Qing Dai; Le Ou-Yang; Yuan Zhu; Hong Yan
Journal:  BMC Bioinformatics       Date:  2016-02-27       Impact factor: 3.169

7.  Predicting Protein-Protein Interaction Sites Using Sequence Descriptors and Site Propensity of Neighboring Amino Acids.

Authors:  Tzu-Hao Kuo; Kuo-Bin Li
Journal:  Int J Mol Sci       Date:  2016-10-26       Impact factor: 5.923

8.  Predicting protein-binding regions in RNA using nucleotide profiles and compositions.

Authors:  Daesik Choi; Byungkyu Park; Hanju Chae; Wook Lee; Kyungsook Han
Journal:  BMC Syst Biol       Date:  2017-03-14

9.  Protein-Protein Interactions Efficiently Modeled by Residue Cluster Classes.

Authors:  Albros Hermes Poot Velez; Fernando Fontove; Gabriel Del Rio
Journal:  Int J Mol Sci       Date:  2020-07-06       Impact factor: 5.923

Review 10.  Prediction of Genetic Interactions Using Machine Learning and Network Properties.

Authors:  Neel S Madhukar; Olivier Elemento; Gaurav Pandey
Journal:  Front Bioeng Biotechnol       Date:  2015-10-26
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.