Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 More challenges for machine-learning protein interactions.

Literature DB >> 25586513

More challenges for machine-learning protein interactions.

Abstract

MOTIVATION: Machine learning may be the most popular computational tool in molecular biology. Providing sustained performance estimates is challenging. The standard cross-validation protocols usually fail in biology. Park and Marcotte found that even refined protocols fail for protein-protein interactions (PPIs).
RESULTS: Here, we sketch additional problems for the prediction of PPIs from sequence alone. First, it not only matters whether proteins A or B of a target interaction A-B are similar to proteins of training interactions (positives), but also whether A or B are similar to proteins of non-interactions (negatives). Second, training on multiple interaction partners per protein did not improve performance for new proteins (not used to train). In contrary, a strictly non-redundant training that ignored good data slightly improved the prediction of difficult cases. Third, which prediction method appears to be best crucially depends on the sequence similarity between the test and the training set, how many true interactions should be found and the expected ratio of negatives to positives. The correct assessment of performance is the most complicated task in the development of prediction methods. Our analyses suggest that PPIs square the challenge for this task.

Mesh：

Substances：

Year: 2015 PMID： 25586513 DOI： 10.1093/bioinformatics/btu857

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

15 in total

1. Protein-protein interaction sites prediction by ensemble random forests with synthetic minority oversampling technique.

Authors: Xiaoying Wang; Bin Yu; Anjun Ma; Cheng Chen; Bingqiang Liu; Qin Ma
Journal: Bioinformatics Date: 2019-07-15 Impact factor: 6.937

2. Predicted Arabidopsis Interactome Resource and Gene Set Linkage Analysis: A Transcriptomic Analysis Resource.

Authors: Heng Yao; Xiaoxuan Wang; Pengcheng Chen; Ling Hai; Kang Jin; Lixia Yao; Chuanzao Mao; Xin Chen
Journal: Plant Physiol Date: 2018-03-12 Impact factor: 8.340

Review 3. Protein function in precision medicine: deep understanding with machine learning.

Authors: Burkhard Rost; Predrag Radivojac; Yana Bromberg
Journal: FEBS Lett Date: 2016-08-06 Impact factor: 4.124

4. Biomolecular Modeling and Simulation: A Prospering Multidisciplinary Field.

Authors: Tamar Schlick; Stephanie Portillo-Ledesma; Christopher G Myers; Lauren Beljak; Justin Chen; Sami Dakhel; Daniel Darling; Sayak Ghosh; Joseph Hall; Mikaeel Jan; Emily Liang; Sera Saju; Mackenzie Vohr; Chris Wu; Yifan Xu; Eva Xue
Journal: Annu Rev Biophys Date: 2021-02-19 Impact factor: 12.981

Review 5. Progress and challenges in predicting protein interfaces.

Authors: Reyhaneh Esmaielbeiki; Konrad Krawczyk; Bernhard Knapp; Jean-Christophe Nebel; Charlotte M Deane
Journal: Brief Bioinform Date: 2015-05-13 Impact factor: 11.622

6. Regularized logistic regression with network-based pairwise interaction for biomarker identification in breast cancer.

Authors: Meng-Yun Wu; Xiao-Fei Zhang; Dao-Qing Dai; Le Ou-Yang; Yuan Zhu; Hong Yan
Journal: BMC Bioinformatics Date: 2016-02-27 Impact factor: 3.169

7. Predicting Protein-Protein Interaction Sites Using Sequence Descriptors and Site Propensity of Neighboring Amino Acids.

Authors: Tzu-Hao Kuo; Kuo-Bin Li
Journal: Int J Mol Sci Date: 2016-10-26 Impact factor: 5.923