Literature DB >> 32142788

ProNA2020 predicts protein-DNA, protein-RNA, and protein-protein binding proteins and residues from sequence.

Jiajun Qiu1, Michael Bernhofer2, Michael Heinzinger2, Sofie Kemper3, Tomas Norambuena4, Francisco Melo5, Burkhard Rost6.   

Abstract

The intricate details of how proteins bind to proteins, DNA, and RNA are crucial for the understanding of almost all biological processes. Disease-causing sequence variants often affect binding residues. Here, we described a new, comprehensive system of in silico methods that take only protein sequence as input to predict binding of protein to DNA, RNA, and other proteins. Firstly, we needed to develop several new methods to predict whether or not proteins bind (per-protein prediction). Secondly, we developed independent methods that predict which residues bind (per-residue). Not requiring three-dimensional information, the system can predict the actual binding residue. The system combined homology-based inference with machine learning and motif-based profile-kernel approaches with word-based (ProtVec) solutions to machine learning protein level predictions. This achieved an overall non-exclusive three-state accuracy of 77% ± 1% (±one standard error) corresponding to a 1.8 fold improvement over random (best classification for protein-protein with F1 = 91 ± 0.8%). Standard neural networks for per-residue binding residue predictions appeared best for DNA-binding (Q2 = 81 ± 0.9%) followed by RNA-binding (Q2 = 80 ± 1%) and worst for protein-protein binding (Q2 = 69 ± 0.8%). The new method, dubbed ProNA2020, is available as code through github (https://github.com/Rostlab/ProNA2020.git) and through PredictProtein (www.predictprotein.org).
Copyright © 2020 Elsevier Ltd. All rights reserved.

Keywords:  ProtVec; binding protein prediction; binding residue prediction; machine learning; profile kernel SVM

Year:  2020        PMID: 32142788     DOI: 10.1016/j.jmb.2020.02.026

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  9 in total

1.  RBP-TSTL is a two-stage transfer learning framework for genome-scale prediction of RNA-binding proteins.

Authors:  Xinxin Peng; Xiaoyu Wang; Yuming Guo; Zongyuan Ge; Fuyi Li; Xin Gao; Jiangning Song
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

2.  ProB-Site: Protein Binding Site Prediction Using Local Features.

Authors:  Sharzil Haris Khan; Hilal Tayara; Kil To Chong
Journal:  Cells       Date:  2022-07-05       Impact factor: 7.666

3.  Introduction to Bioinformatics Resources for Post-transcriptional Regulation of Gene Expression.

Authors:  Eliana Destefanis; Erik Dassi
Journal:  Methods Mol Biol       Date:  2022

Review 4.  Zooming in on protein-RNA interactions: a multi-level workflow to identify interaction partners.

Authors:  Alessio Colantoni; Jakob Rupert; Andrea Vandelli; Gian Gaetano Tartaglia; Elsa Zacco
Journal:  Biochem Soc Trans       Date:  2020-08-28       Impact factor: 5.407

5.  Embeddings from deep learning transfer GO annotations beyond homology.

Authors:  Maria Littmann; Michael Heinzinger; Christian Dallago; Tobias Olenyi; Burkhard Rost
Journal:  Sci Rep       Date:  2021-01-13       Impact factor: 4.379

6.  Protein embeddings and deep learning predict binding residues for various ligand classes.

Authors:  Maria Littmann; Michael Heinzinger; Christian Dallago; Konstantin Weissenow; Burkhard Rost
Journal:  Sci Rep       Date:  2021-12-13       Impact factor: 4.379

7.  Loss of Protein Function Causing Severe Phenotypes of Female-Restricted Wieacker Wolff Syndrome due to a Novel Nonsense Mutation in the ZC4H2 Gene.

Authors:  Jing-Jing Sun; Qin Cai; Miao Xu; Yan-Na Liu; Wan-Rui Li; Juan Li; Li Ma; Cheng Cai; Xiao-Hui Gong; Yi-Tao Zeng; Zhao-Rui Ren; Fanyi Zeng
Journal:  Genes (Basel)       Date:  2022-08-29       Impact factor: 4.141

Review 8.  Single-Stranded DNA Binding Proteins and Their Identification Using Machine Learning-Based Approaches.

Authors:  Jun-Tao Guo; Fareeha Malik
Journal:  Biomolecules       Date:  2022-08-26

9.  PredictProtein - Predicting Protein Structure and Function for 29 Years.

Authors:  Michael Bernhofer; Christian Dallago; Tim Karl; Venkata Satagopam; Michael Heinzinger; Maria Littmann; Tobias Olenyi; Jiajun Qiu; Konstantin Schütze; Guy Yachdav; Haim Ashkenazy; Nir Ben-Tal; Yana Bromberg; Tatyana Goldberg; Laszlo Kajan; Sean O'Donoghue; Chris Sander; Andrea Schafferhans; Avner Schlessinger; Gerrit Vriend; Milot Mirdita; Piotr Gawron; Wei Gu; Yohan Jarosz; Christophe Trefois; Martin Steinegger; Reinhard Schneider; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2021-07-02       Impact factor: 16.971

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.