Mutation effect estimation on protein-protein interactions using deep contextualized representation learning.

Guangyu Zhou, Muhao Chen, Chelsea J T Ju, Zheng Wang, Jyun-Yu Jiang, Wei Wang.

Abstract

The functional impact of protein mutations is reflected in altered conformation and thermodynamics of protein-protein interactions (PPIs). The changes in two interacting proteins upon mutation are commonly quantified by computational approaches. Hence, extensive research effort has been devoted to extracting energetic or structural features of proteins, followed by statistical learning methods that estimate the effects of mutations on PPI properties. Nonetheless, such features require extensive human labor and expert knowledge to obtain, and have limited ability to reflect point mutations. We present an end-to-end deep learning framework, MuPIPR (Mutation Effects in Protein-protein Interaction PRediction Using Contextualized Representations), to estimate the effects of mutations on PPIs. MuPIPR incorporates a contextualized representation mechanism of amino acids to propagate the effects of a point mutation to surrounding amino acid representations, thereby amplifying the subtle change in a long protein sequence. On top of that, MuPIPR leverages a Siamese residual recurrent convolutional neural encoder to encode a wild-type protein pair and its mutant pair. Multi-layer perceptron regressors are applied to the protein pair representations to predict the quantifiable changes of PPI properties upon mutation. Experimental evaluations show that, with only sequence information, MuPIPR outperforms various state-of-the-art systems in estimating the changes of binding affinity on SKEMPI v1, and offers comparable performance on SKEMPI v2. MuPIPR also demonstrates state-of-the-art performance in estimating the changes of buried surface areas. The software implementation is available at https://github.com/guangyu-zhou/MuPIPR.
© The Author(s) 2019. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.
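
The abstract's point about contextualized representations can be illustrated with a toy sketch. The code below is not MuPIPR's encoder: the per-residue values, the window-averaging "context" function, the example sequence, and all function names are assumptions made up for illustration. It only shows why a context-dependent embedding spreads a single substitution over neighbouring positions, whereas a context-free embedding changes at exactly one position.

```python
from typing import List

# Toy per-residue scalar feature (hypothetical values, for illustration only).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
STATIC = {aa: float(i) for i, aa in enumerate(AMINO_ACIDS)}


def apply_point_mutation(seq: str, pos: int, new_aa: str) -> str:
    """Return seq with the residue at 0-based index pos replaced by new_aa."""
    if not 0 <= pos < len(seq):
        raise IndexError("mutation position outside the sequence")
    if new_aa not in STATIC:
        raise ValueError(f"unknown amino acid: {new_aa!r}")
    return seq[:pos] + new_aa + seq[pos + 1:]


def static_embed(seq: str) -> List[float]:
    """Context-free embedding: each residue maps to its own fixed value."""
    return [STATIC[aa] for aa in seq]


def contextual_embed(seq: str, radius: int = 2) -> List[float]:
    """Toy contextual embedding: mean of the static values in a window
    of +/- radius residues around each position (a stand-in for a learned
    contextual encoder, not the model used in the paper)."""
    base = static_embed(seq)
    out = []
    for i in range(len(base)):
        window = base[max(0, i - radius): i + radius + 1]
        out.append(sum(window) / len(window))
    return out


def changed_positions(a: List[float], b: List[float]) -> List[int]:
    """Indices at which two equal-length embeddings differ."""
    return [i for i, (x, y) in enumerate(zip(a, b)) if x != y]


# Hypothetical wild-type sequence and a single substitution (Q -> A at index 10).
wild_type = "MKTAYIAKQRQISFVKSHFSRQ"
mutant = apply_point_mutation(wild_type, 10, "A")

static_diff = changed_positions(static_embed(wild_type), static_embed(mutant))
context_diff = changed_positions(contextual_embed(wild_type), contextual_embed(mutant))

print("static embedding changes at:", static_diff)
print("contextual embedding changes at:", context_diff)
```

With a window radius of 2, the substitution alters the contextual representation at five positions centred on the mutated residue, while the context-free representation differs at one. In a Siamese setup such as the one the abstract describes, the same encoder would be applied to the wild-type and mutant inputs so that this difference is what the downstream regressor sees.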

Year:  2020        PMID: 32166223      PMCID: PMC7059401          DOI: 10.1093/nargab/lqaa015

Source DB:  PubMed          Journal:  NAR Genom Bioinform        ISSN: 2631-9268


  4 in total

1.  Diagnostic Prediction with Sequence-of-sets Representation Learning for Clinical Events.

Authors:  Tianran Zhang; Muhao Chen; Alex A T Bui
Journal:  Artif Intell Med Conf Artif Intell Med (2005-)       Date:  2020-09-26

2.  Implications of disease-related mutations at protein-protein interfaces. (Review)

Authors:  Dapeng Xiong; Dongjin Lee; Le Li; Qiuye Zhao; Haiyuan Yu
Journal:  Curr Opin Struct Biol       Date:  2021-12-24       Impact factor: 6.809

3.  Learning the protein language: Evolution, structure, and function.

Authors:  Tristan Bepler; Bonnie Berger
Journal:  Cell Syst       Date:  2021-06-16       Impact factor: 11.091

4.  Embeddings from protein language models predict conservation and variant effects.

Authors:  Céline Marquet; Michael Heinzinger; Tobias Olenyi; Christian Dallago; Kyra Erckert; Michael Bernhofer; Dmitrii Nechaev; Burkhard Rost
Journal:  Hum Genet       Date:  2021-12-30       Impact factor: 5.881