Literature DB >> 33827920

Learning the molecular grammar of protein condensates from sequence determinants and embeddings.

Kadi L Saar1,2, Alexey S Morgunov1, Runzhang Qi1, William E Arter1, Georg Krainer1, Alpha A Lee2, Tuomas P J Knowles3,2.   

Abstract

Intracellular phase separation of proteins into biomolecular condensates is increasingly recognized as a process with a key role in cellular compartmentalization and regulation. Different hypotheses about the parameters that determine the tendency of proteins to form condensates have been proposed, with some of them probed experimentally through the use of constructs generated by sequence alterations. To broaden the scope of these observations, we established an in silico strategy for understanding on a global level the associations between protein sequence and phase behavior and further constructed machine-learning models for predicting protein liquid-liquid phase separation (LLPS). Our analysis highlighted that LLPS-prone proteins are more disordered, less hydrophobic, and of lower Shannon entropy than sequences in the Protein Data Bank or the Swiss-Prot database and that they show a fine balance in their relative content of polar and hydrophobic residues. To further learn in a hypothesis-free manner the sequence features underpinning LLPS, we trained a neural network-based language model and found that a classifier constructed on such embeddings learned the underlying principles of phase behavior at a comparable accuracy to a classifier that used knowledge-based features. By combining knowledge-based features with unsupervised embeddings, we generated an integrated model that distinguished LLPS-prone sequences both from structured proteins and from unstructured proteins with a lower LLPS propensity and further identified such sequences from the human proteome at a high accuracy. These results provide a platform rooted in molecular principles for understanding protein phase behavior. The predictor, termed DeePhase, is accessible from https://deephase.ch.cam.ac.uk/.
Copyright © 2021 the Author(s). Published by PNAS.

Entities:  

Keywords:  biomolecular condensates; language models; liquid–liquid phase separation; machine learning; protein biophysics

Year:  2021        PMID: 33827920     DOI: 10.1073/pnas.2019053118

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  13 in total

1.  Evolved interactions stabilize many coexisting phases in multicomponent liquids.

Authors:  David Zwicker; Liedewij Laan
Journal:  Proc Natl Acad Sci U S A       Date:  2022-07-06       Impact factor: 12.779

2.  Probing Liquid-Liquid Phase Separation of RNA-Binding Proteins In Vitro and In Vivo.

Authors:  Stephanie Heinrich; Maria Hondele
Journal:  Methods Mol Biol       Date:  2022

Review 3.  Protein conformation and biomolecular condensates.

Authors:  Diego S Vazquez; Pamela L Toledo; Alejo R Gianotti; Mario R Ermácora
Journal:  Curr Res Struct Biol       Date:  2022-09-14

4.  Enhancer-Promoter Communication: It's Not Just About Contact.

Authors:  Annabelle Wurmser; Srinjan Basu
Journal:  Front Mol Biosci       Date:  2022-04-19

5.  BIAPSS: A Comprehensive Physicochemical Analyzer of Proteins Undergoing Liquid-Liquid Phase Separation.

Authors:  Aleksandra E Badaczewska-Dawid; Vladimir N Uversky; Davit A Potoyan
Journal:  Int J Mol Sci       Date:  2022-05-31       Impact factor: 6.208

Review 6.  Liquid-liquid phase separation: a principal organizer of the cell's biochemical activity architecture.

Authors:  Jason Z Zhang; Sohum Mehta; Jin Zhang
Journal:  Trends Pharmacol Sci       Date:  2021-08-06       Impact factor: 14.819

Review 7.  Matrin3: Disorder and ALS Pathogenesis.

Authors:  Ahmed Salem; Carter J Wilson; Benjamin S Rutledge; Allison Dilliott; Sali Farhan; Wing-Yiu Choy; Martin L Duennwald
Journal:  Front Mol Biosci       Date:  2022-01-10

8.  Liquid-liquid phase separation underpins the formation of replication factories in rotaviruses.

Authors:  Florian Geiger; Julia Acker; Guido Papa; Xinyu Wang; William E Arter; Kadi L Saar; Nadia A Erkamp; Runzhang Qi; Jack Pk Bravo; Sebastian Strauss; Georg Krainer; Oscar R Burrone; Ralf Jungmann; Tuomas Pj Knowles; Hanna Engelke; Alexander Borodavka
Journal:  EMBO J       Date:  2021-09-15       Impact factor: 14.012

9.  LLPSDB v2.0: an updated database of proteins undergoing liquid-liquid phase separation in vitro.

Authors:  Xi Wang; Xiang Zhou; Qinglin Yan; Shaofeng Liao; Wenqin Tang; Peiyu Xu; Yangzhenyu Gao; Qian Li; Zhihui Dou; Weishan Yang; Beifang Huang; Jinhong Li; Zhuqing Zhang
Journal:  Bioinformatics       Date:  2022-01-13       Impact factor: 6.937

10.  Human cytomegalovirus forms phase-separated compartments at viral genomes to facilitate viral replication.

Authors:  Enrico Caragliano; Stefano Bonazza; Giada Frascaroli; Jiajia Tang; Timothy K Soh; Kay Grünewald; Jens B Bosse; Wolfram Brune
Journal:  Cell Rep       Date:  2022-03-08       Impact factor: 9.423

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.