Deep representation learning improves prediction of LacI-mediated transcriptional repression.

Alexander S Garruss, Katherine M Collins, George M Church.

Abstract

Recent progress in DNA synthesis and sequencing technology has enabled systematic studies of protein function at a massive scale. We explore a deep mutational scanning study that measured the transcriptional repression function of 43,669 variants of the Escherichia coli LacI protein. We analyze structural and evolutionary aspects that relate to how the function of this protein is maintained, including an in-depth look at the C-terminal domain. We develop a deep neural network to predict transcriptional repression mediated by the lac repressor of Escherichia coli using experimental measurements of variant function. When measured across 10 separate training and validation splits using 5,009 single mutations of the lac repressor, our best-performing model achieved a median Pearson correlation of 0.79, exceeding any previous model. We demonstrate that deep representation learning approaches, first trained in an unsupervised manner across millions of diverse proteins, can be fine-tuned in a supervised fashion using lac repressor experimental datasets to more effectively predict a variant's effect on repression. These findings suggest a deep representation learning model may improve the prediction of other important properties of proteins.
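The evaluation protocol described in the abstract (a Pearson correlation computed on each of 10 training and validation splits, then summarized by its median) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' pipeline: the data are synthetic, the 20% validation fraction is a guess, a closed-form ridge regression stands in for the deep network, and all function names (`make_synthetic_variants`, `ridge_fit_predict`, `median_pearson_over_splits`) are hypothetical.

```python
import numpy as np


def pearson(x, y):
    """Pearson correlation coefficient between two 1-D sequences."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    xc = x - x.mean()
    yc = y - y.mean()
    return float((xc @ yc) / np.sqrt((xc @ xc) * (yc @ yc)))


def make_synthetic_variants(n_variants, n_features, seed=0):
    """Synthetic stand-in for variant features and repression scores.

    Assumption: real inputs would be learned protein representations and
    measured repression values; here both are simulated.
    """
    rng = np.random.default_rng(seed)
    features = rng.normal(size=(n_variants, n_features))
    weights = np.linspace(1.0, 2.0, n_features)
    targets = features @ weights + 0.5 * rng.normal(size=n_variants)
    return features, targets


def ridge_fit_predict(x_train, y_train, x_val, alpha=1.0):
    """Closed-form ridge regression, used only as a placeholder model."""
    offset = y_train.mean()
    gram = x_train.T @ x_train + alpha * np.eye(x_train.shape[1])
    weights = np.linalg.solve(gram, x_train.T @ (y_train - offset))
    return x_val @ weights + offset


def median_pearson_over_splits(features, targets, fit_predict,
                               n_splits=10, val_fraction=0.2, seed=0):
    """Median validation Pearson r over repeated random splits.

    Assumption: val_fraction=0.2 is illustrative; the abstract does not
    state how the splits were sized.
    """
    rng = np.random.default_rng(seed)
    n = len(targets)
    n_val = max(2, int(round(val_fraction * n)))
    scores = []
    for _ in range(n_splits):
        order = rng.permutation(n)
        val_idx, train_idx = order[:n_val], order[n_val:]
        preds = fit_predict(features[train_idx], targets[train_idx],
                            features[val_idx])
        scores.append(pearson(preds, targets[val_idx]))
    return float(np.median(scores)), scores


if __name__ == "__main__":
    x, y = make_synthetic_variants(n_variants=200, n_features=8, seed=1)
    median_r, per_split = median_pearson_over_splits(x, y, ridge_fit_predict)
    print(f"median Pearson r over {len(per_split)} splits: {median_r:.3f}")
```

Reporting the median rather than the mean makes the summary less sensitive to a single unusually easy or hard split.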

Keywords:  deep representation learning; lac repressor; machine learning

Year: 2021; PMID: 34187888; PMCID: PMC8271634; DOI: 10.1073/pnas.2022838118

Source DB: PubMed; Journal: Proc Natl Acad Sci U S A; ISSN: 0027-8424; Impact factor: 12.779


