Literature DB >> 30250057

Deep generative models of genetic variation capture the effects of mutations.

Adam J Riesselman1,2, John B Ingraham1,3, Debora S Marks4.   

Abstract

The functions of proteins and RNAs are defined by the collective interactions of many residues, and yet most statistical models of biological sequences consider sites nearly independently. Recent approaches have demonstrated benefits of including interactions to capture pairwise covariation, but leave higher-order dependencies out of reach. Here we show how it is possible to capture higher-order, context-dependent constraints in biological sequences via latent variable models with nonlinear dependencies. We found that DeepSequence ( https://github.com/debbiemarkslab/DeepSequence ), a probabilistic model for sequence families, predicted the effects of mutations across a variety of deep mutational scanning experiments substantially better than existing methods based on the same evolutionary data. The model, learned in an unsupervised manner solely on the basis of sequence information, is grounded with biologically motivated priors, reveals the latent organization of sequence families, and can be used to explore new parts of sequence space.

Entities:  

Mesh:

Year:  2018        PMID: 30250057      PMCID: PMC6693876          DOI: 10.1038/s41592-018-0138-4

Source DB:  PubMed          Journal:  Nat Methods        ISSN: 1548-7091            Impact factor:   28.547


  80 in total

1.  Optimization of therapeutic antibodies by predicting antigen specificity from antibody sequence via deep learning.

Authors:  Derek M Mason; Simon Friedensohn; Cédric R Weber; Christian Jordi; Bastian Wagner; Simon M Meng; Roy A Ehling; Lucia Bonati; Jan Dahinden; Pablo Gainza; Bruno E Correia; Sai T Reddy
Journal:  Nat Biomed Eng       Date:  2021-04-15       Impact factor: 25.671

2.  Accelerated antimicrobial discovery via deep generative models and molecular dynamics simulations.

Authors:  Kahini Wadhawan; Inkit Padhi; Sebastian Gehrmann; Payel Das; Tom Sercu; Flaviu Cipcigan; Vijil Chenthamarakshan; Hendrik Strobelt; Cicero Dos Santos; Pin-Yu Chen; Yi Yan Yang; Jeremy P K Tan; James Hedrick; Jason Crain; Aleksandra Mojsilovic
Journal:  Nat Biomed Eng       Date:  2021-03-11       Impact factor: 25.671

Review 3.  Biophysical and Mechanistic Models for Disease-Causing Protein Variants.

Authors:  Amelie Stein; Douglas M Fowler; Rasmus Hartmann-Petersen; Kresten Lindorff-Larsen
Journal:  Trends Biochem Sci       Date:  2019-01-31       Impact factor: 13.807

4.  Machine learning-assisted directed protein evolution with combinatorial libraries.

Authors:  Zachary Wu; S B Jennifer Kan; Russell D Lewis; Bruce J Wittmann; Frances H Arnold
Journal:  Proc Natl Acad Sci U S A       Date:  2019-04-12       Impact factor: 11.205

5.  Learning protein constitutive motifs from sequence data.

Authors:  Jérôme Tubiana; Simona Cocco; Rémi Monasson
Journal:  Elife       Date:  2019-03-12       Impact factor: 8.140

6.  Evolutionary coupling analysis identifies the impact of disease-associated variants at less-conserved sites.

Authors:  Donghyo Kim; Seong Kyu Han; Kwanghwan Lee; Inhae Kim; JungHo Kong; Sanguk Kim
Journal:  Nucleic Acids Res       Date:  2019-09-19       Impact factor: 16.971

Review 7.  Global Genetic Networks and the Genotype-to-Phenotype Relationship.

Authors:  Michael Costanzo; Elena Kuzmin; Jolanda van Leeuwen; Barbara Mair; Jason Moffat; Charles Boone; Brenda Andrews
Journal:  Cell       Date:  2019-03-21       Impact factor: 41.582

8.  Unified rational protein engineering with sequence-based deep representation learning.

Authors:  Ethan C Alley; Grigory Khimulya; Surojit Biswas; Mohammed AlQuraishi; George M Church
Journal:  Nat Methods       Date:  2019-10-21       Impact factor: 28.547

9.  Inferring Protein Sequence-Function Relationships with Large-Scale Positive-Unlabeled Learning.

Authors:  Hyebin Song; Bennett J Bremer; Emily C Hinds; Garvesh Raskutti; Philip A Romero
Journal:  Cell Syst       Date:  2020-11-18       Impact factor: 10.304

10.  Frustration and Direct-Coupling Analyses to Predict Formation and Function of Adeno-Associated Virus.

Authors:  Nicole N Thadani; Qin Zhou; Kiara Reyes Gamas; Susan Butler; Carlos Bueno; Nicholas P Schafer; Faruck Morcos; Peter G Wolynes; Junghae Suh
Journal:  Biophys J       Date:  2020-12-25       Impact factor: 4.033

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.