Literature DB >> 32669542

Large-scale DNA-based phenotypic recording and deep learning enable highly accurate sequence-function mapping.

Simon Höllerer1, Laetitia Papaxanthos1,2, Anja Cathrin Gumpinger1,2, Katrin Fischer1, Christian Beisel1, Karsten Borgwardt3,4, Yaakov Benenson5, Markus Jeschek6.   

Abstract

Predicting effects of gene regulatory elements (GREs) is a longstanding challenge in biology. Machine learning may address this, but requires large datasets linking GREs to their quantitative function. However, experimental methods to generate such datasets are either application-specific or technically complex and error-prone. Here, we introduce DNA-based phenotypic recording as a widely applicable, practicable approach to generate large-scale sequence-function datasets. We use a site-specific recombinase to directly record a GRE's effect in DNA, enabling readout of both sequence and quantitative function for extremely large GRE-sets via next-generation sequencing. We record translation kinetics of over 300,000 bacterial ribosome binding sites (RBSs) in >2.7 million sequence-function pairs in a single experiment. Further, we introduce a deep learning approach employing ensembling and uncertainty modelling that predicts RBS function with high accuracy, outperforming state-of-the-art methods. DNA-based phenotypic recording combined with deep learning represents a major advance in our ability to predict function from genetic sequence.

Entities:  

Mesh:

Year:  2020        PMID: 32669542      PMCID: PMC7363850          DOI: 10.1038/s41467-020-17222-4

Source DB:  PubMed          Journal:  Nat Commun        ISSN: 2041-1723            Impact factor:   14.919


  49 in total

Review 1.  High-throughput screening: new technology for the 21st century.

Authors:  R P Hertzberg; A J Pope
Journal:  Curr Opin Chem Biol       Date:  2000-08       Impact factor: 8.822

2.  Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.

Authors:  Babak Alipanahi; Andrew Delong; Matthew T Weirauch; Brendan J Frey
Journal:  Nat Biotechnol       Date:  2015-07-27       Impact factor: 54.908

3.  Deep learning for regulatory genomics.

Authors:  Yongjin Park; Manolis Kellis
Journal:  Nat Biotechnol       Date:  2015-08       Impact factor: 54.908

4.  Observing Biosynthetic Activity Utilizing Next Generation Sequencing and the DNA Linked Enzyme Coupled Assay.

Authors:  Markus de Raad; Cyrus Modavi; David J Sukovich; J Christopher Anderson
Journal:  ACS Chem Biol       Date:  2016-12-05       Impact factor: 5.100

5.  Precise and reliable gene expression via standard transcription and translation initiation elements.

Authors:  Vivek K Mutalik; Joao C Guimaraes; Guillaume Cambray; Colin Lam; Marc Juul Christoffersen; Quynh-Anh Mai; Andrew B Tran; Morgan Paull; Jay D Keasling; Adam P Arkin; Drew Endy
Journal:  Nat Methods       Date:  2013-03-10       Impact factor: 28.547

Review 6.  Coming of age: ten years of next-generation sequencing technologies.

Authors:  Sara Goodwin; John D McPherson; W Richard McCombie
Journal:  Nat Rev Genet       Date:  2016-05-17       Impact factor: 53.242

7.  Predicting effects of noncoding variants with deep learning-based sequence model.

Authors:  Jian Zhou; Olga G Troyanskaya
Journal:  Nat Methods       Date:  2015-08-24       Impact factor: 28.547

8.  Inferring gene regulatory logic from high-throughput measurements of thousands of systematically designed promoters.

Authors:  Eilon Sharon; Yael Kalma; Ayala Sharp; Tali Raveh-Sadka; Michal Levo; Danny Zeevi; Leeat Keren; Zohar Yakhini; Adina Weinberger; Eran Segal
Journal:  Nat Biotechnol       Date:  2012-05-20       Impact factor: 54.908

Review 9.  Large-scale de novo DNA synthesis: technologies and applications.

Authors:  Sriram Kosuri; George M Church
Journal:  Nat Methods       Date:  2014-05       Impact factor: 28.547

10.  High-resolution mapping of protein sequence-function relationships.

Authors:  Douglas M Fowler; Carlos L Araya; Sarel J Fleishman; Elizabeth H Kellogg; Jason J Stephany; David Baker; Stanley Fields
Journal:  Nat Methods       Date:  2010-08-15       Impact factor: 28.547

View more
  7 in total

Review 1.  Deep Learning Concepts and Applications for Synthetic Biology.

Authors:  William A V Beardall; Guy-Bart Stan; Mary J Dunlop
Journal:  GEN Biotechnol       Date:  2022-08-18

2.  Feasibility of predicting allele specific expression from DNA sequencing using machine learning.

Authors:  Zhenhua Zhang; Freerk van Dijk; Niek de Klein; Mariëlle E van Gijn; Lude H Franke; Richard J Sinke; Morris A Swertz; K Joeri van der Velde
Journal:  Sci Rep       Date:  2021-05-19       Impact factor: 4.379

3.  Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently.

Authors:  Douglas B Kell; Soumitra Samanta; Neil Swainston
Journal:  Biochem J       Date:  2020-12-11       Impact factor: 3.857

Review 4.  Intelligent host engineering for metabolic flux optimisation in biotechnology.

Authors:  Lachlan J Munro; Douglas B Kell
Journal:  Biochem J       Date:  2021-10-29       Impact factor: 3.857

5.  Machine Learning Guided Batched Design of a Bacterial Ribosome Binding Site.

Authors:  Mengyan Zhang; Maciej Bartosz Holowko; Huw Hayman Zumpe; Cheng Soon Ong
Journal:  ACS Synth Biol       Date:  2022-06-15       Impact factor: 5.249

Review 6.  Host-pathogen protein-nucleic acid interactions: A comprehensive review.

Authors:  Anuja Jain; Shikha Mittal; Lokesh P Tripathi; Ruth Nussinov; Shandar Ahmad
Journal:  Comput Struct Biotechnol J       Date:  2022-08-04       Impact factor: 6.155

Review 7.  Importance of the 5' regulatory region to bacterial synthetic biology applications.

Authors:  Lisa Tietze; Rahmi Lale
Journal:  Microb Biotechnol       Date:  2021-06-25       Impact factor: 5.813

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.