Literature DB >> 34508359

Deciphering enhancer sequence using thermodynamics-based models and convolutional neural networks.

Payam Dibaeinia1, Saurabh Sinha1,2,3.   

Abstract

Deciphering the sequence-function relationship encoded in enhancers holds the key to interpreting non-coding variants and understanding mechanisms of transcriptomic variation. Several quantitative models exist for predicting enhancer function and underlying mechanisms; however, there has been no systematic comparison of these models characterizing their relative strengths and shortcomings. Here, we interrogated a rich data set of neuroectodermal enhancers in Drosophila, representing cis- and trans- sources of expression variation, with a suite of biophysical and machine learning models. We performed rigorous comparisons of thermodynamics-based models implementing different mechanisms of activation, repression and cooperativity. Moreover, we developed a convolutional neural network (CNN) model, called CoNSEPT, that learns enhancer 'grammar' in an unbiased manner. CoNSEPT is the first general-purpose CNN tool for predicting enhancer function in varying conditions, such as different cell types and experimental conditions, and we show that such complex models can suggest interpretable mechanisms. We found model-based evidence for mechanisms previously established for the studied system, including cooperative activation and short-range repression. The data also favored one hypothesized activation mechanism over another and suggested an intriguing role for a direct, distance-independent repression mechanism. Our modeling shows that while fundamentally different models can yield similar fits to data, they vary in their utility for mechanistic inference. CoNSEPT is freely available at: https://github.com/PayamDiba/CoNSEPT.
© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2021        PMID: 34508359      PMCID: PMC8501998          DOI: 10.1093/nar/gkab765

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  61 in total

1.  Predicting gene expression from sequence.

Authors:  Michael A Beer; Saeed Tavazoie
Journal:  Cell       Date:  2004-04-16       Impact factor: 41.582

2.  Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.

Authors:  Babak Alipanahi; Andrew Delong; Matthew T Weirauch; Brendan J Frey
Journal:  Nat Biotechnol       Date:  2015-07-27       Impact factor: 54.908

3.  Quantitatively predictable control of Drosophila transcriptional enhancers in vivo with engineered transcription factors.

Authors:  Justin Crocker; Garth R Ilsley; David L Stern
Journal:  Nat Genet       Date:  2016-02-08       Impact factor: 38.330

4.  Syntax compensates for poor binding sites to encode tissue specificity of developmental enhancers.

Authors:  Emma K Farley; Katrina M Olson; Wei Zhang; Daniel S Rokhsar; Michael S Levine
Journal:  Proc Natl Acad Sci U S A       Date:  2016-05-06       Impact factor: 11.205

5.  TWO-LAYER MATHEMATICAL MODELING OF GENE EXPRESSION: INCORPORATING DNA-LEVEL INFORMATION AND SYSTEM DYNAMICS.

Authors:  Jacqueline M Dresch; Marc A Thompson; David N Arnosti; Chichia Chiu
Journal:  SIAM J Appl Math       Date:  2013-03-01       Impact factor: 2.080

6.  Quantitative analysis of the Drosophila segmentation regulatory network using pattern generating potentials.

Authors:  Majid Kazemian; Charles Blatti; Adam Richards; Michael McCutchan; Noriko Wakabayashi-Ito; Ann S Hammonds; Susan E Celniker; Sudhir Kumar; Scot A Wolfe; Michael H Brodsky; Saurabh Sinha
Journal:  PLoS Biol       Date:  2010-08-17       Impact factor: 8.029

7.  CtBP-independent repression in the Drosophila embryo.

Authors:  Yutaka Nibu; Kate Senger; Michael Levine
Journal:  Mol Cell Biol       Date:  2003-06       Impact factor: 4.272

8.  Multiple modes of dorsal-bHLH transcriptional synergy in the Drosophila embryo.

Authors:  P Szymanski; M Levine
Journal:  EMBO J       Date:  1995-05-15       Impact factor: 11.598

9.  A genome-integrated massively parallel reporter assay reveals DNA sequence determinants of cis-regulatory activity in neural cells.

Authors:  Brett B Maricque; Joseph D Dougherty; Barak A Cohen
Journal:  Nucleic Acids Res       Date:  2017-02-28       Impact factor: 16.971

10.  Simulations of enhancer evolution provide mechanistic insights into gene regulation.

Authors:  Thyago Duque; Md Abul Hassan Samee; Majid Kazemian; Hannah N Pham; Michael H Brodsky; Saurabh Sinha
Journal:  Mol Biol Evol       Date:  2013-10-04       Impact factor: 16.240

View more
  1 in total

1.  DeepSTARR predicts enhancer activity from DNA sequence and enables the de novo design of synthetic enhancers.

Authors:  Bernardo P de Almeida; Franziska Reiter; Michaela Pagani; Alexander Stark
Journal:  Nat Genet       Date:  2022-05-12       Impact factor: 41.307

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.