Literature DB >> 26709360

motifDiverge: a model for assessing the statistical significance of gene regulatory motif divergence between two DNA sequences.

Dennis Kostka1, Tara Friedrich2, Alisha K Holloway3, Katherine S Pollard4.   

Abstract

Next-generation sequencing technology enables the identification of thousands of gene regulatory sequences in many cell types and organisms. We consider the problem of testing if two such sequences differ in their number of binding site motifs for a given transcription factor (TF) protein. Binding site motifs impart regulatory function by providing TFs the opportunity to bind to genomic elements and thereby affect the expression of nearby genes. Evolutionary changes to such functional DNA are hypothesized to be major contributors to phenotypic diversity within and between species; but despite the importance of TF motifs for gene expression, no method exists to test for motif loss or gain. Assuming that motif counts are Binomially distributed, and allowing for dependencies between motif instances in evolutionarily related sequences, we derive the probability mass function of the difference in motif counts between two nucleotide sequences. We provide a method to numerically estimate this distribution from genomic data and show through simulations that our estimator is accurate. Finally, we introduce the R package motifDiverge that implements our methodology and illustrate its application to gene regulatory enhancers identified by a mouse developmental time course experiment. While this study was motivated by analysis of regulatory motifs, our results can be applied to any problem involving two correlated Bernoulli trials.

Entities:  

Keywords:  Binomial; ChIP-seq; Gene regulation; Motif; Regulatory evolution; Testing; Transcription factor

Year:  2015        PMID: 26709360      PMCID: PMC4689439          DOI: 10.4310/SII.2015.v8.n4.a6

Source DB:  PubMed          Journal:  Stat Interface        ISSN: 1938-7989            Impact factor:   0.582


  18 in total

Review 1.  DNA binding sites: representation and discovery.

Authors:  G D Stormo
Journal:  Bioinformatics       Date:  2000-01       Impact factor: 6.937

2.  Improved models for transcription factor binding site identification using nonindependent interactions.

Authors:  Yue Zhao; Shuxiang Ruan; Manishi Pandey; Gary D Stormo
Journal:  Genetics       Date:  2012-04-13       Impact factor: 4.562

Review 3.  Characterization of enhancer function from genome-wide analyses.

Authors:  Glenn A Maston; Stephen G Landt; Michael Snyder; Michael R Green
Journal:  Annu Rev Genomics Hum Genet       Date:  2012-06-11       Impact factor: 8.929

Review 4.  RNA sequencing: advances, challenges and opportunities.

Authors:  Fatih Ozsolak; Patrice M Milos
Journal:  Nat Rev Genet       Date:  2010-12-30       Impact factor: 53.242

5.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

Review 6.  Topology of mammalian developmental enhancers and their regulatory landscapes.

Authors:  Wouter de Laat; Denis Duboule
Journal:  Nature       Date:  2013-10-24       Impact factor: 49.962

7.  FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements) isolates active regulatory elements from human chromatin.

Authors:  Paul G Giresi; Jonghwan Kim; Ryan M McDaniell; Vishwanath R Iyer; Jason D Lieb
Journal:  Genome Res       Date:  2006-12-19       Impact factor: 9.043

Review 8.  Transcriptomics in the RNA-seq era.

Authors:  Paul A McGettigan
Journal:  Curr Opin Chem Biol       Date:  2013-01-02       Impact factor: 8.822

9.  Dinucleotide weight matrices for predicting transcription factor binding sites: generalizing the position weight matrix.

Authors:  Rahul Siddharthan
Journal:  PLoS One       Date:  2010-03-22       Impact factor: 3.240

10.  Compound poisson approximation of the number of occurrences of a position frequency matrix (PFM) on both strands.

Authors:  Utz J Pape; Sven Rahmann; Fengzhu Sun; Martin Vingron
Journal:  J Comput Biol       Date:  2008 Jul-Aug       Impact factor: 1.479

View more
  2 in total

1.  OCT2 pre-positioning facilitates cell fate transition and chromatin architecture changes in humoral immunity.

Authors:  Ashley S Doane; Chi-Shuen Chu; Dafne Campigli Di Giammartino; Martín A Rivas; Johannes C Hellmuth; Yanwen Jiang; Nevin Yusufova; Alicia Alonso; Robert G Roeder; Effie Apostolou; Ari M Melnick; Olivier Elemento
Journal:  Nat Immunol       Date:  2021-09-23       Impact factor: 25.606

2.  Bat Accelerated Regions Identify a Bat Forelimb Specific Enhancer in the HoxD Locus.

Authors:  Betty M Booker; Tara Friedrich; Mandy K Mason; Julia E VanderMeer; Jingjing Zhao; Walter L Eckalbar; Malcolm Logan; Nicola Illing; Katherine S Pollard; Nadav Ahituv
Journal:  PLoS Genet       Date:  2016-03-28       Impact factor: 5.917

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.