Literature DB >> 30903136

SArKS: de novo discovery of gene expression regulatory motif sites and domains by suffix array kernel smoothing.

Dennis C Wylie1, Hans A Hofmann1,2,3,4, Boris V Zemelman2,4,5,6.   

Abstract

MOTIVATION: We set out to develop an algorithm that can mine differential gene expression data to identify candidate cell type-specific DNA regulatory sequences. Differential expression is usually quantified as a continuous score-fold-change, test-statistic, P-value-comparing biological classes. Unlike existing approaches, our de novo strategy, termed SArKS, applies non-parametric kernel smoothing to uncover promoter motif sites that correlate with elevated differential expression scores. SArKS detects motif k-mers by smoothing sequence scores over sequence similarity. A second round of smoothing over spatial proximity reveals multi-motif domains (MMDs). Discovered motif sites can then be merged or extended based on adjacency within MMDs. False positive rates are estimated and controlled by permutation testing.
RESULTS: We applied SArKS to published gene expression data representing distinct neocortical neuron classes in Mus musculus and interneuron developmental states in Homo sapiens. When benchmarked against several existing algorithms using a cross-validation procedure, SArKS identified larger motif sets that formed the basis for regression models with higher correlative power.
AVAILABILITY AND IMPLEMENTATION: https://github.com/denniscwylie/sarks. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30903136      PMCID: PMC7963082          DOI: 10.1093/bioinformatics/btz198

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  39 in total

1.  Algorithms for extracting structured motifs using a suffix tree with an application to promoter and regulatory site consensus identification.

Authors:  L Marsan; M F Sagot
Journal:  J Comput Biol       Date:  2000       Impact factor: 1.479

Review 2.  Applied bioinformatics for the identification of regulatory elements.

Authors:  Wyeth W Wasserman; Albin Sandelin
Journal:  Nat Rev Genet       Date:  2004-04       Impact factor: 53.242

3.  A discriminative model for identifying spatial cis-regulatory modules.

Authors:  Eran Segal; Roded Sharan
Journal:  J Comput Biol       Date:  2005 Jul-Aug       Impact factor: 1.479

Review 4.  Unraveling transcription regulatory networks by protein-DNA and protein-protein interaction mapping.

Authors:  Albertha J M Walhout
Journal:  Genome Res       Date:  2006-10-19       Impact factor: 9.043

5.  A universal framework for regulatory element discovery across all genomes and data types.

Authors:  Olivier Elemento; Noam Slonim; Saeed Tavazoie
Journal:  Mol Cell       Date:  2007-10-26       Impact factor: 17.970

6.  DECOD: fast and accurate discriminative DNA motif finding.

Authors:  Peter Huggins; Shan Zhong; Idit Shiff; Rachel Beckerman; Oleg Laptenko; Carol Prives; Marcel H Schulz; Itamar Simon; Ziv Bar-Joseph
Journal:  Bioinformatics       Date:  2011-07-12       Impact factor: 6.937

7.  Why transcription factor binding sites are ten nucleotides long.

Authors:  Alexander J Stewart; Sridhar Hannenhalli; Joshua B Plotkin
Journal:  Genetics       Date:  2012-08-10       Impact factor: 4.562

8.  FIMO: scanning for occurrences of a given motif.

Authors:  Charles E Grant; Timothy L Bailey; William Stafford Noble
Journal:  Bioinformatics       Date:  2011-02-16       Impact factor: 6.937

9.  STEME: efficient EM to find motifs in large data sets.

Authors:  John E Reid; Lorenz Wernisch
Journal:  Nucleic Acids Res       Date:  2011-07-23       Impact factor: 16.971

10.  Discriminative motif discovery in DNA and protein sequences using the DEME algorithm.

Authors:  Emma Redhead; Timothy L Bailey
Journal:  BMC Bioinformatics       Date:  2007-10-15       Impact factor: 3.169

View more
  1 in total

1.  Functional Access to Neuron Subclasses in Rodent and Primate Forebrain.

Authors:  Preeti Mehta; Lauren Kreeger; Dennis C Wylie; Jagruti J Pattadkal; Tara Lusignan; Matthew J Davis; Gergely F Turi; Wen-Ke Li; Matthew P Whitmire; Yuzhi Chen; Bridget L Kajs; Eyal Seidemann; Nicholas J Priebe; Attila Losonczy; Boris V Zemelman
Journal:  Cell Rep       Date:  2019-03-05       Impact factor: 9.423

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.