Literature DB >> 33760053

STREME: Accurate and versatile sequence motif discovery.

Timothy L Bailey1.   

Abstract

MOTIVATION: Sequence motif discovery algorithms can identify novel sequence patterns that perform biological functions in DNA, RNA and protein sequences-for example, the binding site motifs of DNA-and RNA-binding proteins.
RESULTS: The STREME algorithm presented here advances the state-of-the-art in ab initio motif discovery in terms of both accuracy and versatility. Using in vivo DNA (ChIP-seq) and RNA (CLIP-seq) data, and validating motifs with reference motifs derived from in vitro data, we show that STREME is more accurate, sensitive and thorough than several widely used algorithms (DREME, HOMER, MEME, Peak-motifs) and two other representative algorithms (ProSampler and Weeder). STREME's capabilities include the ability to find motifs in datasets with hundreds of thousands of sequences, to find both short and long motifs (from 3 to 30 positions), to perform differential motif discovery in pairs of sequence datasets, and to find motifs in sequences over virtually any alphabet (DNA, RNA, protein and user-defined alphabets). Unlike most motif discovery algorithms, STREME reports a useful estimate of the statistical significance of each motif it discovers. STREME is easy to use individually via its web server or via the command line, and is completely integrated with the widely-used MEME Suite of sequence analysis tools. The name STREME stands for "Simple, Thorough, Rapid, Enriched Motif Elicitation". AVAILABILITY: The STREME web server and source code are provided freely for non-commercial use at http://meme-suite.org.
© The Author(s) (2021). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Year:  2021        PMID: 33760053      PMCID: PMC8479671          DOI: 10.1093/bioinformatics/btab203

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  17 in total

Review 1.  DNA binding sites: representation and discovery.

Authors:  G D Stormo
Journal:  Bioinformatics       Date:  2000-01       Impact factor: 6.937

2.  Computing the P-value of the information content from an alignment of multiple sequences.

Authors:  Niranjan Nagarajan; Neil Jones; Uri Keich
Journal:  Bioinformatics       Date:  2005-06       Impact factor: 6.937

3.  Sequence logos: a new way to display consensus sequences.

Authors:  T D Schneider; R M Stephens
Journal:  Nucleic Acids Res       Date:  1990-10-25       Impact factor: 16.971

4.  DNA-binding specificities of human transcription factors.

Authors:  Arttu Jolma; Jian Yan; Thomas Whitington; Jarkko Toivonen; Kazuhiro R Nitta; Pasi Rastas; Ekaterina Morgunova; Martin Enge; Mikko Taipale; Gonghong Wei; Kimmo Palin; Juan M Vaquerizas; Renaud Vincentelli; Nicholas M Luscombe; Timothy R Hughes; Patrick Lemaire; Esko Ukkonen; Teemu Kivioja; Jussi Taipale
Journal:  Cell       Date:  2013-01-17       Impact factor: 41.582

5.  Probability plotting methods for the analysis of data.

Authors:  M B Wilk; R Gnanadesikan
Journal:  Biometrika       Date:  1968-03       Impact factor: 2.445

6.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities.

Authors:  Sven Heinz; Christopher Benner; Nathanael Spann; Eric Bertolino; Yin C Lin; Peter Laslo; Jason X Cheng; Cornelis Murre; Harinder Singh; Christopher K Glass
Journal:  Mol Cell       Date:  2010-05-28       Impact factor: 17.970

7.  Versatile and open software for comparing large genomes.

Authors:  Stefan Kurtz; Adam Phillippy; Arthur L Delcher; Michael Smoot; Martin Shumway; Corina Antonescu; Steven L Salzberg
Journal:  Genome Biol       Date:  2004-01-30       Impact factor: 13.583

8.  MEME-ChIP: motif analysis of large DNA datasets.

Authors:  Philip Machanick; Timothy L Bailey
Journal:  Bioinformatics       Date:  2011-04-12       Impact factor: 6.937

9.  DREME: motif discovery in transcription factor ChIP-seq data.

Authors:  Timothy L Bailey
Journal:  Bioinformatics       Date:  2011-05-04       Impact factor: 6.937

10.  Robust transcriptome-wide discovery of RNA-binding protein binding sites with enhanced CLIP (eCLIP).

Authors:  Eric L Van Nostrand; Gabriel A Pratt; Alexander A Shishkin; Chelsea Gelboin-Burkhart; Mark Y Fang; Balaji Sundararaman; Steven M Blue; Thai B Nguyen; Christine Surka; Keri Elkins; Rebecca Stanton; Frank Rigo; Mitchell Guttman; Gene W Yeo
Journal:  Nat Methods       Date:  2016-03-28       Impact factor: 28.547

View more
  46 in total

1.  Ultra-high-diversity factorizable libraries for efficient therapeutic discovery.

Authors:  Zheng Dai; Sachit D Saksena; Geraldine Horny; Christine Banholzer; Stefan Ewert; David K Gifford
Journal:  Genome Res       Date:  2022-06-23       Impact factor: 9.438

2.  NetTIME: a multitask and base-pair resolution framework for improved transcription factor binding site prediction.

Authors:  Ren Yi; Kyunghyun Cho; Richard Bonneau
Journal:  Bioinformatics       Date:  2022-10-14       Impact factor: 6.931

3.  Integrative mRNA and Long Noncoding RNA Analysis Reveals the Regulatory Network of Floral Bud Induction in Longan (Dimocarpus longan Lour.).

Authors:  Fan Liang; Yiyong Zhang; Xiaodan Wang; Shuo Yang; Ting Fang; Shaoquan Zheng; Lihui Zeng
Journal:  Front Plant Sci       Date:  2022-06-14       Impact factor: 6.627

4.  Prediction and Experimental Validation of a New Salinity-Responsive Cis-Regulatory Element (CRE) in a Tilapia Cell Line.

Authors:  Chanhee Kim; Xiaodan Wang; Dietmar Kültz
Journal:  Life (Basel)       Date:  2022-05-25

5.  Sex-specific variation in R-loop formation in Drosophila melanogaster.

Authors:  Timothy J Stanek; Weihuan Cao; Rohan M Mehra; Christopher E Ellison
Journal:  PLoS Genet       Date:  2022-06-10       Impact factor: 6.020

6.  Distinct Roles of NANOS1 and NANOS3 in the Cell Cycle and NANOS3-PUM1-FOXM1 Axis to Control G2/M Phase in a Human Primordial Germ Cell Model.

Authors:  Erkut Ilaslan; Krystyna Kwiatkowska; Maciej Jerzy Smialek; Marcin Piotr Sajek; Zaneta Lemanska; Matisa Alla; Damian Mikolaj Janecki; Jadwiga Jaruzelska; Kamila Kusz-Zamelczyk
Journal:  Int J Mol Sci       Date:  2022-06-13       Impact factor: 6.208

7.  Ribosome profiling elucidates differential gene expression in bundle sheath and mesophyll cells in maize.

Authors:  Prakitchai Chotewutmontri; Alice Barkan
Journal:  Plant Physiol       Date:  2021-09-04       Impact factor: 8.005

8.  Phage Annotation Guide: Guidelines for Assembly and High-Quality Annotation.

Authors:  Dann Turner; Evelien M Adriaenssens; Igor Tolstoy; Andrew M Kropinski
Journal:  Phage (New Rochelle)       Date:  2021-12-16

9.  Systematic Screening of Penetratin's Protein Targets by Yeast Proteome Microarrays.

Authors:  Pramod Shah; Chien-Sheng Chen
Journal:  Int J Mol Sci       Date:  2022-01-10       Impact factor: 5.923

10.  High quality genome assembly of the amitochondriate eukaryote Monocercomonoides exilis.

Authors:  Sebastian Cristian Treitli; Priscila Peña-Diaz; Paweł Hałakuc; Anna Karnkowska; Vladimír Hampl
Journal:  Microb Genom       Date:  2021-12
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.