
GADEM: a genetic algorithm guided formation of spaced dyads coupled with an EM algorithm for motif discovery.

Leping Li

Abstract

Genome-wide analyses of protein binding sites generate large amounts of data; a ChIP dataset might contain 10,000 sites. Unbiased motif discovery in such datasets is not generally feasible using current methods that employ probabilistic models. We propose an efficient method, GADEM, which combines spaced dyads and an expectation-maximization (EM) algorithm. Candidate words (four to six nucleotides) for constructing spaced dyads are prioritized by their degree of overrepresentation in the input sequence data. Spaced dyads are converted into starting position weight matrices (PWMs). GADEM then employs a genetic algorithm (GA), with an embedded EM algorithm to improve starting PWMs, to guide the evolution of a population of spaced dyads toward one whose entropy scores are more statistically significant. Spaced dyads whose entropy scores reach a pre-specified significance threshold are declared motifs. GADEM performed comparably with MEME on 500 sets of simulated "ChIP" sequences with embedded known P53 binding sites. The major advantage of GADEM over competing methods is its computational efficiency on large ChIP datasets. We applied GADEM to six genome-wide ChIP datasets. Approximately 15 to 30 motifs of various lengths were identified in each dataset. Remarkably, without any prior motif information, the expected known motif (e.g., P53 in P53 data) was identified every time. GADEM discovered motifs of various lengths (6-40 bp) and characteristics in these datasets, which contained from 0.5 to >13 million nucleotides, with run times of 5 to 96 h. GADEM can be viewed as an extension of the well-known MEME algorithm and is an efficient tool for de novo motif discovery in large-scale genome-wide data. The GADEM software is available at www.niehs.nih.gov/research/resources/software/GADEM/.
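The first steps described in the abstract (counting candidate words, joining two words into a spaced dyad, and turning the dyad into a starting PWM) can be illustrated with a minimal Python sketch. This is not the GADEM implementation: the function names, the probability of 0.7 placed on each word base, and the uniform background are assumptions made for illustration only. The score shown is plain information content, not the significance-adjusted entropy score GADEM uses, and the GA and EM steps are omitted.

```python
import math
from collections import Counter

BASES = "ACGT"


def count_words(sequences, k):
    """Count every overlapping k-mer (candidate word) in the input sequences."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def dyad_to_pwm(word1, spacer, word2, strong=0.7):
    """Convert the spaced dyad word1-N{spacer}-word2 into a starting PWM.

    Word positions put probability `strong` on the observed base and split
    the rest evenly; spacer positions are uniform. The value 0.7 is an
    illustrative assumption, not a GADEM setting.
    """
    weak = (1.0 - strong) / 3.0
    columns = []
    for base in word1:
        columns.append({b: (strong if b == base else weak) for b in BASES})
    for _ in range(spacer):
        columns.append({b: 0.25 for b in BASES})
    for base in word2:
        columns.append({b: (strong if b == base else weak) for b in BASES})
    return columns


def information_content(pwm, background=0.25):
    """Sum of per-column relative entropy (bits) against a uniform background."""
    total = 0.0
    for column in pwm:
        for p in column.values():
            if p > 0:
                total += p * math.log2(p / background)
    return total


if __name__ == "__main__":
    seqs = ["ACGTACGTTT", "GGACGTAACC"]  # toy input, not real ChIP data
    word, n = count_words(seqs, 4).most_common(1)[0]
    pwm = dyad_to_pwm(word, 3, "TGCA")
    print(word, n, len(pwm), round(information_content(pwm), 3))
```

In this sketch the spacer columns contribute nothing to the score, so any signal in a starting PWM comes from the two words; per the abstract, refining such a matrix against the data is the role of the embedded EM step.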

Year: 2009; PMID: 19193149; PMCID: PMC2756050; DOI: 10.1089/cmb.2008.16TT

Source DB: PubMed; Journal: J Comput Biol; ISSN: 1066-5277; Impact factor: 1.479


