Literature DB >> 24369152

Discriminative motif optimization based on perceptron training.

Ronak Y Patel1, Gary D Stormo.   

Abstract

MOTIVATION: Generating accurate transcription factor (TF) binding site motifs from data generated using the next-generation sequencing, especially ChIP-seq, is challenging. The challenge arises because a typical experiment reports a large number of sequences bound by a TF, and the length of each sequence is relatively long. Most traditional motif finders are slow in handling such enormous amount of data. To overcome this limitation, tools have been developed that compromise accuracy with speed by using heuristic discrete search strategies or limited optimization of identified seed motifs. However, such strategies may not fully use the information in input sequences to generate motifs. Such motifs often form good seeds and can be further improved with appropriate scoring functions and rapid optimization.
RESULTS: We report a tool named discriminative motif optimizer (DiMO). DiMO takes a seed motif along with a positive and a negative database and improves the motif based on a discriminative strategy. We use area under receiver-operating characteristic curve (AUC) as a measure of discriminating power of motifs and a strategy based on perceptron training that maximizes AUC rapidly in a discriminative manner. Using DiMO, on a large test set of 87 TFs from human, drosophila and yeast, we show that it is possible to significantly improve motifs identified by nine motif finders. The motifs are generated/optimized using training sets and evaluated on test sets. The AUC is improved for almost 90% of the TFs on test sets and the magnitude of increase is up to 39%.
AVAILABILITY AND IMPLEMENTATION: DiMO is available at http://stormo.wustl.edu/DiMO

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 24369152      PMCID: PMC3967114          DOI: 10.1093/bioinformatics/btt748

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  31 in total

1.  Deep and wide digging for binding motifs in ChIP-Seq data.

Authors:  I V Kulakovskiy; V A Boeva; A V Favorov; V J Makeev
Journal:  Bioinformatics       Date:  2010-08-24       Impact factor: 6.937

2.  Identification of context-dependent motifs by contrasting ChIP binding data.

Authors:  Mike J Mason; Kathrin Plath; Qing Zhou
Journal:  Bioinformatics       Date:  2010-09-23       Impact factor: 6.937

Review 3.  Determining the specificity of protein-DNA interactions.

Authors:  Gary D Stormo; Yue Zhao
Journal:  Nat Rev Genet       Date:  2010-09-28       Impact factor: 53.242

4.  An algorithm for finding protein-DNA binding sites with applications to chromatin-immunoprecipitation microarray experiments.

Authors:  X Shirley Liu; Douglas L Brutlag; Jun S Liu
Journal:  Nat Biotechnol       Date:  2002-07-08       Impact factor: 54.908

5.  DREME: motif discovery in transcription factor ChIP-seq data.

Authors:  Timothy L Bailey
Journal:  Bioinformatics       Date:  2011-05-04       Impact factor: 6.937

6.  DISCOVER: a feature-based discriminative method for motif search in complex genomes.

Authors:  Wenjie Fu; Pradipta Ray; Eric P Xing
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937

7.  DISPARE: DIScriminative PAttern REfinement for Position Weight Matrices.

Authors:  Isabelle da Piedade; Man-Hung Eric Tang; Olivier Elemento
Journal:  BMC Bioinformatics       Date:  2009-11-26       Impact factor: 3.169

8.  Seeder: discriminative seeding DNA motif discovery.

Authors:  François Fauteux; Mathieu Blanchette; Martina V Strömvik
Journal:  Bioinformatics       Date:  2008-08-21       Impact factor: 6.937

9.  Discriminative motif discovery in DNA and protein sequences using the DEME algorithm.

Authors:  Emma Redhead; Timothy L Bailey
Journal:  BMC Bioinformatics       Date:  2007-10-15       Impact factor: 3.169

10.  PhyloGibbs-MP: module prediction and discriminative motif-finding by Gibbs sampling.

Authors:  Rahul Siddharthan
Journal:  PLoS Comput Biol       Date:  2008-08-29       Impact factor: 4.475

View more
  10 in total

1.  Sharing DNA-binding information across structurally similar proteins enables accurate specificity determination.

Authors:  Joshua L Wetzel; Mona Singh
Journal:  Nucleic Acids Res       Date:  2020-01-24       Impact factor: 16.971

2.  DNA Shape Features Improve Transcription Factor Binding Site Predictions In Vivo.

Authors:  Anthony Mathelier; Beibei Xin; Tsu-Pei Chiu; Lin Yang; Remo Rohs; Wyeth W Wasserman
Journal:  Cell Syst       Date:  2016-08-18       Impact factor: 10.304

3.  Identification of Predictive Cis-Regulatory Elements Using a Discriminative Objective Function and a Dynamic Search Space.

Authors:  Rahul Karnik; Michael A Beer
Journal:  PLoS One       Date:  2015-10-14       Impact factor: 3.240

4.  SeAMotE: a method for high-throughput motif discovery in nucleic acid sequences.

Authors:  Federico Agostini; Davide Cirillo; Riccardo Delli Ponti; Gian Gaetano Tartaglia
Journal:  BMC Genomics       Date:  2014-10-23       Impact factor: 3.969

5.  Comparison of discriminative motif optimization using matrix and DNA shape-based models.

Authors:  Shuxiang Ruan; Gary D Stormo
Journal:  BMC Bioinformatics       Date:  2018-03-06       Impact factor: 3.169

6.  WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data.

Authors:  Hongbo Zhang; Lin Zhu; De-Shuang Huang
Journal:  Sci Rep       Date:  2017-06-12       Impact factor: 4.379

7.  A novel k-mer set memory (KSM) motif representation improves regulatory variant prediction.

Authors:  Yuchun Guo; Kevin Tian; Haoyang Zeng; Xiaoyun Guo; David Kenneth Gifford
Journal:  Genome Res       Date:  2018-04-13       Impact factor: 9.043

8.  A map of direct TF-DNA interactions in the human genome.

Authors:  Marius Gheorghe; Geir Kjetil Sandve; Aziz Khan; Jeanne Chèneby; Benoit Ballester; Anthony Mathelier
Journal:  Nucleic Acids Res       Date:  2019-02-28       Impact factor: 16.971

9.  BEESEM: estimation of binding energy models using HT-SELEX data.

Authors:  Shuxiang Ruan; S Joshua Swamidass; Gary D Stormo
Journal:  Bioinformatics       Date:  2017-08-01       Impact factor: 6.937

10.  Direct AUC optimization of regulatory motifs.

Authors:  Lin Zhu; Hong-Bo Zhang; De-Shuang Huang
Journal:  Bioinformatics       Date:  2017-07-15       Impact factor: 6.937

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.