Literature DB >> 11473011

An algorithm for finding signals of unknown length in DNA sequences.

G Pavesi1, G Mauri, G Pesole.   

Abstract

Pattern discovery in unaligned DNA sequences is a challenging problem in both computer science and molecular biology. Several different methods and techniques have been proposed so far, but in most of the cases signals in DNA sequences are very complicated and avoid detection. Exact exhaustive methods can solve the problem only for short signals with a limited number of mutations. In this work, we extend exhaustive enumeration also to longer patterns. More in detail, the basic version of algorithm presented in this paper, given as input a set of sequences and an error ratio epsilon < 1, finds all patterns that occur in at least q sequences of the set with at most epsilonm mutations, where m is the length of the pattern. The only restriction is imposed on the location of mutations along the signal. That is, a valid occurrence of a pattern can present at most [epsiloni] mismatches in the first i nucleotides, and so on. However, we show how the algorithm can be used also when no assumption can be made on the position of mutations. In this case, it is also possible to have an estimate of the probability of finding a signal according to the signal length, the error ratio, and the input parameters. Finally, we discuss some significance measures that can be used to sort the patterns output by the algorithm.

Mesh:

Substances:

Year:  2001        PMID: 11473011     DOI: 10.1093/bioinformatics/17.suppl_1.s207

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  117 in total

1.  Weeder Web: discovery of transcription factor binding sites in a set of sequences from co-regulated genes.

Authors:  Giulio Pavesi; Paolo Mereghetti; Giancarlo Mauri; Graziano Pesole
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

2.  Novel sequence-based method for identifying transcription factor binding sites in prokaryotic genomes.

Authors:  Gurmukh Sahota; Gary D Stormo
Journal:  Bioinformatics       Date:  2010-08-31       Impact factor: 6.937

3.  Bhlhb5 and Prdm8 form a repressor complex involved in neuronal circuit assembly.

Authors:  Sarah E Ross; Alejandra E McCord; Cynthia Jung; Denize Atan; Stephanie I Mok; Martin Hemberg; Tae-Kyung Kim; John Salogiannis; Linda Hu; Sonia Cohen; Yingxi Lin; Dana Harrar; Roderick R McInnes; Michael E Greenberg
Journal:  Neuron       Date:  2012-01-26       Impact factor: 17.173

Review 4.  Toward a complete in silico, multi-layered embryonic stem cell regulatory network.

Authors:  Huilei Xu; Christoph Schaniel; Ihor R Lemischka; Avi Ma'ayan
Journal:  Wiley Interdiscip Rev Syst Biol Med       Date:  2010 Nov-Dec

5.  Towards a map of cis-regulatory sequences in the human genome.

Authors:  Meng Niu; Ehsan Tabari; Pengyu Ni; Zhengchang Su
Journal:  Nucleic Acids Res       Date:  2018-06-20       Impact factor: 16.971

Review 6.  Comparative genomic reconstruction of transcriptional regulatory networks in bacteria.

Authors:  Dmitry A Rodionov
Journal:  Chem Rev       Date:  2007-07-18       Impact factor: 60.622

7.  Tmod: toolbox of motif discovery.

Authors:  Hanchang Sun; Yuan Yuan; Yibo Wu; Hui Liu; Jun S Liu; Hongwei Xie
Journal:  Bioinformatics       Date:  2009-12-10       Impact factor: 6.937

8.  Zfp206, Oct4, and Sox2 are integrated components of a transcriptional regulatory network in embryonic stem cells.

Authors:  Hong-bing Yu; Galih Kunarso; Felicia Huimei Hong; Lawrence W Stanton
Journal:  J Biol Chem       Date:  2009-09-09       Impact factor: 5.157

9.  Transcription factor and microRNA motif discovery: the Amadeus platform and a compendium of metazoan target sets.

Authors:  Chaim Linhart; Yonit Halperin; Ron Shamir
Journal:  Genome Res       Date:  2008-04-14       Impact factor: 9.043

10.  Discriminative prediction of mammalian enhancers from DNA sequence.

Authors:  Dongwon Lee; Rachel Karchin; Michael A Beer
Journal:  Genome Res       Date:  2011-08-29       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.