Literature DB >> 17668213

Multiple pattern matching: a Markov chain approach.

Manuel E Lladser1, M D Betterton, Rob Knight.   

Abstract

RNA motifs typically consist of short, modular patterns that include base pairs formed within and between modules. Estimating the abundance of these patterns is of fundamental importance for assessing the statistical significance of matches in genomewide searches, and for predicting whether a given function has evolved many times in different species or arose from a single common ancestor. In this manuscript, we review in an integrated and self-contained manner some basic concepts of automata theory, generating functions and transfer matrix methods that are relevant to pattern analysis in biological sequences. We formalize, in a general framework, the concept of Markov chain embedding to analyze patterns in random strings produced by a memoryless source. This conceptualization, together with the capability of automata to recognize complicated patterns, allows a systematic analysis of problems related to the occurrence and frequency of patterns in random strings. The applications we present focus on the concept of synchronization of automata, as well as automata used to search for a finite number of keywords (including sets of patterns generated according to base pairing rules) in a general text.

Mesh:

Substances:

Year:  2007        PMID: 17668213     DOI: 10.1007/s00285-007-0109-3

Source DB:  PubMed          Journal:  J Math Biol        ISSN: 0303-6812            Impact factor:   2.259


  25 in total

1.  Structural diversity of self-cleaving ribozymes.

Authors:  J Tang; R R Breaker
Journal:  Proc Natl Acad Sci U S A       Date:  2000-05-23       Impact factor: 11.205

2.  The distribution of RNA motifs in natural sequences.

Authors:  V Bourdeau; G Ferbeyre; M Pageau; B Paquin; R Cedergren
Journal:  Nucleic Acids Res       Date:  1999-11-15       Impact factor: 16.971

3.  In vitro evolution suggests multiple origins for the hammerhead ribozyme.

Authors:  K Salehi-Ashtiani; J W Szostak
Journal:  Nature       Date:  2001-11-01       Impact factor: 49.962

4.  Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets.

Authors:  Benjamin P Lewis; Christopher B Burge; David P Bartel
Journal:  Cell       Date:  2005-01-14       Impact factor: 41.582

5.  Tandem repeats finder: a program to analyze DNA sequences.

Authors:  G Benson
Journal:  Nucleic Acids Res       Date:  1999-01-15       Impact factor: 16.971

6.  Accessing rare activities from random RNA sequences: the importance of the length of molecules in the starting pool.

Authors:  P C Sabeti; P J Unrau; D P Bartel
Journal:  Chem Biol       Date:  1997-10

7.  Thiamine derivatives bind messenger RNAs directly to regulate bacterial gene expression.

Authors:  Wade Winkler; Ali Nahvi; Ronald R Breaker
Journal:  Nature       Date:  2002-10-16       Impact factor: 49.962

8.  Estimation of evolutionary distances between homologous nucleotide sequences.

Authors:  M Kimura
Journal:  Proc Natl Acad Sci U S A       Date:  1981-01       Impact factor: 11.205

9.  The RNA moiety of ribonuclease P is the catalytic subunit of the enzyme.

Authors:  C Guerrier-Takada; K Gardiner; T Marsh; N Pace; S Altman
Journal:  Cell       Date:  1983-12       Impact factor: 41.582

10.  Rfam: annotating non-coding RNAs in complete genomes.

Authors:  Sam Griffiths-Jones; Simon Moxon; Mhairi Marshall; Ajay Khanna; Sean R Eddy; Alex Bateman
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

View more
  6 in total

1.  Approximation of sojourn-times via maximal couplings: motif frequency distributions.

Authors:  Manuel E Lladser; Stephen R Chestnut
Journal:  J Math Biol       Date:  2013-06-06       Impact factor: 2.259

2.  Natural and artificial RNAs occupy the same restricted region of sequence space.

Authors:  Ryan Kennedy; Manuel E Lladser; Zhiyuan Wu; Chen Zhang; Michael Yarus; Hans De Sterck; Rob Knight
Journal:  RNA       Date:  2009-12-23       Impact factor: 4.942

3.  Algorithms for hidden markov models restricted to occurrences of regular expressions.

Authors:  Paula Tataru; Andreas Sand; Asger Hobolth; Thomas Mailund; Christian N S Pedersen
Journal:  Biology (Basel)       Date:  2013-11-08

4.  Analysis of pattern overlaps and exact computation of P-values of pattern occurrences numbers: case of Hidden Markov Models.

Authors:  Mireille Régnier; Evgenia Furletova; Victor Yakovlev; Mikhail Roytberg
Journal:  Algorithms Mol Biol       Date:  2014-12-16       Impact factor: 1.405

5.  Regmex: a statistical tool for exploring motifs in ranked sequence lists from genomics experiments.

Authors:  Morten Muhlig Nielsen; Paula Tataru; Tobias Madsen; Asger Hobolth; Jakob Skou Pedersen
Journal:  Algorithms Mol Biol       Date:  2018-12-08       Impact factor: 1.405

6.  Efficient exact motif discovery.

Authors:  Tobias Marschall; Sven Rahmann
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.