Literature DB >> 8324630

A system for pattern matching applications on biosequences.

G Mehldau1, G Myers.   

Abstract

ANREP is a system for finding matches to patterns composed of (i) spacing constraints called 'spacers', and (ii) approximate matches to 'motifs' that are, recursively, patterns composed of 'atomic' symbols. A user specifies such patterns via a declarative, free-format and strongly typed language called A that is presented here in a tutorial style through a series of progressively more complex examples. The sample patterns are for protein and DNA sequences, the application domain for which ANREP was specifically created. ANREP provides a unified framework for almost all previously proposed biosequence patterns and extends them by providing approximate matching, a feature heretofore unavailable except for the limited case of individual sequences. The performance of ANREP is discussed and an appendix gives a concise specification of syntax and semantics. A portable C software package implementing ANREP is available via anonymous remote file transfer.

Mesh:

Year:  1993        PMID: 8324630     DOI: 10.1093/bioinformatics/9.3.299

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  11 in total

1.  HyPaLib: a database of RNAs and RNA structural elements defined by hybrid patterns.

Authors:  S Gräf; D Strothmann; S Kurtz; G Steger
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  MultiPipMaker and supporting tools: Alignments and analysis of multiple genomic DNA sequences.

Authors:  Scott Schwartz; Laura Elnitski; Mei Li; Matt Weirauch; Cathy Riemer; Arian Smit; Eric D Green; Ross C Hardison; Webb Miller
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

3.  PepPat, a pattern-based oligopeptide homology search method and the identification of a novel tachykinin-like peptide.

Authors:  Ying Jiang; Ge Gao; Gang Fang; Eric L Gustafson; Maureen Laverty; Yanbin Yin; Yong Zhang; Jingchu Luo; Jonathan R Greene; Marvin L Bayne; Joseph A Hedrick; Nicholas J Murgolo
Journal:  Mamm Genome       Date:  2003-05       Impact factor: 2.957

4.  SMOTIF: efficient structured pattern and profile motif search.

Authors:  Yongqiang Zhang; Mohammed J Zaki
Journal:  Algorithms Mol Biol       Date:  2006-11-21       Impact factor: 1.405

5.  Protein sequence similarity searches using patterns as seeds.

Authors:  Z Zhang; A A Schäffer; W Miller; T L Madden; D J Lipman; E V Koonin; S F Altschul
Journal:  Nucleic Acids Res       Date:  1998-09-01       Impact factor: 16.971

6.  Palingol: a declarative programming language to describe nucleic acids' secondary structures and to scan sequence database.

Authors:  B Billoud; M Kontic; A Viari
Journal:  Nucleic Acids Res       Date:  1996-04-15       Impact factor: 16.971

7.  PRINTS--a database of protein motif fingerprints.

Authors:  T K Attwood; M E Beck; A J Bleasby; D J Parry-Smith
Journal:  Nucleic Acids Res       Date:  1994-09       Impact factor: 16.971

8.  ModuleOrganizer: detecting modules in families of transposable elements.

Authors:  Sebastien Tempel; Christine Rousseau; Fariza Tahi; Jacques Nicolas
Journal:  BMC Bioinformatics       Date:  2010-09-22       Impact factor: 3.169

9.  SCANMOT: searching for similar sequences using a simultaneous scan of multiple sequence motifs.

Authors:  Saikat Chakrabarti; A Prem Anand; Nitin Bhardwaj; Ganesan Pugalenthi; R Sowdhamini
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

10.  PCAS--a precomputed proteome annotation database resource.

Authors:  Yong Zhang; Yanbin Yin; Yunjia Chen; Ge Gao; Peng Yu; Jingchu Luo; Ying Jiang
Journal:  BMC Genomics       Date:  2003-11-01       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.